0% found this document useful (0 votes)

13 views4 pages

Dsbda 5

The document outlines a Python script for building a logistic regression model using a dataset of social network ads. It includes steps for data loading, preprocessing, model training, and evaluation, achieving an accuracy of 89%. Additional performance metrics such as precision and recall are also calculated and displayed.

Uploaded by

Manasi Deshmukh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views4 pages

Dsbda 5

Uploaded by

Manasi Deshmukh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 4

# Import necessary libraries

import pandas as pd #pandas is used for data

manipulation,
from sklearn.model_selection import train_test_split # for
splitting the dataset
from sklearn.preprocessing import StandardScaler #StandardScaler
for feature scaling
from sklearn.linear_model import LogisticRegression #for logistic
regression modeling, and accuracy_score
from sklearn.metrics import accuracy_score, classification_report,
confusion_matrix #classification_report, and confusion_matrix for
evaluating the model

# Load the dataset

url = ('C:\\Users\\rashi\\OneDrive\\Desktop\\DSBD PRACTICAL\\Practical
5\\Social_Network_Ads.csv')
dataset = pd.read_csv(url)

# Display the first few rows of the dataset to understand its

structure
print(dataset.head())

User ID Gender Age EstimatedSalary Purchased

0 15624510 Male 19 19000 0
1 15810944 Male 35 20000 0
2 15668575 Female 26 43000 0
3 15603246 Female 27 57000 0
4 15804002 Male 19 76000 0

# Define features and target variable

X = dataset.iloc[:, [2, 3]].values # Assuming columns 2 and 3 are the
relevant features
y = dataset.iloc[:, 4].values # Assuming column 4 is the target
variable

# Split the dataset into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y,
test_size=0.25, random_state=0)

# Feature scaling (optional, but can improve convergence speed)

sc = StandardScaler() # creates an instance of the
StandardScaler class.
X_train = sc.fit_transform(X_train) #fits the StandardScaler on
the training data (X_train)
X_test = sc.transform(X_test)

# Initialize the logistic regression model

classifier = LogisticRegression(random_state=0)

# Fit the model to the training data

classifier.fit(X_train, y_train)
LogisticRegression(random_state=0)

# Make predictions on the test set

y_pred = classifier.predict(X_test)

# Evaluate the performance of the model

accuracy = accuracy_score(y_test, y_pred)
conf_matrix = confusion_matrix(y_test, y_pred)
classification_rep = classification_report(y_test, y_pred)

print(f'Accuracy: {accuracy}')
print(f'Confusion Matrix:\n{conf_matrix}')
print(f'Classification Report:\n{classification_rep}')

Accuracy: 0.89
Confusion Matrix:
[[65 3]
[ 8 24]]
Classification Report:
precision recall f1-score support

0 0.89 0.96 0.92 68

1 0.89 0.75 0.81 32

accuracy 0.89 100

macro avg 0.89 0.85 0.87 100
weighted avg 0.89 0.89 0.89 100

# Combine the actual labels and predicted labels into a DataFrame for
comparison
results_df = pd.DataFrame({'Actual': y_test, 'Predicted': y_pred})

# Print the DataFrame to see the actual and predicted labels side by
side
print("\nActual vs Predicted Labels:")
print(results_df)

Actual vs Predicted Labels:

Actual Predicted
0 0 0
1 0 0
2 0 0
3 0 0
4 0 0
.. ... ...
95 1 0
96 0 0
97 1 0
98 1 1
99 1 1

[100 rows x 2 columns]

correctly_classified_samples = results_df[results_df['Actual'] ==
results_df['Predicted']].head(10)
print("\nFirst 10 Samples with Correct Classification:")
print(correctly_classified_samples)

First 10 Samples with Correct Classification:

Actual Predicted
0 0 0
1 0 0
2 0 0
3 0 0
4 0 0
5 0 0
6 0 0
7 1 1
8 0 0
10 0 0

# Compute additional performance metrics

TP = conf_matrix[1, 1] # True Positives
TN = conf_matrix[0, 0] # True Negatives
FP = conf_matrix[0, 1] # False Positives
FN = conf_matrix[1, 0] # False Negatives

# Metrics calculations
accuracy = (TP + TN) / (TP + TN + FP + FN)
error_rate = (FP + FN) / (TP + TN + FP + FN)
precision = TP / (TP + FP)
recall = TP / (TP + FN)

# Print additional performance metrics

print(f'\nTrue Positives (TP): {TP}')
print(f'True Negatives (TN): {TN}')
print(f'False Positives (FP): {FP}')
print(f'False Negatives (FN): {FN}')
print(f'Accuracy: {accuracy}')
print(f'Error Rate: {error_rate}')
print(f'Precision: {precision}')
print(f'Recall: {recall}')

True Positives (TP): 24

True Negatives (TN): 65
False Positives (FP): 3
False Negatives (FN): 8
Accuracy: 0.89
Error Rate: 0.11
Precision: 0.8888888888888888
Recall: 0.75

Ombc 106 Research Methodologies J 22
100% (1)
Ombc 106 Research Methodologies J 22
27 pages
How To Prepare Statistics For SSC CGL Tier II Study Notes in PDF
No ratings yet
How To Prepare Statistics For SSC CGL Tier II Study Notes in PDF
10 pages
Ebs 351 - Statistics and Probability II
No ratings yet
Ebs 351 - Statistics and Probability II
7 pages
Statistics in Assessment of Learning
No ratings yet
Statistics in Assessment of Learning
11 pages
Assessing The Validity and Reliability of Diagnostic and Screening Tests
67% (3)
Assessing The Validity and Reliability of Diagnostic and Screening Tests
38 pages
Machine Learning Hands-On
100% (1)
Machine Learning Hands-On
18 pages
Module 8 Inferential Statistics NonParametric Test
No ratings yet
Module 8 Inferential Statistics NonParametric Test
112 pages
CP4252 Machine Learning Lab Manual
No ratings yet
CP4252 Machine Learning Lab Manual
26 pages
21CSC305P ML - Lab Programs 1 - 9
No ratings yet
21CSC305P ML - Lab Programs 1 - 9
36 pages
Classification
No ratings yet
Classification
3 pages
Lampiran Anova
No ratings yet
Lampiran Anova
12 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
22 pages
Validation of Analytical Methods Using A Regression Procedure
No ratings yet
Validation of Analytical Methods Using A Regression Procedure
4 pages
Class 03 04 Confidence Interval, Hypothesis Testing
No ratings yet
Class 03 04 Confidence Interval, Hypothesis Testing
87 pages
Tesis Magister Manajemen Pemasaran
83% (6)
Tesis Magister Manajemen Pemasaran
16 pages
Statistics and Optimization Techniques 2021 Q PAPER
100% (1)
Statistics and Optimization Techniques 2021 Q PAPER
3 pages
Multiple Linear Regression Case
0% (1)
Multiple Linear Regression Case
7 pages
22K61A0654 2 Sasi Auto
No ratings yet
22K61A0654 2 Sasi Auto
24 pages
Machine Learning Lab Manual 06
100% (1)
Machine Learning Lab Manual 06
8 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
23 pages
PCA - Colab
No ratings yet
PCA - Colab
2 pages
Friedman's Two-Way Analysis of Variance by Ranks
No ratings yet
Friedman's Two-Way Analysis of Variance by Ranks
5 pages
DataAnalytics Lab Manual
No ratings yet
DataAnalytics Lab Manual
35 pages
CH 11
No ratings yet
CH 11
111 pages
ML Lab Record - 250625 - 105014
No ratings yet
ML Lab Record - 250625 - 105014
29 pages
Analisis Data
No ratings yet
Analisis Data
72 pages
About The Dataset - Car Evaluation Dataset (UCI Machine Learning Repository
No ratings yet
About The Dataset - Car Evaluation Dataset (UCI Machine Learning Repository
5 pages
Bacdeaf 23032025 115708 Split 1
No ratings yet
Bacdeaf 23032025 115708 Split 1
37 pages
Da Lab Mannual
No ratings yet
Da Lab Mannual
25 pages
ML Lab Manual
No ratings yet
ML Lab Manual
36 pages
Week 7 and 8
No ratings yet
Week 7 and 8
32 pages
ML Full For Print New 1
No ratings yet
ML Full For Print New 1
38 pages
Ritesh Mangla ML PracticalFile
No ratings yet
Ritesh Mangla ML PracticalFile
55 pages
M.E Machine Learning - CP4252 Lab Manual4716718074353656238
No ratings yet
M.E Machine Learning - CP4252 Lab Manual4716718074353656238
26 pages
Performance Measures
No ratings yet
Performance Measures
25 pages
Central Tendency
No ratings yet
Central Tendency
105 pages
ML New Record
No ratings yet
ML New Record
51 pages
BCS301.Module 5
No ratings yet
BCS301.Module 5
43 pages
ML Manual Final
No ratings yet
ML Manual Final
35 pages
DSBDA Practicals
No ratings yet
DSBDA Practicals
16 pages
Openlab 1
No ratings yet
Openlab 1
17 pages
PAL Codes
No ratings yet
PAL Codes
18 pages
Fertilizer and Travancore Limited
No ratings yet
Fertilizer and Travancore Limited
25 pages
DS Food
No ratings yet
DS Food
23 pages
Data Science Record - 05
No ratings yet
Data Science Record - 05
20 pages
Train
No ratings yet
Train
17 pages
ML Record
No ratings yet
ML Record
23 pages
Question 1 The Given Dataset Can Be Visualized As Follows
No ratings yet
Question 1 The Given Dataset Can Be Visualized As Follows
13 pages
ML Manual
No ratings yet
ML Manual
24 pages
Zerox Ready
No ratings yet
Zerox Ready
21 pages
ML Lab Programs For Exam
No ratings yet
ML Lab Programs For Exam
10 pages
Dsbda 3a
No ratings yet
Dsbda 3a
11 pages
First Order Logic Syntax Semantics
No ratings yet
First Order Logic Syntax Semantics
8 pages
ML Lap
No ratings yet
ML Lap
23 pages
CCD - Ipynb - Colab
No ratings yet
CCD - Ipynb - Colab
6 pages
ML Internal Answers
No ratings yet
ML Internal Answers
9 pages
ML 6 7 8
No ratings yet
ML 6 7 8
10 pages
حجم الاثر الاختبارات-غير-المعلمية
No ratings yet
حجم الاثر الاختبارات-غير-المعلمية
22 pages
Detect Fake Profiles in Online Social Networks Using Support Vector Machine
No ratings yet
Detect Fake Profiles in Online Social Networks Using Support Vector Machine
8 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
47 pages
Dsbda 3B
No ratings yet
Dsbda 3B
5 pages
G 203008076 - 4 - Christhian Quiñonez - Ex1 - 2 A PDF
No ratings yet
G 203008076 - 4 - Christhian Quiñonez - Ex1 - 2 A PDF
20 pages
Logistic Regression
No ratings yet
Logistic Regression
8 pages
ADS Expt5 BE9 29
No ratings yet
ADS Expt5 BE9 29
3 pages
Deep Learning Assignments
No ratings yet
Deep Learning Assignments
6 pages
Those Who Do Not Remember The Past Are Condemned To Repeat It George Santayana Spanish Philosopher, Poet and Novelist (1863-1952)
No ratings yet
Those Who Do Not Remember The Past Are Condemned To Repeat It George Santayana Spanish Philosopher, Poet and Novelist (1863-1952)
32 pages
Da 012307
No ratings yet
Da 012307
8 pages
Machine Learning
No ratings yet
Machine Learning
10 pages
Ass1 DSBDA Writeup
No ratings yet
Ass1 DSBDA Writeup
8 pages
Aquif Ibrar 1212
No ratings yet
Aquif Ibrar 1212
9 pages
Model Evaluation - II
No ratings yet
Model Evaluation - II
12 pages
ML Lab Programs
No ratings yet
ML Lab Programs
9 pages
Data SPSS Kak Ela Persen Inhibisi
No ratings yet
Data SPSS Kak Ela Persen Inhibisi
11 pages
Data Analytics
No ratings yet
Data Analytics
10 pages
Dsbda 3B
No ratings yet
Dsbda 3B
5 pages
Relative Bias Assessment of D4815 As An Alternative To D5599 For Determination of Ethanol in Gasoline
No ratings yet
Relative Bias Assessment of D4815 As An Alternative To D5599 For Determination of Ethanol in Gasoline
38 pages
FYMCA IDSLab A6 Submission
No ratings yet
FYMCA IDSLab A6 Submission
9 pages
Tutorial 14 Confidence Interval (Mean) - SOLUTIONS
No ratings yet
Tutorial 14 Confidence Interval (Mean) - SOLUTIONS
5 pages
Datascience PR 6 Veda
No ratings yet
Datascience PR 6 Veda
6 pages
Assignment 2: Hive
No ratings yet
Assignment 2: Hive
11 pages
B-56 Sanket Jambhulkar MLA-3
No ratings yet
B-56 Sanket Jambhulkar MLA-3
7 pages
Data Analytcs 2
No ratings yet
Data Analytcs 2
2 pages
Assignment 9
No ratings yet
Assignment 9
2 pages
Dsbda Prac1
No ratings yet
Dsbda Prac1
1 page
DSBDA Prac2
No ratings yet
DSBDA Prac2
2 pages
Linear Regression and Anova
No ratings yet
Linear Regression and Anova
11 pages
Practice Final Exam, STATS 401 W18
No ratings yet
Practice Final Exam, STATS 401 W18
9 pages
Pract5 1
No ratings yet
Pract5 1
3 pages
Data Analysis in Python-3
No ratings yet
Data Analysis in Python-3
4 pages
End of Job: 18 Command Lines 1 Errors 1 Warnings 1 CPU Seconds
No ratings yet
End of Job: 18 Command Lines 1 Errors 1 Warnings 1 CPU Seconds
4 pages
Cfa - R4
No ratings yet
Cfa - R4
1 page
Six Sigma Green Belt Roadmap - Lynda
No ratings yet
Six Sigma Green Belt Roadmap - Lynda
4 pages
Machine Learning Lab Assignment CSE-716: S. M. Shafkat Raihan ID: 16701041 SESSION: 2015-16
No ratings yet
Machine Learning Lab Assignment CSE-716: S. M. Shafkat Raihan ID: 16701041 SESSION: 2015-16
9 pages
MJC/2011 JC2 Preliminary Exam Paper 2/9740
No ratings yet
MJC/2011 JC2 Preliminary Exam Paper 2/9740
4 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
MCS-011: Problem Solving and Programming
From Everand
MCS-011: Problem Solving and Programming
Dr. DK Sukhani
No ratings yet