PCA - Colab

Uploaded by Dina Bardakji

12/14/24, 9:13 PM  Untitled3.ipynb - Colab

# Import necessary libraries
import numpy as np
import pandas as pd
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score
import os

# Define the file path
file_path = '/creditcard.csv'

# Check if the file exists
if os.path.isfile(file_path):
    print("File found, loading the dataset.")
    data = pd.read_csv(file_path)  # Load dataset
    print("First few rows of the dataset:")
    print(data.head())  # Display the first few rows
else:
    print("Error: File not found. Generating mock data.")
    # Create mock data if the CSV file is not found
    data = pd.DataFrame({
        'Feature1': np.random.rand(10),
        'Feature2': np.random.rand(10),
        'Feature3': np.random.rand(10),
        'Target': np.random.choice([0, 1], size=10)
    })
    print(data.head())  # Display mock data

# Check if the dataset has at least 2 columns
if data.shape[1] < 2:
    print("Error: The dataset does not have enough columns.")
    exit()

# Check if the target column is categorical with exactly two classes
if data.iloc[:, -1].nunique() != 2:
    print("Error: The target variable must have exactly two classes for binary classification.")
    exit()

# Features (all columns except the last) and Labels (last column)
X = data.iloc[:, :-1].values  # Features
y = data.iloc[:, -1].values   # Labels

# Split the data into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)
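A note on this split: the Class column in creditcard.csv is highly imbalanced, so a plain random split can leave the test set with very few positive (fraud) rows; passing stratify=y to train_test_split preserves the class proportions in both splits. As a rough illustration of what stratification does, here is a NumPy-only sketch (stratified_split is a hypothetical helper written for this example, not part of the notebook):

```python
import numpy as np

def stratified_split(y, test_size=0.3, seed=42):
    # Return train/test index arrays that preserve per-class proportions,
    # mirroring train_test_split(..., stratify=y).
    rng = np.random.default_rng(seed)
    train_idx, test_idx = [], []
    for cls in np.unique(y):
        idx = rng.permutation(np.flatnonzero(y == cls))
        n_test = int(round(len(idx) * test_size))
        test_idx.extend(idx[:n_test])
        train_idx.extend(idx[n_test:])
    return np.array(train_idx), np.array(test_idx)

y_demo = np.array([0] * 90 + [1] * 10)   # imbalanced toy target
tr, te = stratified_split(y_demo)
print(len(te), y_demo[te].mean())  # prints: 30 0.1  (10% positives kept)
```

With the real notebook, the equivalent one-line change would be train_test_split(X, y, test_size=0.3, random_state=42, stratify=y).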

# Apply PCA to reduce the dimensionality of the data to 2 components
pca = PCA(n_components=2)                 # Set number of components to 2
X_train_pca = pca.fit_transform(X_train)  # Fit PCA on training data
X_test_pca = pca.transform(X_test)        # Transform test data
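PCA picks the directions of maximum raw variance, so features on large numeric scales (here, Time and Amount, versus the already-normalized V1-V28) dominate the components unless the data is standardized first. A minimal NumPy sketch of what PCA.fit_transform computes, shown on synthetic data where one feature's scale swamps the rest:

```python
import numpy as np

def pca_fit_transform(X, n_components=2):
    # Center the data and take the SVD; rows of Vt are the principal axes.
    Xc = X - X.mean(axis=0)
    U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    explained = S**2 / (len(X) - 1)          # variance along each axis
    ratio = explained / explained.sum()      # explained variance ratio
    return Xc @ Vt[:n_components].T, ratio[:n_components]

rng = np.random.default_rng(42)
X_demo = rng.normal(size=(200, 5))
X_demo[:, 0] *= 100  # one unscaled feature dominates the variance
scores, ratio = pca_fit_transform(X_demo)
print(ratio)  # first component captures nearly all the variance
```

Fitting sklearn.preprocessing.StandardScaler on the training split and transforming both splits before PCA would spread the explained variance more evenly; the heavily skewed ratio printed in the notebook's output further down is consistent with unscaled input.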

# Train a logistic regression model on the reduced data
model = LogisticRegression()     # Initialize logistic regression
model.fit(X_train_pca, y_train)  # Train the model

# Make predictions on the test data
y_pred = model.predict(X_test_pca)

# Calculate the accuracy of the model
accuracy = accuracy_score(y_test, y_pred)
print("PCA Explained Variance Ratio:", pca.explained_variance_ratio_)
print("Accuracy of Logistic Regression:", accuracy)

# Provide interpretation of the accuracy score
if accuracy < 0.5:
    interpretation = "The model is performing poorly. It may not be learning the patterns in the data."
elif accuracy < 0.7:
    interpretation = "The model has moderate accuracy. There may be room for improvement."
elif accuracy < 0.9:
    interpretation = "The model is performing well, but there might still be some overfitting."
else:
    interpretation = "The model has high accuracy and is likely performing well on the test data."

# Print the interpretation of the accuracy
print("Interpretation of Accuracy:", interpretation)

https://colab.research.google.com/drive/1_KBRgYJlwVjZQJe-P3HdUflPhQPGtAG4#scrollTo=T_ebqXyuWO1Z&printMode=true 1/2
File found, loading the dataset.
First few rows of the dataset:
Time V1 V2 V3 V4 V5 V6 V7 \
0 0 -1.359807 -0.072781 2.536347 1.378155 -0.338321 0.462388 0.239599
1 0 1.191857 0.266151 0.166480 0.448154 0.060018 -0.082361 -0.078803
2 1 -1.358354 -1.340163 1.773209 0.379780 -0.503198 1.800499 0.791461
3 1 -0.966272 -0.185226 1.792993 -0.863291 -0.010309 1.247203 0.237609
4 2 -1.158233 0.877737 1.548718 0.403034 -0.407193 0.095921 0.592941

V8 V9 ... V21 V22 V23 V24 V25 \
0 0.098698 0.363787 ... -0.018307 0.277838 -0.110474 0.066928 0.128539
1 0.085102 -0.255425 ... -0.225775 -0.638672 0.101288 -0.339846 0.167170
2 0.247676 -1.514654 ... 0.247998 0.771679 0.909412 -0.689281 -0.327642
3 0.377436 -1.387024 ... -0.108300 0.005274 -0.190321 -1.175575 0.647376
4 -0.270533 0.817739 ... -0.009431 0.798278 -0.137458 0.141267 -0.206010

V26 V27 V28 Amount Class
0 -0.189115 0.133558 -0.021053 149.62 0
1 0.125895 -0.008983 0.014724 2.69 0
2 -0.139097 -0.055353 -0.059752 378.66 0
3 -0.221929 0.062723 0.061458 123.50 0
4 0.502292 0.219422 0.215153 69.99 0

[5 rows x 31 columns]
PCA Explained Variance Ratio: [9.99761791e-01 2.38122500e-04]
Accuracy of Logistic Regression: 0.9971666666666666
Interpretation of Accuracy: The model has high accuracy and is likely performing well on the test data.
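One caveat worth adding to this interpretation: the credit card fraud dataset is heavily imbalanced (fraud is well under 1% of rows), so a high accuracy can be matched by a model that always predicts the majority class. A quick baseline check makes this concrete (majority_baseline_accuracy is an illustrative helper for this sketch, not part of the notebook):

```python
import numpy as np

def majority_baseline_accuracy(y):
    # Accuracy of a trivial classifier that always predicts
    # the most frequent class in y.
    _, counts = np.unique(y, return_counts=True)
    return counts.max() / counts.sum()

y_demo = np.array([0] * 995 + [1] * 5)  # stand-in for an imbalanced target
print(majority_baseline_accuracy(y_demo))  # prints 0.995
```

If the logistic regression's 0.9972 only barely beats this baseline on the real test labels, metrics that focus on the minority class, such as precision, recall, or ROC AUC, are more informative than raw accuracy.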
