
ML Assignment 9

The document outlines a process for applying Principal Component Analysis (PCA) to the heart_disease dataset for binary classification using logistic regression. It includes steps for data loading, preprocessing, PCA transformation, and model training and evaluation, achieving an accuracy of approximately 85.25%. The code snippets provided guide the user through each stage of the analysis.




January 3, 2025

0.1 Apply PCA on heart_disease.csv for implementing binary classification.


Please refer to the metadata of the heart_disease dataset before implementation.
[1]: import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

# Load the dataset
url = "https://itv-contentbucket.s3.ap-south-1.amazonaws.com/Exams/ML/PCA/heart_disease.csv"
data = pd.read_csv(url)

# Display the first few rows
print(data.head())

age sex cp trestbps chol fbs restecg thalach exang oldpeak slope \
0 63 1 3 145 233 1 0 150 0 2.3 0
1 37 1 2 130 250 0 1 187 0 3.5 0
2 41 0 1 130 204 0 0 172 0 1.4 2
3 56 1 1 120 236 0 1 178 0 0.8 2
4 57 0 0 120 354 0 1 163 1 0.6 2

ca thal target
0 0 1 1
1 0 2 1
2 0 2 1
3 0 2 1
4 0 2 1

[2]: # Check for missing values
print(data.isnull().sum())

# Drop rows with missing values (this dataset has none, so nothing is dropped)
data = data.dropna()

age 0
sex 0
cp 0
trestbps 0
chol 0
fbs 0
restecg 0
thalach 0
exang 0
oldpeak 0
slope 0
ca 0
thal 0
target 0
dtype: int64
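The counts above show no missing values, so `dropna()` is a no-op here. If the dataset did contain gaps, median imputation is a common alternative to dropping rows. A minimal sketch on a hypothetical toy frame (the column values below are illustrative, not taken from heart_disease.csv):

```python
import pandas as pd

# Hypothetical stand-in frame with one missing cholesterol value
df = pd.DataFrame({"chol": [233.0, None, 204.0], "target": [1, 0, 1]})

# Fill numeric gaps with each column's median instead of dropping the row
df_filled = df.fillna(df.select_dtypes("number").median())
print(df_filled["chol"].tolist())
```

Median imputation keeps all rows, which matters for small datasets like this one.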

[3]: # Separate the features from the 'target' label column
X = data.drop('target', axis=1)
y = data['target']

[4]: X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

[5]: # Standardize features so PCA is not dominated by large-scale columns
scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)

[6]: # Choose the number of principal components to keep (e.g., 2 components)
pca = PCA(n_components=2)
X_train_pca = pca.fit_transform(X_train)
X_test_pca = pca.transform(X_test)

print(f'Explained Variance Ratio: {pca.explained_variance_ratio_}')

Explained Variance Ratio: [0.2072575 0.12434085]
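Note that two components capture only about 33% of the total variance here. Rather than hard-coding `n_components=2`, scikit-learn's `PCA` also accepts a float in (0, 1), in which case it keeps however many components are needed to reach that cumulative explained-variance ratio. A sketch on synthetic stand-in data (the 100x13 random matrix is an assumption, not the real dataset):

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
X_demo = rng.normal(size=(100, 13))        # stand-in for the 13 scaled features
X_demo = StandardScaler().fit_transform(X_demo)

# A float target keeps the smallest number of components whose
# cumulative explained-variance ratio reaches 95%
pca_auto = PCA(n_components=0.95)
X_demo_red = pca_auto.fit_transform(X_demo)
print(X_demo_red.shape[1], pca_auto.explained_variance_ratio_.sum())
```

On the real heart_disease features, this would likely retain far more than 2 components, trading dimensionality reduction for predictive signal.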

[7]: model = LogisticRegression()
model.fit(X_train_pca, y_train)

[7]: LogisticRegression()

[8]: y_pred = model.predict(X_test_pca)
accuracy = accuracy_score(y_test, y_pred)
print(f'Accuracy: {accuracy}')

Accuracy: 0.8524590163934426
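Accuracy alone can hide class-wise behaviour on a binary problem; a confusion matrix and per-class report give a fuller picture. A minimal sketch using hypothetical label arrays in place of the `y_test` / `y_pred` computed above:

```python
import numpy as np
from sklearn.metrics import classification_report, confusion_matrix

# Hypothetical labels standing in for the real y_test / y_pred
y_true_demo = np.array([1, 0, 1, 1, 0, 1])
y_pred_demo = np.array([1, 0, 0, 1, 0, 1])

# Rows are true classes, columns are predicted classes
print(confusion_matrix(y_true_demo, y_pred_demo))
print(classification_report(y_true_demo, y_pred_demo))
```

Passing the actual `y_test` and `y_pred` from cell [8] would show how the 85.25% accuracy splits into precision and recall for the disease and no-disease classes.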
