Lab 3

The document outlines a Python program that implements Principal Component Analysis (PCA) to reduce the dimensionality of the Iris dataset from 4 features to 2. It includes steps for loading the dataset, standardizing the data, applying PCA, and visualizing the results with a scatter plot. The explained variance ratio indicates how much variance is captured by the two principal components.
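The pipeline summarized above (load, standardize, reduce to 2 components) can be condensed into a few lines of scikit-learn; this is a minimal sketch, and the full annotated program follows below.

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

X = load_iris().data                       # 150 samples, 4 features
X_std = StandardScaler().fit_transform(X)  # zero mean, unit variance per feature
X_2d = PCA(n_components=2).fit_transform(X_std)
print(X_2d.shape)  # (150, 2)
```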


3. Develop a program to implement Principal Component Analysis (PCA) for reducing the dimensionality of the Iris dataset from 4 features to 2.
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Load the Iris dataset
iris = load_iris()
print(iris.feature_names) # Column names
print(iris.target_names) # Class names

df = pd.DataFrame(data=iris.data, columns=iris.feature_names)
print(df.head())

# Standardize data before applying PCA
df_standardized = StandardScaler().fit_transform(df)

# Apply PCA with 2 components
pca = PCA(n_components=2)
principalComponents = pca.fit_transform(df_standardized)

# Create a new DataFrame with the principal components
pdf = pd.DataFrame(data=principalComponents,
                   columns=['Principal Component 1', 'Principal Component 2'])

# Concatenate the principal components with the class labels
finalDf = pd.concat([pdf, pd.DataFrame(data=iris.target, columns=['target'])], axis=1)
print(finalDf.head())

# Visualize the data
fig, ax = plt.subplots(figsize=(8, 6))
ax.set_xlabel('Principal Component 1', fontsize=15)
ax.set_ylabel('Principal Component 2', fontsize=15)
explained_variance = sum(pca.explained_variance_ratio_)
ax.set_title(f'2 Component PCA (Explained Variance: {explained_variance:.2f})', fontsize=20)

targets = [0, 1, 2]
colors = ['r', 'g', 'b']
for target, color in zip(targets, colors):
    indicesToKeep = finalDf['target'] == target
    ax.scatter(finalDf.loc[indicesToKeep, 'Principal Component 1'],
               finalDf.loc[indicesToKeep, 'Principal Component 2'],
               c=color, label=iris.target_names[target], s=50, edgecolors='k')

ax.legend()
ax.grid()
plt.show()

# Print explained variance ratio
print('Explained variance ratio:', pca.explained_variance_ratio_)

Output
['sepal length (cm)', 'sepal width (cm)', 'petal length (cm)', 'petal width (cm)']
['setosa' 'versicolor' 'virginica']
sepal length (cm) sepal width (cm) petal length (cm) petal width (cm)
0 5.1 3.5 1.4 0.2
1 4.9 3.0 1.4 0.2
2 4.7 3.2 1.3 0.2
3 4.6 3.1 1.5 0.2
4 5.0 3.6 1.4 0.2
Principal Component 1 Principal Component 2 target
0 -2.264703 0.480027 0
1 -2.080961 -0.674134 0
2 -2.364229 -0.341908 0
3 -2.299384 -0.597395 0
4 -2.389842 0.646835 0
Explained variance ratio: [0.72962445 0.22850762]
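The explained variance ratio printed above can be verified by hand: PCA's principal components are the eigenvectors of the covariance matrix of the standardized data, and each ratio is the corresponding eigenvalue divided by the sum of all eigenvalues. A minimal check with NumPy (a sketch for verification, not part of the lab program):

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.preprocessing import StandardScaler

X_std = StandardScaler().fit_transform(load_iris().data)

# Covariance matrix of the standardized features (4 x 4)
cov = np.cov(X_std, rowvar=False)

# Eigenvalues of the symmetric covariance matrix, sorted descending
eigvals = np.sort(np.linalg.eigvalsh(cov))[::-1]

# Each ratio = eigenvalue / total variance
ratios = eigvals / eigvals.sum()
print(ratios[:2])  # matches pca.explained_variance_ratio_: ~[0.7296 0.2285]
```

The first two components together capture about 96% of the total variance, which is why a 2-D scatter plot separates the three Iris classes so well.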
