Stats Lab (10-12)

The document outlines three programs implementing data analysis techniques using Python: PCA on the Wisconsin breast cancer dataset, LDA on the Iris dataset, and multiple linear regression on the Iris dataset. Each program includes data loading, transformation, visualization, and evaluation of results. The PCA and LDA visualizations display the separation of classes, while the regression analysis provides metrics like Mean Squared Error and R-squared for model performance.


10. Program to implement PCA on the Wisconsin breast cancer dataset, visualize and analyze the results.

import numpy as np

import pandas as pd

import matplotlib.pyplot as plt

from sklearn.datasets import load_breast_cancer

# Load the dataset and build a DataFrame for inspection
data = load_breast_cancer()
df = pd.DataFrame(data.data, columns=data.feature_names)
X = data.data    # Feature matrix
y = data.target  # Class labels (0 = malignant, 1 = benign)

print(df)

# Standardize the data

X_mean = np.mean(X, axis=0)

X_std = np.std(X, axis=0)

X_standardized = (X - X_mean) / X_std

# Compute the covariance matrix

cov_matrix = np.cov(X_standardized, rowvar=False)

print(cov_matrix)

# Compute eigenvalues and eigenvectors (eigh, since the covariance matrix is symmetric)

eigenvalues, eigenvectors = np.linalg.eigh(cov_matrix)

# Sort the eigenvalues and eigenvectors

sorted_indices = np.argsort(eigenvalues)[::-1]

eigenvalues_sorted = eigenvalues[sorted_indices]

eigenvectors_sorted = eigenvectors[:, sorted_indices]

# Select the top 2 principal components

k=2

eigenvectors_subset = eigenvectors_sorted[:, :k]

# Transform the data

X_pca = X_standardized.dot(eigenvectors_subset)

# Visualize the PCA results


plt.figure(figsize=(10, 6))

plt.scatter(X_pca[:, 0], X_pca[:, 1], c=y, cmap='viridis', edgecolor='k', s=50)

plt.title('PCA of Wisconsin Breast Cancer Dataset')

plt.xlabel('Principal Component 1')

plt.ylabel('Principal Component 2')

plt.colorbar(label='Class Label')

plt.grid()

plt.show()
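As an optional cross-check (a sketch, not part of the original program), the manually computed projection can be compared against scikit-learn's PCA, and the sorted eigenvalues give the proportion of variance captured by the two components. Component signs may differ, since eigenvectors are only defined up to sign.

# Optional sketch: verify the manual PCA against scikit-learn and report explained variance
from sklearn.decomposition import PCA

pca = PCA(n_components=2)
X_pca_sklearn = pca.fit_transform(X_standardized)
print("Explained variance ratio (sklearn):", pca.explained_variance_ratio_)
print("Explained variance ratio (manual):", eigenvalues_sorted[:2] / eigenvalues_sorted.sum())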

11. Program to implement linear discriminant analysis (LDA) on the Iris dataset and visualize the results.

import pandas as pd

import numpy as np

import matplotlib.pyplot as plt

import seaborn as sns

from sklearn import datasets

from sklearn.discriminant_analysis import LinearDiscriminantAnalysis as LDA

# Load the iris dataset

iris = datasets.load_iris()

X = iris.data # Features

y = iris.target # Target classes

print(y)

# Create an instance of LDA

lda = LDA(n_components=2)

# Fit and transform the data

X_lda = lda.fit_transform(X, y)

# Create a DataFrame for visualization


lda_df = pd.DataFrame(data=X_lda, columns=['LD1', 'LD2'])

lda_df['target'] = y

# Map target values to class names

lda_df['target'] = lda_df['target'].map({0: 'Setosa', 1: 'Versicolor', 2: 'Virginica'})

# Plotting

plt.figure(figsize=(10, 6))

sns.scatterplot(data=lda_df, x='LD1', y='LD2', hue='target', palette='viridis', s=100)

plt.title('LDA of Iris Dataset')

plt.xlabel('Linear Discriminant 1')

plt.ylabel('Linear Discriminant 2')

plt.legend(title='Species')

plt.grid()

plt.show()
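As a brief follow-up sketch (not part of the original program), the explained variance ratio of the discriminants and a cross-validated accuracy estimate give a quantitative sense of the class separation visible in the plot.

# Optional analysis sketch: quantify the separation shown in the LDA plot
from sklearn.model_selection import cross_val_score

print("Explained variance ratio of the discriminants:", lda.explained_variance_ratio_)
scores = cross_val_score(LDA(), X, y, cv=5)
print("5-fold cross-validated accuracy:", scores.mean())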

12. Program to implement multiple linear regression on the Iris dataset, visualize and analyze the results.

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, r2_score
# Load the iris dataset
iris = sns.load_dataset('iris')
print(iris.head())
# Define independent variables (features) and dependent variable (target)
X = iris[['sepal_length', 'sepal_width', 'petal_width']]
y = iris['petal_length']
# Split the dataset into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
# Create a linear regression model
model = LinearRegression()
model.fit(X_train, y_train)
# Make predictions
y_pred = model.predict(X_test)
# Evaluate the model
mse = mean_squared_error(y_test, y_pred)
r2 = r2_score(y_test, y_pred)

print(f'Mean Squared Error: {mse}')
print(f'R-squared: {r2}')
# Visualize the results
plt.figure(figsize=(10, 6))
plt.scatter(y_test, y_pred, color='blue')
plt.plot([y.min(), y.max()], [y.min(), y.max()], color='red', linewidth=2)
plt.title('Actual vs Predicted Petal Length')
plt.xlabel('Actual Petal Length')
plt.ylabel('Predicted Petal Length')
plt.grid()
plt.show()
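As a short follow-up (a sketch, not part of the original program), printing the fitted coefficients and intercept shows how each feature contributes to the predicted petal length and complements the MSE and R-squared metrics above.

# Optional sketch: inspect the fitted regression coefficients
coefficients = pd.DataFrame({'feature': X.columns, 'coefficient': model.coef_})
print(coefficients)
print(f'Intercept: {model.intercept_:.3f}')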
