
CO3

1. Program to implement decision trees using a standard dataset available in the public domain and find the
accuracy of the algorithm. (Implement a pruning technique to avoid overfitting and re-evaluate the decision
tree's performance after pruning. Compare the decision tree model's performance with other classification
algorithms, such as k-Nearest Neighbors (k-NN) or Naive Bayes. Use either the ID3, C4.5, or CART (Gini impurity)
algorithm.)

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, plot_tree
from sklearn.metrics import accuracy_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.naive_bayes import GaussianNB
import matplotlib.pyplot as plt

# Load the Iris dataset and hold out 30% of it for testing
data = load_iris()
X, y = data.data, data.target
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# Unpruned decision tree (CART with Gini impurity)
tree_clf = DecisionTreeClassifier(criterion="gini", random_state=42)
tree_clf.fit(X_train, y_train)
y_pred = tree_clf.predict(X_test)
accuracy_before_pruning = accuracy_score(y_test, y_pred)
print("Decision Tree Accuracy (before pruning):", accuracy_before_pruning)

plt.figure(figsize=(15, 8))
plot_tree(tree_clf, filled=True, feature_names=data.feature_names, class_names=data.target_names)
plt.title("Decision Tree before Pruning")
plt.show()

# Pre-pruned tree: capping max_depth limits model complexity to avoid overfitting
pruned_tree_clf = DecisionTreeClassifier(criterion="gini", max_depth=3, random_state=42)
pruned_tree_clf.fit(X_train, y_train)
y_pruned_pred = pruned_tree_clf.predict(X_test)
accuracy_after_pruning = accuracy_score(y_test, y_pruned_pred)
print("Decision Tree Accuracy (after pruning):", accuracy_after_pruning)

plt.figure(figsize=(15, 8))
plot_tree(pruned_tree_clf, filled=True, feature_names=data.feature_names, class_names=data.target_names)
plt.title("Decision Tree after Pruning")
plt.show()

# k-Nearest Neighbors (k-NN)
knn_clf = KNeighborsClassifier(n_neighbors=5)
knn_clf.fit(X_train, y_train)
y_knn_pred = knn_clf.predict(X_test)
knn_accuracy = accuracy_score(y_test, y_knn_pred)
print("k-Nearest Neighbors Accuracy:", knn_accuracy)

# Naive Bayes
nb_clf = GaussianNB()
nb_clf.fit(X_train, y_train)
y_nb_pred = nb_clf.predict(X_test)
nb_accuracy = accuracy_score(y_test, y_nb_pred)
print("Naive Bayes Accuracy:", nb_accuracy)

print("\nSummary of model accuracies:")
print(f"Decision Tree (before pruning): {accuracy_before_pruning:.4f}")
print(f"Decision Tree (after pruning): {accuracy_after_pruning:.4f}")
print(f"k-Nearest Neighbors: {knn_accuracy:.4f}")
print(f"Naive Bayes: {nb_accuracy:.4f}")


Output
Decision Tree Accuracy (before pruning): 1.0
Decision Tree Accuracy (after pruning): 1.0
k-Nearest Neighbors Accuracy: 1.0
Naive Bayes Accuracy: 0.9777777777777777

Summary of model accuracies:
Decision Tree (before pruning): 1.0000
Decision Tree (after pruning): 1.0000
k-Nearest Neighbors: 1.0000
Naive Bayes: 0.9778
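The pruned model above relies on pre-pruning (capping max_depth before training). scikit-learn also supports post-pruning through minimal cost-complexity pruning, exposed via the ccp_alpha parameter. Below is a minimal sketch, assuming the same X_train/X_test split as above; in practice the alpha value would be chosen on a validation set or by cross-validation rather than on the test set.

# Sketch: post-pruning with minimal cost-complexity pruning (CART).
# Assumes X_train, X_test, y_train, y_test from the split above.
path = DecisionTreeClassifier(random_state=42).cost_complexity_pruning_path(X_train, y_train)

best_alpha, best_acc = 0.0, 0.0
for alpha in path.ccp_alphas:
    clf = DecisionTreeClassifier(criterion="gini", ccp_alpha=alpha, random_state=42)
    clf.fit(X_train, y_train)
    acc = accuracy_score(y_test, clf.predict(X_test))
    if acc >= best_acc:  # prefer stronger pruning when accuracy ties
        best_alpha, best_acc = alpha, acc

print("Best ccp_alpha:", best_alpha, "accuracy:", best_acc)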
2. Explore the concepts of simple linear regression, multiple linear regression, and correlation, using the ordinary
least squares estimation method to fit the regression models.

import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, r2_score
from sklearn.datasets import load_iris
import seaborn as sns
import matplotlib.pyplot as plt

# Load Iris and rename the columns to simpler identifiers
iris = load_iris()
df = pd.DataFrame(iris.data, columns=iris.feature_names)
df.columns = ["sepal_length", "sepal_width", "petal_length", "petal_width"]

# Pairwise Pearson correlations between the four features
correlation_matrix = df.corr()
print("Correlation Matrix:\n", correlation_matrix)

plt.figure(figsize=(8, 6))
sns.heatmap(correlation_matrix, annot=True, cmap='coolwarm')
plt.title("Correlation Matrix for Iris Dataset Features")
plt.show()

# Predict sepal_length from the remaining three features
X = df.drop(columns="sepal_length")
y = df["sepal_length"]
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Simple linear regression: a single predictor (petal_length)
X_simple = X_train[["petal_length"]]
X_simple_test = X_test[["petal_length"]]
lr_simple = LinearRegression()
lr_simple.fit(X_simple, y_train)
y_pred_simple = lr_simple.predict(X_simple_test)

mse_simple = mean_squared_error(y_test, y_pred_simple)
r2_simple = r2_score(y_test, y_pred_simple)
print("Simple Linear Regression MSE:", mse_simple)
print("Simple Linear Regression R^2:", r2_simple)

# Multiple linear regression: all three predictors
lr_multiple = LinearRegression()
lr_multiple.fit(X_train, y_train)
y_pred_multiple = lr_multiple.predict(X_test)

mse_multiple = mean_squared_error(y_test, y_pred_multiple)
r2_multiple = r2_score(y_test, y_pred_multiple)
print("Multiple Linear Regression MSE:", mse_multiple)
print("Multiple Linear Regression R^2:", r2_multiple)

Output
Correlation Matrix:
               sepal_length  sepal_width  petal_length  petal_width
sepal_length       1.000000    -0.117570      0.871754     0.817941
sepal_width       -0.117570     1.000000     -0.428440    -0.366126
petal_length       0.871754    -0.428440      1.000000     0.962865
petal_width        0.817941    -0.366126      0.962865     1.000000

Simple Linear Regression MSE: 0.129093146356764
Simple Linear Regression R^2: 0.812980761507489
Multiple Linear Regression MSE: 0.10212647866320387
Multiple Linear Regression R^2: 0.8520477902310163
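LinearRegression fits these models by ordinary least squares internally; the same coefficients can be recovered directly from the normal equation beta = (X^T X)^(-1) X^T y. Below is a minimal sketch with NumPy, assuming the X_train and y_train objects from the split above (the prepended column of ones supplies the intercept term).

# Sketch: OLS coefficients from the normal equation (X^T X) beta = X^T y.
# Assumes X_train and y_train from the split above.
Xb = np.column_stack([np.ones(len(X_train)), X_train])  # prepend intercept column
beta = np.linalg.solve(Xb.T @ Xb, Xb.T @ np.asarray(y_train))
print("Intercept:", beta[0])
print("Coefficients:", beta[1:])  # should match lr_multiple.intercept_ and .coef_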
3. Work with a dataset containing independent variables (features) and a dependent variable (target) to predict
and analyze their relationships. (Implement feature scaling (e.g., standardization or normalization) for the
independent variables and re-evaluate the performance of the multiple linear regression model. Implement
regularization techniques (e.g., Lasso or Ridge regression) to handle potential overfitting in the multiple linear
regression model. Compare the performance of the regularized and non-regularized models.)

import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LinearRegression, Ridge, Lasso
from sklearn.metrics import mean_squared_error, r2_score
from sklearn.datasets import load_iris

# Load Iris; predict sepal_length from the other three features
iris = load_iris()
df = pd.DataFrame(iris.data, columns=iris.feature_names)
df.columns = ["sepal_length", "sepal_width", "petal_length", "petal_width"]
X = df.drop(columns="sepal_length")
y = df["sepal_length"]
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Standardize the independent variables (fit the scaler on the training set only)
scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)
X_test_scaled = scaler.transform(X_test)

# Baseline: simple linear regression on a single unscaled feature
X_single_feature = X_train[["petal_length"]]
X_single_test = X_test[["petal_length"]]
lr_single = LinearRegression()
lr_single.fit(X_single_feature, y_train)
y_pred_single = lr_single.predict(X_single_test)
mse_single = mean_squared_error(y_test, y_pred_single)
r2_single = r2_score(y_test, y_pred_single)
print("Simple Linear Regression MSE:", mse_single)
print("Simple Linear Regression R^2:", r2_single)

# Multiple linear regression on the standardized features
lr_multi = LinearRegression()
lr_multi.fit(X_train_scaled, y_train)
y_pred_multi = lr_multi.predict(X_test_scaled)
mse_multi = mean_squared_error(y_test, y_pred_multi)
r2_multi = r2_score(y_test, y_pred_multi)
print("Multiple Linear Regression MSE:", mse_multi)
print("Multiple Linear Regression R^2:", r2_multi)

# Ridge regression (L2 penalty shrinks coefficients toward zero)
ridge = Ridge(alpha=1.0)
ridge.fit(X_train_scaled, y_train)
y_pred_ridge = ridge.predict(X_test_scaled)
mse_ridge = mean_squared_error(y_test, y_pred_ridge)
r2_ridge = r2_score(y_test, y_pred_ridge)
print("Ridge Regression MSE:", mse_ridge)
print("Ridge Regression R^2:", r2_ridge)

# Lasso regression (L1 penalty can zero out coefficients entirely)
lasso = Lasso(alpha=0.1)
lasso.fit(X_train_scaled, y_train)
y_pred_lasso = lasso.predict(X_test_scaled)
mse_lasso = mean_squared_error(y_test, y_pred_lasso)
r2_lasso = r2_score(y_test, y_pred_lasso)
print("Lasso Regression MSE:", mse_lasso)
print("Lasso Regression R^2:", r2_lasso)

# Compare coefficient shrinkage across the three models
print("Multiple Linear Regression Coefficients:", lr_multi.coef_)
print("Ridge Regression Coefficients:", ridge.coef_)
print("Lasso Regression Coefficients:", lasso.coef_)


Output
Simple Linear Regression MSE: 0.129093146356764
Simple Linear Regression R^2: 0.812980761507489
Multiple Linear Regression MSE: 0.10212647866320375
Multiple Linear Regression R^2: 0.8520477902310164
Ridge Regression MSE: 0.09363860271269377
Ridge Regression R^2: 0.8643443074473242
Lasso Regression MSE: 0.12157358128102438
Lasso Regression R^2: 0.8238744717775386
Multiple Linear Regression Coefficients: [ 0.29673801  1.32167517 -0.50506043]
Ridge Regression Coefficients: [ 0.27896087  1.1378305  -0.33189876]
Lasso Regression Coefficients: [0.09182964 0.64697702 0.        ]
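Note that the Lasso penalty drove the third coefficient (petal_width) exactly to zero, which is the feature-selection behavior that distinguishes L1 from L2 regularization. The question also permits normalization in place of standardization; below is a minimal sketch assuming the same X_train/X_test split as above, swapping in MinMaxScaler, which rescales each feature to the [0, 1] range.

# Sketch: min-max normalization as an alternative to standardization.
# Assumes X_train, X_test, y_train, y_test from the split above.
from sklearn.preprocessing import MinMaxScaler

minmax = MinMaxScaler()                       # rescale each feature to [0, 1]
X_train_norm = minmax.fit_transform(X_train)  # fit on the training data only
X_test_norm = minmax.transform(X_test)

lr_norm = LinearRegression()
lr_norm.fit(X_train_norm, y_train)
r2_norm = r2_score(y_test, lr_norm.predict(X_test_norm))
print("Multiple Linear Regression R^2 (normalized):", r2_norm)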
