0% found this document useful (0 votes)

24 views16 pages

ABHAYMLFILE

Ml file

Uploaded by

ranabeena804

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

24 views16 pages

ABHAYMLFILE

Ml file

Uploaded by

ranabeena804

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 16

Name: Abhay Chand Ramola

Course: BCA(6) Sec: A

Roll No: 2121020(05)
Subject: Fundamental of Machine Learning

Problem Statement: Write a python program to implement logistic regression on California_housing

dataset.
Source code:
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import StandardScaler
from sklearn.metrics import accuracy_score, confusion_matrix

# Load the dataset

df = pd.read_csv('/content/sample_data/california_housing_train.csv')

# Data preprocessing by dropping any rows with missing values

df.dropna(inplace=True)

# Binning the target variable 'median_house_value' into two categories

median_value = df['median_house_value'].median()
df['value_category'] = (df['median_house_value'] > median_value).astype(int)

# Splitting the dataset into X and y variables

X = df.drop(['median_house_value', 'value_category'], axis=1)
y = df['value_category']

# Splitting the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Feature scaling
scaler = StandardScaler()
Name: Abhay Chand Ramola
Course: BCA(6) Sec: A
Roll No: 2121020(05)
Subject: Fundamental of Machine Learning
X_train_scaled = scaler.fit_transform(X_train)
X_test_scaled = scaler.transform(X_test)

# Logistic Regression model

model = LogisticRegression()

# Training the model

model.fit(X_train_scaled, y_train)

# Predictions on the testing set

y_pred = model.predict(X_test_scaled)

# Evaluation metrics
accuracy = accuracy_score(y_test, y_pred)
conf_matrix = confusion_matrix(y_test, y_pred)

print(f"Accuracy:, {accuracy}")
print(f"Confusion matrix:\n{conf_matrix} ")

Output:
Accuracy:, 0.8370588235294117
Confusion matrix:
[[1397 259]
[ 295 1449]]
Name: Abhay Chand Ramola
Course: BCA(6) Sec: A
Roll No: 2121020(05)
Subject: Fundamental of Machine Learning

Problem Statement : Write a python program to implement ID3 algorithm using entropy in decision tree.
Source Code:
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.preprocessing import StandardScaler
from sklearn.metrics import accuracy_score,confusion_matrix

#Load the dataset

df=pd.read_csv('/content/sample_data/california_housing_train.csv')

#Data preprocessing
#Dropping any rows with missing values
df.dropna(inplace=True)

#Splitting the dataset into features and target values

X=df.drop('median_house_value',axis=1)#Replalce 'target_column_name' with actual column name
y=df['median_house_value']# Replace 'target_column_name' with actual column name

#Splitting the dataset into training and testing sets

X_train,X_test,y_train,y_test=train_test_split(X,y,test_size=0.2,random_state=42)

#Feature Scaling
scaler=StandardScaler()
X_train_scaled=scaler.fit_transform(X_train)
X_test_scaled=scaler.transform(X_test)
Name: Abhay Chand Ramola
Course: BCA(6) Sec: A
Roll No: 2121020(05)
Subject: Fundamental of Machine Learning
#Decision Tree model
model=DecisionTreeClassifier(criterion='entropy') #Using ID#(Entropy) criterion

#Training the model

model.fit(X_train_scaled,y_train)

#Preddictions on the testing set

y_pred=model.predict(X_test_scaled)

#Model evaluation
accuracy=accuracy_score(y_test,y_pred)
print("Accuracy: ",accuracy)
print("Confusion matrix : \n",conf_matrix)

Output:
Accuracy: 0.025588235294117648
Confusion matrix :
[[1397 259]
[ 295 1449]]
Name: Abhay Chand Ramola
Course: BCA(6) Sec: A
Roll No: 2121020(05)
Subject: Fundamental of Machine Learning

Problem Statement: Write a python program to implement CART algorithm for decision tree.
Source Code:
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.preprocessing import StandardScaler
from sklearn.metrics import accuracy_score,confusion_matrix

#Load the dataset

df=pd.read_csv('/content/sample_data/california_housing_train.csv')

#Data Preprocessing
#Dropping any rows with missing values
df.dropna(inplace=True)

#Splitting the dataset into features and target variables

X=df.drop('median_house_value',axis=1) #Replace 'target_column_name' with actual target column name
y=df['median_house_value']#Replace 'target_column_name' with actual target column name

#Splitting the dataset into trianing and testing sets

X_train,X_test,y_train,y_test=train_test_split(X,y,test_size=0.2,random_state=42)

#Feature Scaling
scaler=StandardScaler()
X_train_scaled=scaler.fit_transform(X_train)
X_test_scaled=scaler.transform(X_test)
Name: Abhay Chand Ramola
Course: BCA(6) Sec: A
Roll No: 2121020(05)
Subject: Fundamental of Machine Learning
#CART (Decision Tree)model
model.fit(X_train_scaled,y_train)

#Predictions on the testing set

y_pred=model.predict(X_test_scaled)

#Model evaluation
accuracy=accuracy_score(y_test,y_pred)
print("Accuracy: ",accuracy)
print("Confusion matrix : \n",conf_matrix)

Output:
Accuracy: 0.023823529411764705
Confusion matrix :
[[1397 259]
[ 295 1449]]
Name: Abhay Chand Ramola
Course: BCA(6) Sec: A
Roll No: 2121020(05)
Subject: Fundamental of Machine Learning

Problem Statement: Write a python program to implement SVM using linear kernel on iris.csv.
Source Code:
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.metrics import classification_report,accuracy_score

url="http://archive.ics.uci.edu/ml/machine-learning-databases/iris/iris.data"
column_names=['sepal_length','sepal_width','petal_length','petal_width','species']
iris=pd.read_csv(url ,header=None, names=column_names)

print(iris.head())

X=iris.iloc[:,:-1].values #all columns except the last one

y=iris.iloc[:,-1].values #the last column

X_train,X_test,y_train,y_test=train_test_split(X,y,test_size=0.3,random_state=42)

scaler=StandardScaler()
X_train=scaler.fit_transform(X_train)
X_test=scaler.transform(X_test)

svm=SVC(kernel='linear',random_state=42)
svm.fit(X_train,y_train)
y_pred=svm.predict(X_test)
Name: Abhay Chand Ramola
Course: BCA(6) Sec: A
Roll No: 2121020(05)
Subject: Fundamental of Machine Learning
accuracy=accuracy_score(y_test,y_pred)
print(f"Accuracy:{accuracy:.2f}")
print(classification_report(y_test,y_pred))

Output:
sepal_length sepal_width petal_length petal_width species
0 5.1 3.5 1.4 0.2 Iris-setosa
1 4.9 3.0 1.4 0.2 Iris-setosa
2 4.7 3.2 1.3 0.2 Iris-setosa
3 4.6 3.1 1.5 0.2 Iris-setosa
4 5.0 3.6 1.4 0.2 Iris-setosa
Accuracy:0.98
precision recall f1-score support

Iris-setosa 1.00 1.00 1.00 19

Iris-versicolor 1.00 0.92 0.96 13
Iris-virginica 0.93 1.00 0.96 13

accuracy 0.98 45
macro avg 0.98 0.97 0.97 45
weighted avg 0.98 0.98 0.98 45
Name: Abhay Chand Ramola
Course: BCA(6) Sec: A
Roll No: 2121020(05)
Subject: Fundamental of Machine Learning

Problem Statement: Write a python program to carry out visualization for each feature separately .
Source Code:
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.datasets import load_iris

# Load the Iris dataset

iris = load_iris()
X = iris.data
y = iris.target
feature_names = iris.feature_names
target_names = iris.target_names

# Plot histograms for each feature

plt.figure(figsize=(12, 6))
for i in range(X.shape[1]):
plt.subplot(2, 2, i+1)
sns.histplot(X[:, i], kde=True, color='skyblue')
plt.title(feature_names[i])
plt.tight_layout()
plt.show()

# Load Iris dataset in a DataFrame for pairplot

iris_df = sns.load_dataset('iris')

# Correct the hue parameter to a valid column

sns.pairplot(iris_df, hue='species')
Name: Abhay Chand Ramola
Course: BCA(6) Sec: A
Roll No: 2121020(05)
Subject: Fundamental of Machine Learning
plt.show()

# PCA Visualization
from sklearn.decomposition import PCA

pca = PCA(n_components=2)
X_pca = pca.fit_transform(X)

# Scatter plot for PCA components

plt.figure(figsize=(8, 6))
sns.scatterplot(x=X_pca[:, 0], y=X_pca[:, 1], hue=y, palette='viridis', legend='full')
plt.title('PCA Visualization')
plt.xlabel('Principal Component 1')
plt.ylabel('Principal Component 2')
plt.show()
Name: Abhay Chand Ramola
Course: BCA(6) Sec: A
Roll No: 2121020(05)
Subject: Fundamental of Machine Learning

Output:
Name: Abhay Chand Ramola
Course: BCA(6) Sec: A
Roll No: 2121020(05)
Subject: Fundamental of Machine Learning

Problem Statement: Write a program to data analyse using supervised algorithms building a predictive
model for customer churn in a subscription based bussiness.
Source Code:
import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix
import joblib

# Step 1: Data Generation

def generate_customer_churn_data(num_customers=1000, start_date='2019-01-01', end_date='2022-01-
01'):
start_date = pd.to_datetime(start_date)
end_date = pd.to_datetime(end_date)

customer_ids = np.arange(1, num_customers + 1)

join_dates = [np.random.choice(pd.date_range(start_date, end_date)) for _ in range(num_customers)]
churn_dates = [join_date + pd.Timedelta(days=np.random.randint(30, 365)) for join_date in join_dates]
churn_status = ['Churned' if date <= end_date else 'Active' for date in churn_dates]

data = {
'CustomerID': customer_ids,
'JoinDate': join_dates,
'ChurnDate': churn_dates,
'ChurnStatus': churn_status
}

df = pd.DataFrame(data)
Name: Abhay Chand Ramola
Course: BCA(6) Sec: A
Roll No: 2121020(05)
Subject: Fundamental of Machine Learning
return df

# Step 2: Data Preprocessing

def preprocess_data(df):
df['JoinYear'] = df['JoinDate'].dt.year
df['JoinMonth'] = df['JoinDate'].dt.month
df['JoinDay'] = df['JoinDate'].dt.day
df['JoinDayOfWeek'] = df['JoinDate'].dt.dayofweek

df['DaysToChurn'] = (df['ChurnDate'] - df['JoinDate']).dt.days

df.drop(['JoinDate', 'ChurnDate'], axis=1, inplace=True)

df['ChurnStatus'] = df['ChurnStatus'].map({'Active': 0, 'Churned': 1})

return df

# Step 3: Split Data

def split_data(df, test_size=0.2):
X = df.drop('ChurnStatus', axis=1)
y = df['ChurnStatus']
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=test_size, random_state=42)
return X_train, X_test, y_train, y_test

# Step 4: Model Training

def train_model(X_train, y_train):
model = RandomForestClassifier(n_estimators=100, random_state=42)

model.fit(X_train, y_train)
Name: Abhay Chand Ramola
Course: BCA(6) Sec: A
Roll No: 2121020(05)
Subject: Fundamental of Machine Learning
return model

# Step 5: Model Evaluation

def evaluate_model(model, X_test, y_test):
y_pred = model.predict(X_test)
accuracy = accuracy_score(y_test, y_pred)
print("Accuracy:", accuracy)
print("\nClassification Report:")
print(classification_report(y_test, y_pred))
print("\nConfusion Matrix:")
print(confusion_matrix(y_test, y_pred))
# Step 6: Model Deployment
def save_model(model, filepath='customer_churn_model.pkl'):
joblib.dump(model, filepath)
print("Model saved successfully.")

def main():
# Step 1: Generate data
df = generate_customer_churn_data()

# Step 2: Preprocess data

df = preprocess_data(df)

# Step 3: Split data

X_train, X_test, y_train, y_test = split_data(df)

# Step 4: Train model

Name: Abhay Chand Ramola
Course: BCA(6) Sec: A
Roll No: 2121020(05)
Subject: Fundamental of Machine Learning
model = train_model(X_train, y_train)

# Step 5: Evaluate model

evaluate_model(model, X_test, y_test)

# Step 6: Save model

save_model(model)
main()

Output:
Accuracy: 0.975

Classification Report:
precision recall f1-score support
0 1.00 0.85 0.92 33
1 0.97 1.00 0.99 167

accuracy 0.97 200

macro avg 0.99 0.92 0.95 200
weighted avg 0.98 0.97 0.97 200

Confusion Matrix:
[[ 28 5]
[ 0 167]]
Model saved successfully.
Name: Abhay Chand Ramola
Course: BCA(6) Sec: A
Roll No: 2121020(05)
Subject: Fundamental of Machine Learning

Data Analytics Using Python Lab Manual
50% (2)
Data Analytics Using Python Lab Manual
8 pages
Questions Answers Chapter Wise
No ratings yet
Questions Answers Chapter Wise
4 pages
Python Cheatsheets 1635792640
100% (1)
Python Cheatsheets 1635792640
9 pages
Election Prediction Projectfinal
No ratings yet
Election Prediction Projectfinal
30 pages
Machine Learning With SQL
100% (1)
Machine Learning With SQL
12 pages
Machine Learning LAB: Practical-1
100% (2)
Machine Learning LAB: Practical-1
24 pages
Kendriya Vidyalaya Sangathan, Chennai Region PRACTICE TEST 2020-2021 Class XII
100% (1)
Kendriya Vidyalaya Sangathan, Chennai Region PRACTICE TEST 2020-2021 Class XII
6 pages
Programming For Engineers in Python: Recitation 12
No ratings yet
Programming For Engineers in Python: Recitation 12
39 pages
Importing Libraries: Pandas PD Matplotlib - Pyplot PLT Numpy NP
No ratings yet
Importing Libraries: Pandas PD Matplotlib - Pyplot PLT Numpy NP
10 pages
AI (AL-304) Lab Manual
No ratings yet
AI (AL-304) Lab Manual
37 pages
Numpy Cheat Sheet: Umpy Umerical Ython
No ratings yet
Numpy Cheat Sheet: Umpy Umerical Ython
1 page
Data Science With Python PDF
0% (1)
Data Science With Python PDF
7 pages
Codes and Other Relevant Explanations For Supervised Learning (Part 1) - Session by Sabyasachi Mukhopadhyay - August 3
No ratings yet
Codes and Other Relevant Explanations For Supervised Learning (Part 1) - Session by Sabyasachi Mukhopadhyay - August 3
5 pages
KRAI LabManual
No ratings yet
KRAI LabManual
77 pages
ML Shristi File
No ratings yet
ML Shristi File
49 pages
109 Sourabh Vivek Chougule
No ratings yet
109 Sourabh Vivek Chougule
75 pages
1 Assignment 3 - Classification
No ratings yet
1 Assignment 3 - Classification
16 pages
Updated - M5 - Python For Machine Learning - Copy - Maria S
No ratings yet
Updated - M5 - Python For Machine Learning - Copy - Maria S
67 pages
Final ML File
No ratings yet
Final ML File
34 pages
Python For Data Science Cheat Sheet: Scikit-Learn Create Your Model Evaluate Your Model's Performance
100% (1)
Python For Data Science Cheat Sheet: Scikit-Learn Create Your Model Evaluate Your Model's Performance
1 page
Project Walkthrough - Bike Share-2020
No ratings yet
Project Walkthrough - Bike Share-2020
58 pages
Lab Manual
No ratings yet
Lab Manual
32 pages
SVM K NN MLP With Sklearn Jupyter NoteBo
No ratings yet
SVM K NN MLP With Sklearn Jupyter NoteBo
22 pages
Train
No ratings yet
Train
17 pages
ML Journal
No ratings yet
ML Journal
37 pages
Geoplotlib Research Paper PDF
No ratings yet
Geoplotlib Research Paper PDF
21 pages
DS Practical
No ratings yet
DS Practical
30 pages
ML Final Prac
No ratings yet
ML Final Prac
47 pages
NumPy 2
No ratings yet
NumPy 2
11 pages
Py MVPA
No ratings yet
Py MVPA
17 pages
Instant Download Introduction To Python For Econometrics, Statistics and Data Analysis Kevin Sheppard PDF All Chapter
100% (4)
Instant Download Introduction To Python For Econometrics, Statistics and Data Analysis Kevin Sheppard PDF All Chapter
76 pages
FDSA Lab Manual
No ratings yet
FDSA Lab Manual
32 pages
Karmbir 19 ML
No ratings yet
Karmbir 19 ML
20 pages
20BCP021 Assignment 6
No ratings yet
20BCP021 Assignment 6
15 pages
ML LabReport Final Index Edited
No ratings yet
ML LabReport Final Index Edited
35 pages
Untitled Document
No ratings yet
Untitled Document
19 pages
Trade Backtest
No ratings yet
Trade Backtest
23 pages
To Study About Numpy, Pandas and Matplotlib Libraries in Python
No ratings yet
To Study About Numpy, Pandas and Matplotlib Libraries in Python
21 pages
Da Program
No ratings yet
Da Program
18 pages
SVM and Kmeans - Iris Dataset - Ipynb - Colab
No ratings yet
SVM and Kmeans - Iris Dataset - Ipynb - Colab
5 pages
Lab Manual ML
No ratings yet
Lab Manual ML
23 pages
Aiml Practicals
No ratings yet
Aiml Practicals
22 pages
ML L - Ab
No ratings yet
ML L - Ab
13 pages
PR
No ratings yet
PR
17 pages
Iii Aid - ML
No ratings yet
Iii Aid - ML
30 pages
Ballistic Calc Explain
No ratings yet
Ballistic Calc Explain
31 pages
ML Keshav
No ratings yet
ML Keshav
23 pages
ML Yogesh
No ratings yet
ML Yogesh
23 pages
Statmech - Lab 1 (Exercise 1) : Name: Pratul Manna
No ratings yet
Statmech - Lab 1 (Exercise 1) : Name: Pratul Manna
9 pages
AML Lab3 2021wb15156
No ratings yet
AML Lab3 2021wb15156
13 pages
ML Lab Programs 2
No ratings yet
ML Lab Programs 2
16 pages
ML Manual
No ratings yet
ML Manual
30 pages
FDS Lab Manual
No ratings yet
FDS Lab Manual
10 pages
ML Manual
No ratings yet
ML Manual
9 pages
19mid0034 (Chandru) - ML Lab Fat - Jupyter Notebook
No ratings yet
19mid0034 (Chandru) - ML Lab Fat - Jupyter Notebook
4 pages
Python Ch-4 - Notes
No ratings yet
Python Ch-4 - Notes
15 pages
ML Lab Manual 4-8
No ratings yet
ML Lab Manual 4-8
11 pages
Machine Learning
No ratings yet
Machine Learning
10 pages
Iris - Copy1 - Jupyter Notebook
No ratings yet
Iris - Copy1 - Jupyter Notebook
8 pages
CSA Lab 2
No ratings yet
CSA Lab 2
5 pages
Source Code Python Jemmy
No ratings yet
Source Code Python Jemmy
7 pages
Difference Between Numpy Arrays & Tensorflow Tensors - Python in Plain English
No ratings yet
Difference Between Numpy Arrays & Tensorflow Tensors - Python in Plain English
8 pages
EXP 07 (ML) - Ashu
No ratings yet
EXP 07 (ML) - Ashu
4 pages
ML Internal Answers
No ratings yet
ML Internal Answers
9 pages
Python - Numpy
No ratings yet
Python - Numpy
8 pages
Bagging, Random Forest, Gradient Boost, AdaBoost & PCA
No ratings yet
Bagging, Random Forest, Gradient Boost, AdaBoost & PCA
8 pages
ML Record
No ratings yet
ML Record
19 pages
Ai/Ml Lab-4: Name: Pratik Jadhav PRN: 20190802050
No ratings yet
Ai/Ml Lab-4: Name: Pratik Jadhav PRN: 20190802050
5 pages
1
No ratings yet
1
13 pages
20ad41e2 - Data Science
No ratings yet
20ad41e2 - Data Science
2 pages
Lab 6
No ratings yet
Lab 6
4 pages
ML Short Code - Under Updating
No ratings yet
ML Short Code - Under Updating
4 pages
Assignment 3
No ratings yet
Assignment 3
6 pages
Praveen Ai
No ratings yet
Praveen Ai
6 pages
ML Lab Manual
No ratings yet
ML Lab Manual
6 pages
Lab Activity 7
No ratings yet
Lab Activity 7
6 pages
ML Exp-5,6
No ratings yet
ML Exp-5,6
6 pages
ML
No ratings yet
ML
11 pages
Integrated System Lab
No ratings yet
Integrated System Lab
25 pages
ML Minimized Programs
No ratings yet
ML Minimized Programs
9 pages
ML Mini Project
No ratings yet
ML Mini Project
9 pages
ML Functions
No ratings yet
ML Functions
12 pages
Nomlab 14 Ai
No ratings yet
Nomlab 14 Ai
3 pages
Dsbda 10
No ratings yet
Dsbda 10
5 pages
ML Expt 4
No ratings yet
ML Expt 4
4 pages
Data Analytics Short and Focused Answers
No ratings yet
Data Analytics Short and Focused Answers
3 pages
Set B
No ratings yet
Set B
4 pages
Lab Week 7
No ratings yet
Lab Week 7
3 pages
Assignment 1
No ratings yet
Assignment 1
3 pages