Assignment No. 2 - ML
October 8, 2024
[36]: # ASSIGNMENT NO 2
# Use K-Nearest Neighbors and Support Vector Machine for classification. Analyze their performance.
# Dataset link: the emails.csv dataset on Kaggle: https://www.kaggle.com/datasets/balaka18/email-spam-classification-dataset-csv
[37]: import pandas as pd
import numpy as np
import seaborn as sns
import matplotlib.pyplot as plt
[38]: df = pd.read_csv('/home/pc13/Documents/Email/emails.csv') # load the dataset into a DataFrame
[51]: df.head() # returns the first five rows of the DataFrame df
[51]:   Email No.  the  to  ect  and  for  of    a  you  hou  …  connevey  jay  \
      0   Email 1    0   0    1    0    0   0    2    0    0  …         0    0
      1   Email 2    8  13   24    6    6   2  102    1   27  …         0    0
      2   Email 3    0   0    1    0    0   0    8    0    0  …         0    0
      3   Email 4    0   5   22    0    5   1   51    2   10  …         0    0
      4   Email 5    7   6   17    1    5   2   57    0    9  …         0    0

         valued  lay  infrastructure  military  allowing  ff  dry  Prediction
      0       0    0               0         0         0   0    0           0
      1       0    0               0         0         0   1    0           0
      2       0    0               0         0         0   0    0           0
      3       0    0               0         0         0   0    0           0
      4       0    0               0         0         0   1    0           0

      [5 rows x 3002 columns]
[52]: df.info() # df.info() provides a concise summary of the DataFrame
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 5172 entries, 0 to 5171
Columns: 3002 entries, Email No. to Prediction
dtypes: int64(3001), object(1)
memory usage: 118.5+ MB
[53]: df.isnull().sum() # df.isnull().sum() checks for missing (null) values in the DataFrame
[53]: Email No. 0
the 0
to 0
ect 0
and 0
..
military 0
allowing 0
ff 0
dry 0
Prediction 0
Length: 3002, dtype: int64
[54]: X = df.iloc[:, 1:-1].values
y = df.iloc[:, -1].values
# X and y are created from the DataFrame df using iloc, which performs integer-location based indexing.
# X is the feature set (input data): all word-count columns, dropping the first column ("Email No.") and the last.
# y is the target variable ("Prediction", the last column) that the model aims to predict.
[55]: from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.30, random_state=101)
# train_test_split splits the dataset into training (70%) and testing (30%) sets;
# random_state fixes the shuffle so the split is reproducible.
[56]: from sklearn.preprocessing import StandardScaler
sc_X = StandardScaler()
X_train = sc_X.fit_transform(X_train)
X_test = sc_X.transform(X_test)
# StandardScaler standardizes each feature to zero mean and unit variance.
# The scaler is fit on the training set only and then applied to the test set, to avoid data leakage.
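
As a quick illustration of what StandardScaler computes, the sketch below checks z = (x - mean) / std per feature on a toy array (the array is made up for illustration; it is not the email data):

[ ]: # Sketch: StandardScaler computes z = (x - mean) / std for each feature.
toy = np.array([[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]]) # toy data, illustration only
manual = (toy - toy.mean(axis=0)) / toy.std(axis=0)
print(np.allclose(StandardScaler().fit_transform(toy), manual)) # expected: True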
[57]: from sklearn.neighbors import KNeighborsClassifier
classifier = KNeighborsClassifier(n_neighbors=5)
classifier.fit(X_train, y_train)
# KNeighborsClassifier from sklearn.neighbors creates and trains a K-Nearest Neighbors (KNN) classifier with k = 5.
[57]: KNeighborsClassifier()
[58]: # KNeighborsClassifier is a class in sklearn.neighbors that implements the K-Nearest Neighbors (KNN)
# algorithm for classification: a test point is assigned the majority class among its k closest
# training points (Euclidean distance by default).
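
To make the majority-vote idea concrete, here is a minimal sketch of a KNN prediction in plain numpy (an illustration of the algorithm's idea, not scikit-learn's actual implementation; the helper name knn_predict is made up):

[ ]: from collections import Counter
def knn_predict(X_tr, y_tr, x, k=5):
    # Euclidean distance from the query point x to every training point
    dists = np.linalg.norm(X_tr - x, axis=1)
    nearest = np.argsort(dists)[:k] # indices of the k closest training points
    # majority vote among the k nearest labels
    return Counter(y_tr[nearest]).most_common(1)[0][0]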
[59]: y_pred = classifier.predict(X_test)
# Use the trained K-Nearest Neighbors classifier to make predictions on the test set.
[60]: from sklearn.metrics import confusion_matrix, accuracy_score
cm = confusion_matrix(y_test, y_pred)
# confusion_matrix from sklearn.metrics summarizes the KNN classifier's performance
# as counts of correct and incorrect predictions per class.
[61]: cm
# cm holds the confusion matrix: rows are the actual classes, columns are the predicted classes.
[61]: array([[866, 248],
[ 16, 422]])
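
Since seaborn and matplotlib are already imported in cell [37], the confusion matrix can also be shown as a heatmap; a small optional sketch (not executed above):

[ ]: sns.heatmap(cm, annot=True, fmt='d', cmap='Blues') # counts per actual/predicted pair
plt.xlabel('Predicted class')
plt.ylabel('Actual class')
plt.title('KNN confusion matrix')
plt.show()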
[49]: from sklearn.metrics import classification_report
cl_report = classification_report(y_test, y_pred)
print(cl_report)
# classification_report from sklearn.metrics reports per-class precision, recall, and F1-score.
              precision    recall  f1-score   support

           0       0.98      0.78      0.87      1114
           1       0.63      0.96      0.76       438

    accuracy                           0.83      1552
   macro avg       0.81      0.87      0.81      1552
weighted avg       0.88      0.83      0.84      1552
[50]: print("Accuracy Score for KNN : ", accuracy_score(y_pred,y_test))
Accuracy Score for KNN : 0.8298969072164949
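
KNN's accuracy also depends on the choice of k; a hedged sketch for analyzing that sensitivity (the candidate k values are arbitrary, and this loop was not run above):

[ ]: for k in [1, 3, 5, 7, 9, 15]: # arbitrary candidate values of n_neighbors
    knn = KNeighborsClassifier(n_neighbors=k).fit(X_train, y_train)
    print(f"k={k}: accuracy = {accuracy_score(y_test, knn.predict(X_test)):.4f}")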
[62]: from sklearn.svm import SVC
from sklearn.metrics import accuracy_score
# Import the Support Vector Classifier (SVC) from sklearn.svm and accuracy_score from sklearn.metrics.
[69]: svc = SVC(C=1.0, kernel='rbf', gamma='auto')
svc.fit(X_train, y_train)
y_pred2 = svc.predict(X_test)
# Create and train a Support Vector Classifier (SVC) with the Radial Basis Function (RBF) kernel,
# then predict on the test set.
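
For reference, the RBF kernel scores the similarity of two points as K(x, x') = exp(-gamma * ||x - x'||^2); with gamma='auto', scikit-learn sets gamma = 1 / n_features. A tiny numpy sketch on made-up vectors:

[ ]: x1, x2 = np.array([1.0, 0.0]), np.array([0.0, 1.0]) # toy vectors, illustration only
gamma = 1 / x1.shape[0] # mirrors gamma='auto' (1 / n_features)
print(np.exp(-gamma * np.sum((x1 - x2) ** 2))) # RBF similarity of x1 and x2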
[64]: from sklearn.metrics import confusion_matrix, accuracy_score
cm = confusion_matrix(y_test, y_pred2)
# Generate the confusion matrix for the predictions made by the Support Vector Classifier (SVC).
[70]: cm
# cm now holds the confusion matrix generated from the SVC model's predictions.
[70]: array([[1106, 8],
[ 95, 343]])
[67]: print("Accuracy Score for SVC : ", accuracy_score(y_pred2,y_test))
Accuracy Score for SVC : 0.9336340206185567
[71]: from sklearn.metrics import classification_report
cl_report = classification_report(y_test, y_pred2)
print(cl_report)
# Generate a classification report for the predictions made by the Support Vector Classifier (SVC).
              precision    recall  f1-score   support

           0       0.92      0.99      0.96      1114
           1       0.98      0.78      0.87       438

    accuracy                           0.93      1552
   macro avg       0.95      0.89      0.91      1552
weighted avg       0.94      0.93      0.93      1552
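
To close the analysis, a short sketch that prints both models' accuracies side by side, reusing y_pred (KNN) and y_pred2 (SVC) from the cells above:

[ ]: for name, pred in [('KNN (k=5)', y_pred), ('SVC (RBF)', y_pred2)]:
    print(f"{name}: accuracy = {accuracy_score(y_test, pred):.4f}")
# From the runs above: SVC (about 0.934) clearly outperforms KNN (about 0.830)
# on this high-dimensional word-count data.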
[ ]: