0% found this document useful (0 votes)

9 views2 pages

Utkarsh

The document outlines a process for analyzing two datasets using Python in Google Colab, specifically focusing on the Iris dataset and a movie dataset. It details steps for data loading, preprocessing, and applying a K-Nearest Neighbors (KNN) classifier to predict species in the Iris dataset, achieving a test accuracy of 95.65%. The analysis includes splitting the data into training, validation, and test sets, as well as determining the optimal number of neighbors (K) for the KNN model.

Uploaded by

bhaishaab175

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views2 pages

Utkarsh

Uploaded by

bhaishaab175

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

29/01/2025, 19:19 29-01-2025 - Colab

from google.colab import drive

drive.mount('/content/gdrive')

Mounted at /content/gdrive

import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score
from sklearn import datasets

import matplotlib.pyplot as plt

path = "/content/drive/MyDrive/dataset/exp 2/datairis.csv"

df=pd.read_csv(path)
df.head(10)

Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa

1 2 4.9 3.0 1.4 0.2 Iris-setosa

2 3 4.7 3.2 1.3 0.2 Iris-setosa

3 4 4.6 3.1 1.5 0.2 Iris-setosa

4 5 5.0 3.6 1.4 0.2 Iris-setosa

5 6 5.4 3.9 1.7 0.4 Iris-setosa

6 7 4.6 3.4 1.4 0.3 Iris-setosa

7 8 5.0 3.4 1.5 0.2 Iris-setosa

8 9 4.4 2.9 1.4 0.2 Iris-setosa

9 10 4.9 3.1 1.5 0.1 Iris-setosa

Next steps: Generate code with df toggle_off View recommended plots New interactive sheet

path2 = "/content/drive/MyDrive/dataset/exp 2/datasetmovies (1).csv"

df=pd.read_csv(path2)
df.head(10)

No. of action scene No.of comedy scene Class/Label/categories

0 100 15 Action

1 20 95 comedy

2 90 5 Action

3 10 85 Comedy

Next steps: Generate code with df toggle_off View recommended plots New interactive sheet

# Load the Iris dataset

iris = datasets.load_iris()
X, y = iris.data, iris.target

# Split into training (70%), validation (15%), and testing (15%)

X_train, X_temp, y_train, y_temp = train_test_split(X, y, test_size=0.3, random_state=42, stratify=y)
X_val, X_test, y_val, y_test = train_test_split(X_temp, y_temp, test_size=0.5, random_state=42, stratify=y_temp)

# Check data shapes

print(f"Train size: {X_train.shape}, Validation size: {X_val.shape}, Test size: {X_test.shape}")

Train size: (105, 4), Validation size: (22, 4), Test size: (23, 4)

scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)
X_val = scaler.transform(X_val)

https://colab.research.google.com/drive/1IQUGVJGWYn7xnTY76KNnjGNmYlyQmhcB#scrollTo=dKeo40EOqbnr&printMode=true 1/2
29/01/2025, 16:15 29-01-2025 - Colab
X_test = scaler.transform(X_test)

best_k = 1
best_accuracy = 0

for k in range(1, 21):

knn = KNeighborsClassifier(n_neighbors=k)
knn.fit(X_train, y_train)
val_preds = knn.predict(X_val)
val_accuracy = accuracy_score(y_val, val_preds)

if val_accuracy > best_accuracy:

best_accuracy = val_accuracy
best_k = k

print(f"Best K found: {best_k} with validation accuracy: {best_accuracy:.4f}")

Best K found: 1 with validation accuracy: 0.9091

final_knn = KNeighborsClassifier(n_neighbors=best_k)
final_knn.fit(X_train, y_train)
test_preds = final_knn.predict(X_test)
test_accuracy = accuracy_score(y_test, test_preds)

print(f"Test accuracy using best K ({best_k}): {test_accuracy:.4f}")

Test accuracy using best K (1): 0.9565

2/2

Technical Handbook Abarth 500 A.C. and L.E
100% (1)
Technical Handbook Abarth 500 A.C. and L.E
52 pages
SVM and Kmeans - Iris Dataset - Ipynb - Colab
No ratings yet
SVM and Kmeans - Iris Dataset - Ipynb - Colab
5 pages
Example - LinearDiscriminantAnalysis - Ipynb Colaboratory
No ratings yet
Example - LinearDiscriminantAnalysis - Ipynb Colaboratory
2 pages
Practical 5
No ratings yet
Practical 5
11 pages
Lab Manual
No ratings yet
Lab Manual
32 pages
ML 1
No ratings yet
ML 1
4 pages
Iris - Copy1 - Jupyter Notebook
No ratings yet
Iris - Copy1 - Jupyter Notebook
8 pages
KNN ALGORITHM - Ipynb - Colab
No ratings yet
KNN ALGORITHM - Ipynb - Colab
4 pages
DS 6
No ratings yet
DS 6
2 pages
PR
No ratings yet
PR
17 pages
22MCA1008 - Varun ML LAB ASSIGNMENTS
100% (1)
22MCA1008 - Varun ML LAB ASSIGNMENTS
41 pages
Remaining ML Program
No ratings yet
Remaining ML Program
12 pages
TranMinhTu1 bt2 2
No ratings yet
TranMinhTu1 bt2 2
5 pages
Lab Manual ML
No ratings yet
Lab Manual ML
23 pages
30 - 11 - 24 - Ensemble - Based Learning
No ratings yet
30 - 11 - 24 - Ensemble - Based Learning
1 page
SVM K NN MLP With Sklearn Jupyter NoteBo
No ratings yet
SVM K NN MLP With Sklearn Jupyter NoteBo
22 pages
Iris - Regression - Jupyter Notebook
No ratings yet
Iris - Regression - Jupyter Notebook
5 pages
It - S All About Neighbors - Completed
No ratings yet
It - S All About Neighbors - Completed
14 pages
EX - NO:3: Algorithm
No ratings yet
EX - NO:3: Algorithm
11 pages
Scikit-Learn: Scikit-Learn Is An Open Source Python Library That
100% (1)
Scikit-Learn: Scikit-Learn Is An Open Source Python Library That
1 page
5 Random Forest - Jupyter Notebook
No ratings yet
5 Random Forest - Jupyter Notebook
2 pages
Decision Tree
No ratings yet
Decision Tree
2 pages
AML Lab
No ratings yet
AML Lab
14 pages
ML N PY Programs
No ratings yet
ML N PY Programs
17 pages
Decision Tree Exp 5 DWM
No ratings yet
Decision Tree Exp 5 DWM
2 pages
B2 40 Practical 5A
No ratings yet
B2 40 Practical 5A
6 pages
K-Nearest Neighbors Classifiers 2025
No ratings yet
K-Nearest Neighbors Classifiers 2025
33 pages
Mlpy 2
No ratings yet
Mlpy 2
18 pages
ABHAYMLFILE
No ratings yet
ABHAYMLFILE
16 pages
Logistic Multiclass Classification
No ratings yet
Logistic Multiclass Classification
2 pages
ML Yogesh
No ratings yet
ML Yogesh
23 pages
Heart Disease Prediction - Colab
No ratings yet
Heart Disease Prediction - Colab
18 pages
LAB-4 Report
No ratings yet
LAB-4 Report
21 pages
ML Expt 4
No ratings yet
ML Expt 4
4 pages
Shobit Sharma (2124399) ML Lab File PDF
No ratings yet
Shobit Sharma (2124399) ML Lab File PDF
19 pages
Iii Aid - ML
No ratings yet
Iii Aid - ML
30 pages
Lab06 KNN 01
No ratings yet
Lab06 KNN 01
3 pages
4 Decision Tree - Jupyter Notebook
No ratings yet
4 Decision Tree - Jupyter Notebook
2 pages
ML Journal
No ratings yet
ML Journal
37 pages
To Study About Numpy, Pandas and Matplotlib Libraries in Python
No ratings yet
To Study About Numpy, Pandas and Matplotlib Libraries in Python
21 pages
1
No ratings yet
1
13 pages
Code and Output of Cancer Detection Model
No ratings yet
Code and Output of Cancer Detection Model
13 pages
ML L - Ab
No ratings yet
ML L - Ab
13 pages
Dsbda Assig 6 Data Analytcs 3
No ratings yet
Dsbda Assig 6 Data Analytcs 3
6 pages
Strangers
No ratings yet
Strangers
8 pages
Ai/Ml Lab-4: Name: Pratik Jadhav PRN: 20190802050
No ratings yet
Ai/Ml Lab-4: Name: Pratik Jadhav PRN: 20190802050
5 pages
Ipynb - Colab01
No ratings yet
Ipynb - Colab01
4 pages
ML Short Code - Under Updating
No ratings yet
ML Short Code - Under Updating
4 pages
Nitin ML Assignment 1
No ratings yet
Nitin ML Assignment 1
18 pages
Lab Session 10
No ratings yet
Lab Session 10
9 pages
Bagging, Random Forest, Gradient Boost, AdaBoost & PCA
No ratings yet
Bagging, Random Forest, Gradient Boost, AdaBoost & PCA
8 pages
Aiml Ex 4-7
No ratings yet
Aiml Ex 4-7
8 pages
AAM PR QB
No ratings yet
AAM PR QB
13 pages
4c Sklearn-Classification-Regression-Bkhw-Spring 2019
No ratings yet
4c Sklearn-Classification-Regression-Bkhw-Spring 2019
20 pages
Assignment No - 6-1
100% (1)
Assignment No - 6-1
3 pages
ML 3
No ratings yet
ML 3
24 pages
ML Lab
No ratings yet
ML Lab
7 pages
Comparison of Classifiers
No ratings yet
Comparison of Classifiers
6 pages
ML II Lab
No ratings yet
ML II Lab
5 pages
How to a Developers Guide to 4k: Developer edition, #3
From Everand
How to a Developers Guide to 4k: Developer edition, #3
Xinc Cyberwizard
No ratings yet
Advanced Multiplayer Game Development with Ureal Engine 5: A Comprehensive Guide to C++ Scripting
From Everand
Advanced Multiplayer Game Development with Ureal Engine 5: A Comprehensive Guide to C++ Scripting
Vladimir Kiselev
No ratings yet
Shops & Estt
No ratings yet
Shops & Estt
4 pages
Igcse
No ratings yet
Igcse
9 pages
Keberhasilan Media Promosi Judi Online Dalam Menarik Minat Masyarakat-1
No ratings yet
Keberhasilan Media Promosi Judi Online Dalam Menarik Minat Masyarakat-1
10 pages
CTPAT Job Aid - Personnel Training Checklist Sample - October 2021
No ratings yet
CTPAT Job Aid - Personnel Training Checklist Sample - October 2021
4 pages
Ericsson India Private Limited VS Reliance Telecom Limited NCLT MUMBAI
No ratings yet
Ericsson India Private Limited VS Reliance Telecom Limited NCLT MUMBAI
30 pages
Pro Proctor User Guide
No ratings yet
Pro Proctor User Guide
24 pages
Waste Valorization For Bioenergy and Bioproducts Hwai Chyuan Ong - The Latest Ebook Edition With All Chapters Is Now Available
100% (3)
Waste Valorization For Bioenergy and Bioproducts Hwai Chyuan Ong - The Latest Ebook Edition With All Chapters Is Now Available
50 pages
IELTS Simon Speaking Part 3 9dee133876
No ratings yet
IELTS Simon Speaking Part 3 9dee133876
37 pages
Chapter Test: QS - Explain How You Found Your Answer
No ratings yet
Chapter Test: QS - Explain How You Found Your Answer
1 page
University of Okara: Advertisement No. 2/2020
No ratings yet
University of Okara: Advertisement No. 2/2020
3 pages
Gulfood Exhibitor List N 1
No ratings yet
Gulfood Exhibitor List N 1
19 pages
Inputs and Outputs List Page:1/21: Example-9: Sequential Control of Induction Motors
No ratings yet
Inputs and Outputs List Page:1/21: Example-9: Sequential Control of Induction Motors
7 pages
MB-310 Dynamics 365 Finance
No ratings yet
MB-310 Dynamics 365 Finance
13 pages
Family Emergency Plan
No ratings yet
Family Emergency Plan
2 pages
Hyperlipidemia 1
No ratings yet
Hyperlipidemia 1
54 pages
NTFK VOL 104 2 (3) 2017 - Henri Rikander - The Use of Electroshock Weapons by The Finnish Police 2016
No ratings yet
NTFK VOL 104 2 (3) 2017 - Henri Rikander - The Use of Electroshock Weapons by The Finnish Police 2016
34 pages
Imagine
No ratings yet
Imagine
5 pages
ENGLISH 9 Q1 Week 1 2
No ratings yet
ENGLISH 9 Q1 Week 1 2
10 pages
Math of Finance
No ratings yet
Math of Finance
33 pages
Patent Concept Foe Entreprenenur
No ratings yet
Patent Concept Foe Entreprenenur
6 pages
Gravity Light Project
No ratings yet
Gravity Light Project
16 pages
M.sc. Chemistry
No ratings yet
M.sc. Chemistry
20 pages
Ans-C01 7
No ratings yet
Ans-C01 7
17 pages
David Wall VP Hse & Im EPT - HSE, Operations & Engineering: Confidential BP-HZN - BLYOO196756
No ratings yet
David Wall VP Hse & Im EPT - HSE, Operations & Engineering: Confidential BP-HZN - BLYOO196756
3 pages
C' Ifornia: California Code Ol, Regulations
No ratings yet
C' Ifornia: California Code Ol, Regulations
62 pages
Raghuvamsa CantoV English Meaning
No ratings yet
Raghuvamsa CantoV English Meaning
69 pages
Cost Optimization of Reinforced Concrete Rectangular Beams
100% (1)
Cost Optimization of Reinforced Concrete Rectangular Beams
12 pages
Tense Changes in Reported Speech Rules, Examples, and Usage
No ratings yet
Tense Changes in Reported Speech Rules, Examples, and Usage
1 page
Computer (Eng) SSC CHSL 2024 All 70 Questions (RBE)
No ratings yet
Computer (Eng) SSC CHSL 2024 All 70 Questions (RBE)
8 pages

Utkarsh

Uploaded by

Utkarsh

Uploaded by

29/01/2025, 19:19 29-01-2025 - Colab

from google.colab import drive

import matplotlib.pyplot as plt

path = "/content/drive/MyDrive/dataset/exp 2/datairis.csv"

Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa

1 2 4.9 3.0 1.4 0.2 Iris-setosa

2 3 4.7 3.2 1.3 0.2 Iris-setosa

3 4 4.6 3.1 1.5 0.2 Iris-setosa

4 5 5.0 3.6 1.4 0.2 Iris-setosa

5 6 5.4 3.9 1.7 0.4 Iris-setosa

6 7 4.6 3.4 1.4 0.3 Iris-setosa

7 8 5.0 3.4 1.5 0.2 Iris-setosa

8 9 4.4 2.9 1.4 0.2 Iris-setosa

9 10 4.9 3.1 1.5 0.1 Iris-setosa

path2 = "/content/drive/MyDrive/dataset/exp 2/datasetmovies (1).csv"

No. of action scene No.of comedy scene Class/Label/categories

# Load the Iris dataset

# Split into training (70%), validation (15%), and testing (15%)

# Check data shapes

for k in range(1, 21):

if val_accuracy > best_accuracy:

print(f"Best K found: {best_k} with validation accuracy: {best_accuracy:.4f}")

Best K found: 1 with validation accuracy: 0.9091

print(f"Test accuracy using best K ({best_k}): {test_accuracy:.4f}")

Test accuracy using best K (1): 0.9565

You might also like