0% found this document useful (0 votes)

115 views4 pages

October 11, 2020: 0.1 Applied Machine Learning, Module 1: A Simple Classification Task

This document provides an introduction and overview of Module 1 of an applied machine learning course. It loads fruit data, explores the data through visualizations, performs a train-test split, trains a k-Nearest Neighbors classifier on the training data, evaluates the classifier on test data, makes predictions on new data, visualizes the decision boundaries, and analyzes the effects of varying k and the train-test split proportion on classification accuracy.

Uploaded by

engsamerhozin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

115 views4 pages

October 11, 2020: 0.1 Applied Machine Learning, Module 1: A Simple Classification Task

Uploaded by

engsamerhozin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Module 1

October 11, 2020

You are currently looking at version 1.0 of this notebook. To download notebooks and datafiles, as well
as get help on Jupyter notebooks in the Coursera platform, visit the Jupyter Notebook FAQ course resource.

0.1 Applied Machine Learning, Module 1: A simple classification task

0.1.1 Import required modules and load data file
In [1]: %matplotlib notebook
import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
from sklearn.model_selection import train_test_split

fruits = pd.read_table('readonly/fruit_data_with_colors.txt')

In [2]: fruits.head()

Out[2]: fruit_label fruit_name fruit_subtype mass width height color_score

0 1 apple granny_smith 192 8.4 7.3 0.55
1 1 apple granny_smith 180 8.0 6.8 0.59
2 1 apple granny_smith 176 7.4 7.2 0.60
3 2 mandarin mandarin 86 6.2 4.7 0.80
4 2 mandarin mandarin 84 6.0 4.6 0.79

In [4]: # create a mapping from fruit label value to fruit name to make results eas
lookup_fruit_name = dict(zip(fruits.fruit_label.unique(), fruits.fruit_name
lookup_fruit_name

Out[4]: {1: 'apple', 2: 'mandarin', 3: 'orange', 4: 'lemon'}

The file contains the mass, height, and width of a selection of oranges, lemons and apples. The
heights were measured along the core of the fruit. The widths were the widest width perpendicu-
lar to the height.

1
0.1.2 Examining the data
In [5]: # plotting a scatter matrix
from matplotlib import cm

X = fruits[['height', 'width', 'mass', 'color_score']]

y = fruits['fruit_label']
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

cmap = cm.get_cmap('gnuplot')
scatter = pd.scatter_matrix(X_train, c= y_train, marker = 'o', s=40, hist_k

<IPython.core.display.Javascript object>

<IPython.core.display.HTML object>

In [6]: # plotting a 3D scatter plot

from mpl_toolkits.mplot3d import Axes3D

fig = plt.figure()
ax = fig.add_subplot(111, projection = '3d')
ax.scatter(X_train['width'], X_train['height'], X_train['color_score'], c =
ax.set_xlabel('width')
ax.set_ylabel('height')
ax.set_zlabel('color_score')
plt.show()

<IPython.core.display.Javascript object>

<IPython.core.display.HTML object>

0.1.3 Create train-test split

In [7]: # For this example, we use the mass, width, and height features of each fru
X = fruits[['mass', 'width', 'height']]
y = fruits['fruit_label']

# default is 75% / 25% train-test split

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

0.1.4 Create classifier object

In [8]: from sklearn.neighbors import KNeighborsClassifier

knn = KNeighborsClassifier(n_neighbors = 5)

2
0.1.5 Train the classifier (fit the estimator) using the training data
In [ ]: knn.fit(X_train, y_train)

0.1.6 Estimate the accuracy of the classifier on future data, using the test data
In [ ]: knn.score(X_test, y_test)

0.1.7 Use the trained k-NN classifier model to classify new, previously unseen objects
In [ ]: # first example: a small fruit with mass 20g, width 4.3 cm, height 5.5 cm
fruit_prediction = knn.predict([[20, 4.3, 5.5]])
lookup_fruit_name[fruit_prediction[0]]

In [ ]: # second example: a larger, elongated fruit with mass 100g, width 6.3 cm, h
fruit_prediction = knn.predict([[100, 6.3, 8.5]])
lookup_fruit_name[fruit_prediction[0]]

0.1.8 Plot the decision boundaries of the k-NN classifier

In [ ]: from adspy_shared_utilities import plot_fruit_knn

plot_fruit_knn(X_train, y_train, 5, 'uniform') # we choose 5 nearest neig

0.1.9 How sensitive is k-NN classification accuracy to the choice of the ‘k’ parameter?
In [ ]: k_range = range(1,20)
scores = []

for k in k_range:
knn = KNeighborsClassifier(n_neighbors = k)
knn.fit(X_train, y_train)
scores.append(knn.score(X_test, y_test))

plt.figure()
plt.xlabel('k')
plt.ylabel('accuracy')
plt.scatter(k_range, scores)
plt.xticks([0,5,10,15,20]);

0.1.10 How sensitive is k-NN classification accuracy to the train/test split proportion?
In [ ]: t = [0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2]

knn = KNeighborsClassifier(n_neighbors = 5)

plt.figure()

for s in t:

3
scores = []
for i in range(1,1000):
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size
knn.fit(X_train, y_train)
scores.append(knn.score(X_test, y_test))
plt.plot(s, np.mean(scores), 'bo')

plt.xlabel('Training set proportion (%)')

plt.ylabel('accuracy');

In [ ]:

IT Infrastructure Management eI9RGuDM0m
100% (1)
IT Infrastructure Management eI9RGuDM0m
276 pages
Ai Combined Update
No ratings yet
Ai Combined Update
274 pages
K-Nearest Neighbors Classifiers 2025
No ratings yet
K-Nearest Neighbors Classifiers 2025
33 pages
KNN Datacamp
No ratings yet
KNN Datacamp
31 pages
Rahul Raj - Ipynb - Colab
No ratings yet
Rahul Raj - Ipynb - Colab
50 pages
Mod3 Classification
No ratings yet
Mod3 Classification
32 pages
ML Notes
100% (2)
ML Notes
125 pages
K-Nearest Neighbor (KNN) Algorithm For Machine Learning
No ratings yet
K-Nearest Neighbor (KNN) Algorithm For Machine Learning
17 pages
Clustering
No ratings yet
Clustering
6 pages
Record
No ratings yet
Record
23 pages
Lect3 Supervised1
No ratings yet
Lect3 Supervised1
25 pages
Unit-2 Feature Selection
No ratings yet
Unit-2 Feature Selection
92 pages
AIML Record 56
No ratings yet
AIML Record 56
28 pages
W1
No ratings yet
W1
15 pages
Water: Piping & Designing Hydronic Systems
100% (1)
Water: Piping & Designing Hydronic Systems
80 pages
Program 4
No ratings yet
Program 4
3 pages
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
Freshservice Product Deck
100% (2)
Freshservice Product Deck
50 pages
Practical 10 K-Nearest Neighbors Algorithm
No ratings yet
Practical 10 K-Nearest Neighbors Algorithm
16 pages
Lecture7 KNN
No ratings yet
Lecture7 KNN
40 pages
1 Supervise Learning (KNN) (Solution) : 1.1 Distance Measuring in Machine Learning
No ratings yet
1 Supervise Learning (KNN) (Solution) : 1.1 Distance Measuring in Machine Learning
14 pages
Thinkpad Yoga 260 User Guide
No ratings yet
Thinkpad Yoga 260 User Guide
174 pages
Pratham ML
No ratings yet
Pratham ML
14 pages
Unit 2 ML
No ratings yet
Unit 2 ML
93 pages
EX - NO:3: Algorithm
No ratings yet
EX - NO:3: Algorithm
11 pages
4801instant Download Spec (Hell's Handlers MC Florida Chapter Book 2) Lilly Atlas PDF All Chapters
100% (1)
4801instant Download Spec (Hell's Handlers MC Florida Chapter Book 2) Lilly Atlas PDF All Chapters
66 pages
ISYE6501 Homework 2
No ratings yet
ISYE6501 Homework 2
11 pages
Lab Report TikTok 3
100% (1)
Lab Report TikTok 3
25 pages
All Passwords
No ratings yet
All Passwords
31 pages
ML Lecture For School Students
No ratings yet
ML Lecture For School Students
8 pages
Module 4 - Classification
No ratings yet
Module 4 - Classification
10 pages
K-Nearest Neighbors Clearly Explained
No ratings yet
K-Nearest Neighbors Clearly Explained
11 pages
VLSM Workbook Instructors Edition - V1 - 0
100% (4)
VLSM Workbook Instructors Edition - V1 - 0
27 pages
ML Practical 205160694034
No ratings yet
ML Practical 205160694034
33 pages
Statement Theory and Syntax
No ratings yet
Statement Theory and Syntax
14 pages
Worksheet Classification2
No ratings yet
Worksheet Classification2
14 pages
Machine Learning Lab Manual 7
100% (1)
Machine Learning Lab Manual 7
8 pages
Updated K-Nearest Neighbors in Machine Learning
No ratings yet
Updated K-Nearest Neighbors in Machine Learning
11 pages
Programs Lab Bca
No ratings yet
Programs Lab Bca
16 pages
Python Code For KNN Classifier 1. Initial Message
No ratings yet
Python Code For KNN Classifier 1. Initial Message
7 pages
Python For Beginners
From Everand
Python For Beginners
Célio Azevedo
No ratings yet
Fire Extinguishers
No ratings yet
Fire Extinguishers
31 pages
Beginner's Guide To Implementing A Simple Machine Learning Project - DeV Community
No ratings yet
Beginner's Guide To Implementing A Simple Machine Learning Project - DeV Community
9 pages
Basics of Boiler and HRSG Design by Brad Buecker
No ratings yet
Basics of Boiler and HRSG Design by Brad Buecker
183 pages
Machine Learning With Python - Machine Learning Algorithms - KNN
No ratings yet
Machine Learning With Python - Machine Learning Algorithms - KNN
15 pages
Salesforce Entitlements Implementation Guide
No ratings yet
Salesforce Entitlements Implementation Guide
50 pages
DP-100 Instruction Manual
No ratings yet
DP-100 Instruction Manual
9 pages
KNN - Predictive Analysis
No ratings yet
KNN - Predictive Analysis
6 pages
Lab 1 - Machine Learning with Python - ML Engineering مهم
No ratings yet
Lab 1 - Machine Learning with Python - ML Engineering مهم
10 pages
Dip Project File
No ratings yet
Dip Project File
4 pages
KNN Example
No ratings yet
KNN Example
4 pages
Area Efficient Low Power Image Watermarking Architecture Using Faithful Approximation and Reversible Logic
No ratings yet
Area Efficient Low Power Image Watermarking Architecture Using Faithful Approximation and Reversible Logic
35 pages
DS Report
No ratings yet
DS Report
11 pages
Aam Codes
No ratings yet
Aam Codes
8 pages
Guide 7 A
No ratings yet
Guide 7 A
21 pages
On The Design Details of SS/PBCH, Signal Generation and PRACH in 5G-NR
No ratings yet
On The Design Details of SS/PBCH, Signal Generation and PRACH in 5G-NR
21 pages
Ds Europe An401plus Series
No ratings yet
Ds Europe An401plus Series
20 pages
Computer Systems, Hardware and Software
No ratings yet
Computer Systems, Hardware and Software
27 pages
IGL 7.2.1 Database Setup and Management Guide
No ratings yet
IGL 7.2.1 Database Setup and Management Guide
35 pages
KNN RahayuFitria 19510004
No ratings yet
KNN RahayuFitria 19510004
5 pages
K - NN Classification
No ratings yet
K - NN Classification
4 pages
Sleeve
No ratings yet
Sleeve
16 pages
Using Raspberry Pi 4 With Ubuntu Ros Rplidar Ardui
No ratings yet
Using Raspberry Pi 4 With Ubuntu Ros Rplidar Ardui
19 pages
A Complete Guide To KNN
No ratings yet
A Complete Guide To KNN
16 pages
K Nearest Neighbours
No ratings yet
K Nearest Neighbours
4 pages
KNN Classifier
No ratings yet
KNN Classifier
5 pages
2024LLM Inference Bench ArXiv
No ratings yet
2024LLM Inference Bench ArXiv
18 pages
Naive
No ratings yet
Naive
2 pages
83 Sklearn Pipeline
No ratings yet
83 Sklearn Pipeline
8 pages
Dsbda 10
No ratings yet
Dsbda 10
5 pages
Hyperparameter Tuning For K
No ratings yet
Hyperparameter Tuning For K
2 pages
HALO Overview
No ratings yet
HALO Overview
15 pages
Unit 1:: Linear Classifier and Generalizations
No ratings yet
Unit 1:: Linear Classifier and Generalizations
17 pages
Yashveer PC Practical File
No ratings yet
Yashveer PC Practical File
14 pages
Common Fruits Identification
No ratings yet
Common Fruits Identification
5 pages
CSO Previous Year Question Paper (2019-15)
No ratings yet
CSO Previous Year Question Paper (2019-15)
10 pages
Fspas 09 1073578
No ratings yet
Fspas 09 1073578
13 pages
Question Bank - 345 Mod
No ratings yet
Question Bank - 345 Mod
12 pages
Direct Questions Ms
No ratings yet
Direct Questions Ms
8 pages
DSU - Electricity Bill Calculator Using C
No ratings yet
DSU - Electricity Bill Calculator Using C
9 pages
Find Approximate Value of The Below Given Questions
No ratings yet
Find Approximate Value of The Below Given Questions
6 pages
St. John College of Engineering and Management, Palghar - Maharashtra
No ratings yet
St. John College of Engineering and Management, Palghar - Maharashtra
11 pages
Class 01 (Introduction of Adobe Illustrator)
No ratings yet
Class 01 (Introduction of Adobe Illustrator)
8 pages
Iaa202 - Lab6 - Ia1803 - Le Tien Long - He171603
No ratings yet
Iaa202 - Lab6 - Ia1803 - Le Tien Long - He171603
7 pages
SCH 225 e SCD Gen 022 Trio e Series Radio Modem Datasheet
No ratings yet
SCH 225 e SCD Gen 022 Trio e Series Radio Modem Datasheet
4 pages
Part 1 - Digital Imaging
No ratings yet
Part 1 - Digital Imaging
3 pages
KNN Lab
No ratings yet
KNN Lab
4 pages
Lab 1 Assignment
No ratings yet
Lab 1 Assignment
1 page
It - S All About Neighbors - Completed
No ratings yet
It - S All About Neighbors - Completed
14 pages
ML Lab2 PGM
No ratings yet
ML Lab2 PGM
3 pages
Mid Term Exam Networking
No ratings yet
Mid Term Exam Networking
3 pages
Week10 KNN Practical
No ratings yet
Week10 KNN Practical
4 pages
Amazing Java: Learn Java Quickly
From Everand
Amazing Java: Learn Java Quickly
Andrei Besedin
No ratings yet
K Nearest Neighbors
No ratings yet
K Nearest Neighbors
5 pages
Wall Hanger Brackets Bracket Adapters: (Red Brackets Are Galvanized)
No ratings yet
Wall Hanger Brackets Bracket Adapters: (Red Brackets Are Galvanized)
1 page
Fruits Classification Using Convolutional Neural Network
No ratings yet
Fruits Classification Using Convolutional Neural Network
6 pages
ML Algorithms
100% (1)
ML Algorithms
1 page
Yash Raj Artificial Intelligence New
No ratings yet
Yash Raj Artificial Intelligence New
9 pages