The document discusses various machine learning techniques, focusing on supervised learning methods such as regression and the K-nearest neighbors (KNN) algorithm. It outlines the steps involved in applying KNN to regression problems, including data preparation, model fitting, and performance evaluation. Additionally, it highlights the importance of understanding model performance metrics such as R² and the mean absolute error.


035.001, Spring 2024

Digital Computer Concept and Practice


Supervised Learning (2)

Soohyun Yang

College of Engineering
Department of Civil and Environmental Engineering
Types of ML techniques – All learning is learning!
Our scope: supervised learning.

• Supervised learning : “Presence of labels”
– Classification : spam classification, face recognition
– Regression : advertisement popularity
• Unsupervised learning : “Absence of labels”
– Clustering : recommender systems (YT), buying habits (group customers), grouping user logs
• Reinforcement learning : “Behavior-driven : feedback loop”
– Learning to play games (AlphaGo), industrial simulation, resource management
https://towardsdatascience.com/what-are-the-types-of-machine-learning-e2b9e5d1756f
Regression
 A statistical method for determining the relationship between
a dependent variable (target) and one or more independent
variables (features), used to predict a target value on a continuous scale
for new data.

 Algorithms in our scope

• K-nearest neighbors (KNN)
• Linear regression (LR) => Simple, Polynomial, Multiple
• Ridge regression, Lasso regression => Regularization
• Decision trees === // Ensemble // ===> Random forest
KNN algorithm – for Regression
 Principles:
1. Choose the (odd) number k and a distance metric (typically Euclidean).
2. Calculate the distance from the query point to all training data points.
3. Find the k nearest neighbors of the query point.
4. Determine the predicted value by averaging the target values of those neighbors.
[Figure: example predictions for k=1 and k=3]

Mueller & Guido (2017)
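The four principles above can be sketched directly in NumPy. This is a minimal illustration with made-up data, not the class implementation:

```python
import numpy as np

def knn_regress(x_query, X_train, y_train, k=3):
    """Predict a target for x_query by averaging the targets
    of its k nearest training points (Euclidean distance)."""
    # 2. Distance from the query point to every training point
    dists = np.sqrt(((X_train - x_query) ** 2).sum(axis=1))
    # 3. Indices of the k nearest neighbors
    nearest = np.argsort(dists)[:k]
    # 4. Average the neighbors' target values
    return y_train[nearest].mean()

X_train = np.array([[1.0], [2.0], [3.0], [10.0]])
y_train = np.array([10.0, 20.0, 30.0, 100.0])
print(knn_regress(np.array([2.5]), X_train, y_train, k=3))  # 20.0
```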
KNN algorithm – Regression problem
 Let’s apply the KNN algorithm to solve a regression problem.
 1. Data preparation & import :
InClassData_Traffic.csv
[Table: rows are samples; columns are the input Feature 1 and the Target]
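The import step might look like the following. Since InClassData_Traffic.csv is not reproduced in the slides, a small synthetic stand-in file is written first, and the column names "Feature1" and "Target" are assumptions based on the slide's table:

```python
import pandas as pd

# Stand-in for the class file (real values and column names assumed).
pd.DataFrame({"Feature1": [10, 30, 50, 70],
              "Target":   [120, 450, 848, 900]}
             ).to_csv("InClassData_Traffic.csv", index=False)

# 1. Data preparation & import
df = pd.read_csv("InClassData_Traffic.csv")
X = df["Feature1"].values   # input feature
y = df["Target"].values     # target, e.g., traffic volume [vehicles/hr]
print(df.shape)  # (4, 2)
```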
KNN algorithm – Regression problem (con’t)
 2. Separate the data into
training and test sets.
• random_state [integer] : A seed for
the random number generator, making the split reproducible.
• The ‘stratification’ process is NOT needed
for regression problems.

 3. Reshape the 1-D training sets
into 2-D arrays.
>>Note : Feature arrays must be in ‘2-D array’
format to use the scikit-learn library.
>>Tip : Index ‘-1’ in the reshape() function
means that its length is inferred after
satisfying the already user-defined dimension.
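Steps 2 and 3 can be sketched as follows, using synthetic stand-in data in place of the class CSV:

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Synthetic stand-in (the real values come from InClassData_Traffic.csv)
rng = np.random.RandomState(0)
X = np.arange(20, dtype=float)          # Feature 1
y = X * 10 + rng.normal(0, 5, 20)       # Target

# 2. Split; no stratification is needed for a continuous target.
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# 3. reshape(-1, 1): -1 lets NumPy infer the row count
#    after the single feature column is fixed.
X_train = X_train.reshape(-1, 1)
X_test = X_test.reshape(-1, 1)
print(X_train.shape, X_test.shape)  # (15, 1) (5, 1)
```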
KNN algorithm – Regression problem (con’t)
 4. Import the ‘KNeighborsRegressor’
class and create an instance of it.
• n_neighbors [integer] : A parameter
that sets the number of neighbors, k.
KNN algorithm – Regression problem (con’t)

 4. Fit the regression model using the training set (fit method).
=> Fitting simply stores the training set, which is used to find neighbors at prediction time.
 5. Make predictions on the test data (predict method).
 6. Evaluate the model’s performance (score method
=> via the coefficient of determination, R², 결정계수).
• R² ≤ 1 : Higher value => better performance in predicting the test set’s outcomes (1 is perfect; it can even be negative for very poor models).
=> An R² of ~0.87 means the model explains about 87 % of the variance in the test set’s target values!
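Steps 4–6 can be sketched end to end with scikit-learn. The data here is again a synthetic stand-in, so the score will not match the slide's ~0.87:

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsRegressor

# Synthetic stand-in for the traffic data
rng = np.random.RandomState(0)
X = rng.uniform(0, 100, 80).reshape(-1, 1)
y = 5 * X.ravel() + rng.normal(0, 20, 80)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

knr = KNeighborsRegressor(n_neighbors=5)
knr.fit(X_train, y_train)           # 4. stores the training set
y_pred = knr.predict(X_test)        # 5. averages the 5 nearest neighbors
r2 = knr.score(X_test, y_test)      # 6. coefficient of determination
print(round(r2, 3))
```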
KNN algorithm – Regression problem (con’t)

 [Tip] Let’s estimate a complementary metric to understand the model’s
performance more intuitively.
=> Mean absolute error [mae] : the average of the absolute differences between
the predicted and the real target values for a given data set.
=> With mae ≈ 83, the predicted targets differ
from the real ones by about 83 vehicles/hr on average.
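The MAE computation can be sketched with scikit-learn's metrics module; the arrays here are made-up values, not the class results:

```python
import numpy as np
from sklearn.metrics import mean_absolute_error

y_test = np.array([500.0, 620.0, 840.0, 910.0])  # real targets [vehicles/hr]
y_pred = np.array([480.0, 700.0, 800.0, 930.0])  # model predictions

# MAE: mean of |predicted - real|, in the target's own units,
# which makes it easier to interpret than R².
mae = mean_absolute_error(y_test, y_pred)
print(mae)  # 40.0
```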
KNN algorithm – Regression problem (con’t)
 Let’s evaluate whether the model trained with k=5 is over-
or under-fitted.
• Greater k => simpler model (smoother decision boundary),
less sensitive to noise in the data.

 What do you get with [knr.n_neighbors = 21]?


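One way to probe over- vs. under-fitting is to compare training and test scores across several k values, again on synthetic stand-in data:

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsRegressor

rng = np.random.RandomState(0)
X = rng.uniform(0, 100, 80).reshape(-1, 1)
y = 5 * X.ravel() + rng.normal(0, 20, 80)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Small k: training score near 1 but noisy test score (overfitting).
# Very large k: both scores drop as the model oversmooths (underfitting).
for k in (1, 5, 21):
    knr = KNeighborsRegressor(n_neighbors=k).fit(X_train, y_train)
    print(k,
          round(knr.score(X_train, y_train), 3),
          round(knr.score(X_test, y_test), 3))
```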
Trained model application to a new data (1)
 Let’s predict a target
value for a new data point
with [feature1 = 50].
=> The predicted value is 847.8.
Does the result satisfy you?
If not, why?

Trained model application to a new data (2)
 Let’s predict a target value for another new data point with [feature1 = 100].
 The predicted outcome is again 847.8, identical to that of the first new data point.
=> Does it make sense? Why did it happen? How can we resolve it?
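This behavior can be reproduced with a small made-up data set: once a query lies beyond the training feature range, every farther query shares the same nearest neighbors, so KNN keeps returning the same average. KNN cannot extrapolate:

```python
import numpy as np
from sklearn.neighbors import KNeighborsRegressor

# Training features span 0–40 only (made-up values).
X_train = np.array([[0.0], [10.0], [20.0], [30.0], [40.0]])
y_train = np.array([0.0, 100.0, 200.0, 300.0, 400.0])

knr = KNeighborsRegressor(n_neighbors=3).fit(X_train, y_train)

# Both queries lie beyond the training range, so both have the same
# 3 nearest neighbors (20, 30, 40) and get the same prediction.
p50, p100 = knr.predict([[50.0], [100.0]])
print(p50, p100)  # 300.0 300.0
```

A model with a parametric form, such as linear regression, would be one way to resolve this, since it can extend its fitted trend beyond the training range.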
Take-home points (THPs)
- KNN regression predicts a value by averaging the target values of the k nearest training points.
- Model performance can be evaluated with R² (score method) and, more intuitively, with the mean absolute error.
- Greater k gives a simpler model that is less sensitive to noise; too small a k overfits, too large a k underfits.
- KNN cannot extrapolate: queries outside the training feature range share the same neighbors and receive the same prediction.
