Digital Computer Concept and Practice: Supervised Learning
Soohyun Yang
College of Engineering
Department of Civil and Environmental Engineering
Types of ML techniques – All learning is learning!
• Supervised learning – “Presence of labels” (our scope)
  Tasks: Classification, Regression
  Examples: Spam classification, Advertisement popularity, Face recognition
• Unsupervised learning – “Absence of labels”
  Task: Clustering
  Examples: Recommender systems (YouTube), Buying habits (grouping customers), Grouping user logs
• Reinforcement learning – “Behavior-driven: feedback loop”
  Examples: Learning to play games (AlphaGo), Industrial simulation, Resource management
https://towardsdatascience.com/what-are-the-types-of-machine-learning-e2b9e5d1756f
Supervised Learning (SL)
A sub-category of machine learning to train algorithms for
predicting outcomes or classifying data via the use of labeled data.
Predicting outcomes => “Regression”; classifying data => “Classification”.
Each sample should be a pair of an input object (typically a vector) and a target value (i.e., label, supervisory signal).
The input object consists of multiple features (generally more than one).
Input features and target (each row = one sample):
Feature 1 (Traffic volume) | Feature 2 (Number of lanes) | Feature 3 (Size of city) | Target (Congestion)
2500   | 4 | Small | No
4000   | 6 | Big   | No
20000  | 6 | Big   | Yes
…      | … | …     | …
50000  | 6 | Big   | Yes
Supervised Learning (con’t)
Our goal is to obtain a generalized learning
model, which makes accurate predictions on a new
dataset.
Otherwise, the model becomes under-fitted
or over-fitted.
https://labelyourdata.com/articles/machine-learning-and-training-data
To achieve this goal, the sample dataset
should be randomly divided into two
parts (Training : Test = 75 : 25 by default).
• Training set : to build a learning model.
• Test set (new dataset) : to evaluate the generalization performance of the built model.
https://www.mathworks.com/discovery/overfitting.html
http://scott.fortmann-roe.com/docs/BiasVariance.html
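As a rough illustration of this split (not from the original slides), the sketch below uses scikit-learn's train_test_split, whose default test_size of 0.25 gives the 75 : 25 division mentioned above; the synthetic data and the KNN model here are placeholders.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Placeholder data: 200 labeled samples with 3 features (stands in for the traffic table above).
X, y = make_classification(n_samples=200, n_features=3, n_informative=3,
                           n_redundant=0, random_state=0)

# Default split is Training : Test = 75 : 25 (test_size=0.25).
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = KNeighborsClassifier(n_neighbors=5)
model.fit(X_train, y_train)                # build the model on the training set only

print("Training accuracy:", model.score(X_train, y_train))
print("Test accuracy    :", model.score(X_test, y_test))
# A large gap (high training, low test accuracy) suggests over-fitting;
# low accuracy on both suggests under-fitting.
```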
Bias-Variance trade-off
A supervised learning dilemma between
‘accurately capturing the regularities in the training data’ and
‘generalizing well to unseen data’.
Bias represents the error caused by a SL model’s overly simple assumptions,
i.e., how far the model’s average prediction lies from the true values.
Variance indicates how sensitive a SL model is to the particular training data it sees,
i.e., how much its predictions change when it is trained on different training sets.
©Wikipedia
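As a compact summary (a standard textbook decomposition, not shown on the original slide), for squared-error loss the expected prediction error on unseen data splits into these two components plus irreducible noise:

```latex
\mathbb{E}\big[(y - \hat{f}(x))^2\big]
  = \underbrace{\big(\mathbb{E}[\hat{f}(x)] - f(x)\big)^2}_{\text{Bias}^2}
  + \underbrace{\mathbb{E}\big[(\hat{f}(x) - \mathbb{E}[\hat{f}(x)])^2\big]}_{\text{Variance}}
  + \underbrace{\sigma^2}_{\text{Irreducible noise}}
```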
Supervised Learning Algorithms (SLAs)
A wide range of SLAs are available and applicable to regression
and/or classification problems.
However, there is no single SLA that works best on all SL problems.
=> Selection of SLA is dependent on the types of problem and data.
Each SLA has its own strengths and weaknesses.
Representative examples:
• K-nearest neighbor (KNN)
• Linear models
• Decision trees => (ensemble) => Random forest
• Naïve Bayes classifiers
• Support vector machines (SVM)
Advantages and disadvantages of SLAs (I)
(CL = classification; RG = regression)

K-nearest neighbor (CL & RG)
• Advantages
  - Very easy to understand
  - Very fast to build the model for a small number of samples
  - Good starting point before executing advanced techniques
• Disadvantages
  - Slow and poor prediction for a large number of features or samples (> 100)
  - Bad performance for sparse datasets (i.e., most features are 0 most of the time)

Linear models (CL & RG)
[CL] Logistic regression / Linear support vector classifier; [RG] Linear / Ridge / Lasso
• Advantages
  - Relatively easy to understand the prediction procedure
  - Very fast to train & predict
  - Work well with large and/or sparse datasets
• Disadvantages
  - Often unclear to interpret the values of model coefficients
Classification
Categorize data into distinct classes or groups,
predicting the labels for new, unseen data.
https://www.datacamp.com/tutorial/k-nearest-neighbor-classification-scikit-learn
KNN algorithm – Classification problem
Let’s apply the KNN algorithm
to solve a classification problem.
1. Data preparation & import :
InClassData_Traffic.csv
(Table preview: each sample is a row with two input features, Feature 1 and Feature 2, and a Target column.)
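The original slide shows this step as a screenshot; a minimal sketch with pandas is given below. The file name comes from the slide, but the column names ('Feature1', 'Feature2', 'Target') are assumptions and should be adjusted to the actual CSV header.

```python
import pandas as pd

# Load the in-class dataset.
df = pd.read_csv('InClassData_Traffic.csv')
print(df.head())

# Column names below are assumed, not confirmed by the slides.
X = df[['Feature1', 'Feature2']].values   # input features
y = df['Target'].values                   # target labels
```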
KNN algorithm – Classification problem (con’t)
2. Data separation into the
training and test sets
• random_state [integer] : A parameter
for the random number generator.
=> To ensure that we get the same split
every time we run the code.
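A minimal sketch of this step, continuing from the loading sketch above (X, y as assumed there); the value 42 for random_state is only an illustrative choice.

```python
from sklearn.model_selection import train_test_split

# Split the samples into training and test sets (default 75 : 25).
# Fixing random_state makes the split reproducible across runs.
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)
print(X_train.shape, X_test.shape)
```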
Feature scaling
The majority of ML algorithms behave much better if features are on
the same scale.
Methods : 1) Standardization, 2) Min-max scaling, 3) Robust scaling
(Scatter plots: Feature 1 on the x-axis, Feature 2 on the y-axis, before and after scaling.)
Feature scaling 1 : Standardization
Center the feature columns at mean 0 with standard deviation 1
=> A standard normal distribution
Easier to learn the weights of individual features.
Maintains useful info about outliers.
Makes the algorithm less sensitive to outliers.
$x^{(i)}_{\text{strd}} = \dfrac{x^{(i)} - \mu_x}{\sigma_x}$
where µx and σx indicate the mean and the standard deviation of a feature x in the training set.
Feature scaling 1 : Standardization (con’t)
Import ‘StandardScaler’
class and create its
instance.
Standardize all samples
& new data based on
the mean & std. of the
training set.
Re-run a KNN (k = 5) with the
standardized data.
The greater the k, the simpler the model
(decision boundary, DB) and the less
sensitive it is to noise in the data.
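The slide shows this step as code screenshots; the sketch below reproduces the described workflow with scikit-learn, reusing the X_train/X_test/y_train/y_test names assumed earlier. The "new data" point is a placeholder, not a value from the slides.

```python
from sklearn.preprocessing import StandardScaler
from sklearn.neighbors import KNeighborsClassifier

# Fit the scaler on the training set only, then apply its mean & std everywhere.
scaler = StandardScaler()
scaler.fit(X_train)
X_train_strd = scaler.transform(X_train)
X_test_strd = scaler.transform(X_test)

# Re-run a KNN classifier (k = 5) on the standardized data.
knn = KNeighborsClassifier(n_neighbors=5)
knn.fit(X_train_strd, y_train)
print("Test accuracy:", knn.score(X_test_strd, y_test))

# Classify a new sample (placeholder values for Feature 1 and Feature 2),
# standardized with the training-set statistics.
new_sample = [[30000, 4]]
print(knn.predict(scaler.transform(new_sample)))
```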
Feature scaling 1 : Standardization (con’t)
[For loop application]
Create each DB as k varies
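One way to reproduce this loop (a sketch, not the original slide code): draw the decision boundary (DB) for a few illustrative k values on a grid over the two standardized features. It assumes the target labels have already been encoded as integers so they can be plotted directly.

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.neighbors import KNeighborsClassifier

# Grid covering the (standardized) feature space, used to draw each decision boundary.
x_min, x_max = X_train_strd[:, 0].min() - 1, X_train_strd[:, 0].max() + 1
y_min, y_max = X_train_strd[:, 1].min() - 1, X_train_strd[:, 1].max() + 1
xx, yy = np.meshgrid(np.linspace(x_min, x_max, 200), np.linspace(y_min, y_max, 200))

k_values = [1, 5, 15]                          # illustrative choices of k
fig, axes = plt.subplots(1, len(k_values), figsize=(12, 4))
for ax, k in zip(axes, k_values):
    knn = KNeighborsClassifier(n_neighbors=k).fit(X_train_strd, y_train)
    Z = knn.predict(np.c_[xx.ravel(), yy.ravel()]).reshape(xx.shape)
    ax.contourf(xx, yy, Z, alpha=0.3)          # shaded regions = predicted classes
    ax.scatter(X_train_strd[:, 0], X_train_strd[:, 1], c=y_train, edgecolor='k')
    ax.set_title(f"k = {k}")
plt.show()
```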
Feature scaling 1 : Standardization (con’t)
[For loop application]
Calculate the accuracy,
as k varies
Define a KNN model in the
for-loop.
Append the score values to
the list.
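A sketch of this accuracy loop, continuing from the standardized arrays above (the range of k values is an assumption):

```python
# Evaluate training and test accuracy as k varies.
k_range = range(1, 31)
train_scores, test_scores = [], []

for k in k_range:
    knn = KNeighborsClassifier(n_neighbors=k)      # define a KNN model inside the loop
    knn.fit(X_train_strd, y_train)
    train_scores.append(knn.score(X_train_strd, y_train))   # append the score values
    test_scores.append(knn.score(X_test_strd, y_test))

plt.plot(k_range, train_scores, label="Training accuracy")
plt.plot(k_range, test_scores, label="Test accuracy")
plt.xlabel("k (number of neighbors)")
plt.ylabel("Accuracy")
plt.legend()
plt.show()
```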
Feature scaling 1 : Standardization (con’t)
[For loop application]
Calculate the bias & variance,
as k varies
Define a KNN model in the
for-loop.
Append the ‘averaged’ bias
& variance values to the list.
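The slides do not show which utility computes the averaged bias and variance; one common option is mlxtend's bias_variance_decomp (pip install mlxtend), sketched here under that assumption and continuing from the arrays above.

```python
from mlxtend.evaluate import bias_variance_decomp

k_range = range(1, 31)
avg_biases, avg_variances = [], []

for k in k_range:
    knn = KNeighborsClassifier(n_neighbors=k)      # define a KNN model inside the loop
    # Averaged loss, bias, and variance over repeated bootstrap training rounds.
    avg_loss, avg_bias, avg_var = bias_variance_decomp(
        knn, X_train_strd, y_train, X_test_strd, y_test,
        loss='0-1_loss', num_rounds=50, random_seed=1)
    avg_biases.append(avg_bias)                    # append the 'averaged' values
    avg_variances.append(avg_var)
```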
Feature scaling 2 : Min-max scaling
Rescale the features to a range of [0, 1].
Useful when we need values in a bounded interval.
$x^{(i)}_{\text{norm}} = \dfrac{x^{(i)} - x_{\min}}{x_{\max} - x_{\min}}$
where xmin and xmax indicate the minimum and maximum of a feature x in the training set.
Feature scaling 2 : Min-max scaling (con’t)
Import ‘MinMaxScaler’ class and
create its instance.
Min-max scale all samples &
new data based on the minimum &
maximum of the training set.
Re-run a KNN (k = 5) with the
min-max scaled data.
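A sketch of this step with scikit-learn's MinMaxScaler, reusing the split arrays assumed earlier:

```python
from sklearn.preprocessing import MinMaxScaler

# Fit on the training set only; scale everything with the training-set min & max.
mm_scaler = MinMaxScaler()
mm_scaler.fit(X_train)
X_train_norm = mm_scaler.transform(X_train)
X_test_norm = mm_scaler.transform(X_test)

knn = KNeighborsClassifier(n_neighbors=5)
knn.fit(X_train_norm, y_train)
print("Test accuracy (min-max scaled):", knn.score(X_test_norm, y_test))
```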
Feature scaling 3 : Robust scaling
Extreme values and outliers get less pronounced.
Useful when we work with small datasets that contain many outliers.
$x^{(i)}_{\text{rbst}} = \dfrac{x^{(i)} - q_2}{q_3 - q_1}$
where q1, q2, and q3 indicate the 1st, 2nd, and 3rd quartiles of a feature x in the training set.
Feature scaling 3 : Robust scaling (con’t)
Import ‘RobustScaler’ class and
create its instance.
Robust scale all samples & new
data based on the quartiles of
the training set.
Re-run a KNN (k = 5) with the
robust-scaled data.
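A sketch of this step with scikit-learn's RobustScaler, again reusing the split arrays assumed earlier:

```python
from sklearn.preprocessing import RobustScaler

# Fit on the training set only; scale everything with the training-set quartiles.
rb_scaler = RobustScaler()
rb_scaler.fit(X_train)
X_train_rbst = rb_scaler.transform(X_train)
X_test_rbst = rb_scaler.transform(X_test)

knn = KNeighborsClassifier(n_neighbors=5)
knn.fit(X_train_rbst, y_train)
print("Test accuracy (robust scaled):", knn.score(X_test_rbst, y_test))
```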
Take-home points (THPs)
-
-
-
…