Lesson 4 - Supervised Learning
ARTIFICIAL INTELLIGENCE
BUI NGOC DUNG
GRAB PROBLEM
An AI engineer took a Grab ride to visit his girlfriend. Unfortunately, the Grab application crashed and could not calculate the fare for the trip.
Km   Price
 2    13
 7    35
 9    41
 3    19
10    45
 6    28
 1    10
 8    55
Luckily, since the passenger is an AI engineer and the application has saved his travel history, he can build a model that predicts the price from the number of kilometers traveled today.
TRAINING PROGRESS
[Training pipeline: the features $x^{(i)}$ of the 8 data points go into the Model, which outputs predictions $\hat{y}^{(i)}$; the Cost Function compares each prediction with its label $y^{(i)}$.]
The parameters are initialized randomly at the first step, e.g. $\hat{y}^{(i)} = h_\theta(x^{(i)}) = 2x^{(i)} + 3$ (so for $x^{(1)} = 2$: $2 \cdot 2 + 3 = 7$):

Km (x)   Prediction ŷ = 2x + 3   Label y
 2        7                       13
 7       17                       35
 9       21                       41
 3        9                       19
10       23                       45
 6       15                       28
 1        5                       10
 8       19                       55
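In code, this first forward pass might look like the following sketch (assuming NumPy; the "random" initialization is fixed here to the slide's example values $\theta_1 = 2$, $\theta_0 = 3$):

```python
import numpy as np

# Toy Grab data: kilometers traveled and observed prices
x = np.array([2, 7, 9, 3, 10, 6, 1, 8], dtype=float)
y = np.array([13, 35, 41, 19, 45, 28, 10, 55], dtype=float)

# Parameters "initialized randomly" at the first step
theta0, theta1 = 3.0, 2.0

y_hat = theta1 * x + theta0   # model predictions, e.g. 2*2 + 3 = 7
print(y_hat)                  # [ 7. 17. 21.  9. 23. 15.  5. 19.]
print(y_hat - y)              # errors fed to the cost function
```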
DATA VISUALIZATION
Let me visualize the data first.
[Scatter plot of the (Km, Price) pairs from the table above.]
OUTLIER
Looking at the plot, one point stands apart from the linear trend of the others: (8, 55) is an outlier.
[Same scatter plot with the outlier (8, 55) highlighted.]
HYPOTHESIS
Set up a hypothetical model. For a single input variable, the hypothesis is a linear function:
$h_\theta(x): \mathbb{R} \to \mathbb{R}$
$h_\theta(x) = \theta_0 + \theta_1 x, \qquad \theta_0, \theta_1 \in \mathbb{R}$
Start with the simplest case, $\theta_1 = 0$: the model $h_\theta(x) = \theta_0$ is a horizontal line. Drawing it at the mean of y (30.75) puts it through the middle of the data.
[Plot: the (Km, Price) scatter with the horizontal line $h_\theta(x) = 30.75$.]
Now measure how far this line is from the data. With 1 data point, the distance from the line to the first data point $(x^{(1)}, y^{(1)})$ is $(\theta_0 - y^{(1)})$.
Total distance = $(\theta_0 - y^{(1)})$
With 2 data points:
Total distance = $(\theta_0 - y^{(1)}) + (\theta_0 - y^{(2)})$
With 5 data points, note that $(\theta_0 - y^{(5)}) < 0$: the fifth point lies above the line, so its signed distance is negative.
Total distance = $(\theta_0 - y^{(1)}) + \cdots + (\theta_0 - y^{(5)})$
With all 8 data points:
Total distance = $(\theta_0 - y^{(1)}) + (\theta_0 - y^{(2)}) + \cdots + (\theta_0 - y^{(8)})$
This total distance represents the error between the model's predictions and the actual labels.
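These signed distances can cancel each other out: with $\theta_0 = 30.75$ (the mean of y) the sum is exactly zero even though the line fits poorly, which is why the cost function below squares each term instead. A minimal check (assuming NumPy):

```python
import numpy as np

y = np.array([13, 35, 41, 19, 45, 28, 10, 55], dtype=float)
theta0 = y.mean()        # 30.75

signed = theta0 - y      # (theta0 - y_i) for every data point
print(signed.sum())      # 0.0: positive and negative distances cancel
```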
HYPOTHESIS
Set $\theta_0 = 30.75$ (the mean of y) and $\theta_1 = 0$, so the model predicts $\hat{y}^{(i)} = h_\theta(x^{(i)}) = 30.75$ for every input.
[Pipeline diagram: the 8 data points go through the Model, which outputs 30.75 for each one; the Cost Function compares these constant predictions with the labels 13, 35, 41, 19, 45, 28, 10, 55.]
HYPOTHESIS
The Cost Function squares each error so that positive and negative deviations cannot cancel:

ŷ (Prediction)   y (Label)   ŷ − y     (ŷ − y)²
30.75            13           17.75    315.0625
30.75            35           −4.25     18.0625
30.75            41          −10.25    105.0625
30.75            19           11.75    138.0625
30.75            45          −14.25    203.0625
30.75            28            2.75      7.5625
30.75            10           20.75    430.5625
30.75            55          −24.25    588.0625

$\hat{y}^{(i)} = h_\theta(x^{(i)}) = 30.75$
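A quick check of these numbers (a sketch assuming NumPy):

```python
import numpy as np

y = np.array([13, 35, 41, 19, 45, 28, 10, 55], dtype=float)
y_hat = np.full(8, 30.75)          # constant prediction theta0 = 30.75

print(y_hat - y)                   # signed errors: 17.75, -4.25, ...
print((y_hat - y) ** 2)            # squared errors: 315.0625, 18.0625, ...
print(((y_hat - y) ** 2).mean())   # average squared error: 225.6875
```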
MODEL
❑ Input: $x_i \in \mathbb{R}^n$, $i = 1, \dots, m$
❑ Output: $y_i \in \mathbb{R}$ (regression task)
❑ Model parameters: $\theta \in \mathbb{R}^k$
❑ Predicted output: $\hat{y}_i \in \mathbb{R}$
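In code, this generic setup could look like the following sketch (the names predict, X, and theta are illustrative, not from the lesson):

```python
import numpy as np

def predict(X, theta):
    # Linear model: y_hat_i = theta[0] + theta[1:] . x_i for each row x_i of X
    return theta[0] + X @ theta[1:]

X = np.array([[2.0], [7.0], [9.0]])  # m = 3 samples, n = 1 feature
theta = np.array([3.0, 2.0])         # k = 2 parameters
print(predict(X, theta))             # [ 7. 17. 21.]
```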
LINEAR REGRESSION
❑ Pros: Easy to interpret results, computationally inexpensive
❑ Cons: Poorly models nonlinear data
❑ Works with: Numeric values, nominal values
GENERAL APPROACH TO REGRESSION
1. Collect: Any method.
2. Prepare: We’ll need numeric values for regression. Nominal values should be mapped to binary values.
3. Analyze: It's helpful to visualize 2D plots. We can also visualize the regression weights if we apply shrinkage methods.
4. Train: Find the regression weights.
5. Test: We can measure the R², or the correlation of the predicted values and the data, to measure the success of our models.
6. Use: With regression, we can forecast a numeric value for a number of inputs. This is an improvement over classification because we're predicting a continuous value rather than a discrete category.
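As an illustration of steps 4 and 5 on the Grab data (a sketch using NumPy's least-squares solver; the lesson may intend a different training method):

```python
import numpy as np

x = np.array([2, 7, 9, 3, 10, 6, 1, 8], dtype=float)
y = np.array([13, 35, 41, 19, 45, 28, 10, 55], dtype=float)

# Step 4 (Train): solve least squares for [theta0, theta1]
A = np.column_stack([np.ones_like(x), x])   # bias column + feature column
theta, *_ = np.linalg.lstsq(A, y, rcond=None)
print(theta)                                # fitted intercept and slope

# Step 5 (Test): correlation between predictions and labels
y_hat = A @ theta
print(np.corrcoef(y_hat, y)[0, 1])
```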
SINGLE VARIABLE REGRESSION
Regression problems pop up whenever we want to predict a numerical value.
NORMALIZE DATA
Before training, features are typically rescaled to a common scale, for example to zero mean and unit variance, so that no single variable dominates the fit.
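For instance, z-score normalization of the Km feature (a sketch assuming NumPy):

```python
import numpy as np

x = np.array([2, 7, 9, 3, 10, 6, 1, 8], dtype=float)

# Z-score normalization: subtract the mean, divide by the standard deviation
x_norm = (x - x.mean()) / x.std()
print(x_norm.mean(), x_norm.std())  # ~0.0 and 1.0 after normalization
```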
WEIGHT
In $h_\theta(x) = \theta_0 + \theta_1 x$, the weight $\theta_1$ scales the input: it is the slope of the regression line.
BIAS
The bias $\theta_0$ is the intercept: it shifts the whole line up or down, independently of the input.
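A tiny illustration of the two parameters (the values are hypothetical):

```python
def h(x, theta0, theta1):
    # theta1 (weight) sets the slope; theta0 (bias) shifts every prediction
    return theta0 + theta1 * x

print(h(5, theta0=0.0, theta1=4.0))  # 20.0: slope only
print(h(5, theta0=5.0, theta1=4.0))  # 25.0: the bias lifts the line by 5
```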
LOSS FUNCTION
The loss function measures how far the predictions $\hat{y}$ are from the labels $y$; for regression, the mean squared error $J(\theta) = \frac{1}{m} \sum_{i=1}^{m} (\hat{y}^{(i)} - y^{(i)})^2$ is the standard choice.
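A quick numerical check that the constant model's squared-error loss is smallest at the mean of y (a sketch assuming NumPy):

```python
import numpy as np

y = np.array([13, 35, 41, 19, 45, 28, 10, 55], dtype=float)

def loss(theta0):
    # Mean squared error of the constant model h(x) = theta0
    return np.mean((theta0 - y) ** 2)

for t in [20.0, 30.75, 40.0]:
    print(t, loss(t))  # smallest at theta0 = mean(y) = 30.75
```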
MULTIPLE VARIABLES REGRESSION
The same idea extends to several input features: $h_\theta(x) = \theta_0 + \theta_1 x_1 + \cdots + \theta_n x_n$.
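A sketch with a second, made-up feature (e.g. trip duration in minutes; the values are invented purely for illustration):

```python
import numpy as np

# Column 0: Km (from the lesson); column 1: hypothetical trip duration
X = np.array([[2, 10], [7, 25], [9, 30], [3, 12],
              [10, 33], [6, 22], [1, 6], [8, 40]], dtype=float)
y = np.array([13, 35, 41, 19, 45, 28, 10, 55], dtype=float)

A = np.column_stack([np.ones(len(X)), X])      # prepend a bias column
theta, *_ = np.linalg.lstsq(A, y, rcond=None)  # [theta0, theta1, theta2]
print(theta)
```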
K-NEAREST NEIGHBORS
❑ Pros: High accuracy, insensitive to outliers, no assumptions about data
❑ Cons: Computationally expensive, requires a lot of memory
❑ Works with: Numeric values, nominal values
GENERAL APPROACH TO K-NEAREST NEIGHBORS
1. Collect: Any method.
2. Prepare: Numeric values are needed for a distance calculation. A structured data format is best.
3. Analyze: Any method.
4. Train: Does not apply to the k-NN algorithm.
5. Test: Calculate the error rate.
6. Use: This application needs to get some input data and output structured numeric values. Next, the application runs the k-NN algorithm on this input data and determines which class the input data should belong to. The application then takes some action on the calculated class.
PSEUDOCODE
For every point in our dataset:
    calculate the distance between inX and the current point
sort the distances in increasing order
take k items with lowest distances to inX
find the majority class among these items
return the majority class as our prediction for the class of inX
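A runnable version of this pseudocode (a sketch assuming NumPy arrays; the names in_x, dataset, and labels mirror the pseudocode and are illustrative):

```python
import numpy as np
from collections import Counter

def knn_classify(in_x, dataset, labels, k):
    # Euclidean distance between in_x and every point in the dataset
    distances = np.linalg.norm(dataset - in_x, axis=1)
    # Indices of the k nearest points
    nearest = np.argsort(distances)[:k]
    # Majority class among the k nearest neighbors
    votes = Counter(labels[i] for i in nearest)
    return votes.most_common(1)[0][0]

dataset = np.array([[1.0, 1.1], [1.0, 1.0], [0.0, 0.0], [0.0, 0.1]])
labels = ["A", "A", "B", "B"]
print(knn_classify(np.array([0.1, 0.2]), dataset, labels, k=3))  # "B"
```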
DISTANCE METRIC
Distance metrics are used in supervised and unsupervised learning to calculate similarity in data points.
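For example, the two most common metrics (a minimal sketch):

```python
import numpy as np

a = np.array([1.0, 2.0])
b = np.array([4.0, 6.0])

euclidean = np.sqrt(np.sum((a - b) ** 2))  # L2 distance: 5.0
manhattan = np.sum(np.abs(a - b))          # L1 distance: 7.0
print(euclidean, manhattan)
```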
ADVANCED ALGORITHMS
DECISION TREES
❑ Pros: Computationally cheap to use, easy for humans to understand learned results, missing values OK, can
deal with irrelevant features.
❑ Cons: Prone to overfitting
❑ Works with: Numeric values, nominal values
GENERAL APPROACH TO DECISION TREES
1. Collect: Any method.
2. Prepare: This tree-building algorithm works only on nominal values, so any continuous values will need to
be quantized.
3. Analyze: Any method. You should visually inspect the tree after it is built.
4. Train: Construct a tree data structure.
5. Test: Calculate the error rate with the learned tree.
6. Use: This can be used in any supervised learning task. Often, trees are used to better understand the data.
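As a taste of step 4: tree-building algorithms of this kind (e.g. ID3) choose which feature to split on via information gain, which relies on Shannon entropy. A minimal sketch of the entropy computation (the toy labels are made up):

```python
from collections import Counter
from math import log2

def entropy(labels):
    """Shannon entropy of a list of class labels."""
    counts = Counter(labels)
    total = len(labels)
    return -sum((c / total) * log2(c / total) for c in counts.values())

print(entropy(["yes", "yes", "no"]))   # ~0.918: mixed classes
print(entropy(["yes", "yes", "yes"]))  # zero entropy: a perfectly pure node
```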
THANK YOU