Lect02 Problem ML
2023
Contents
1. Learning Components
2. A Simple Learning Model
   • Hypothesis Set
   • Learning Algorithm
3. Feasibility Of Learning
   • Probability to the rescue
4. Risk and Empirical Risk
   • Loss function
   • Empirical risk
   • Regularizer

Notation

symbol            meaning
a, b, c, N, ...   scalar number
w, v, x, y, ...   column vector
X, Y, ...         matrix
R                 set of real numbers
Z                 set of integer numbers
N                 set of natural numbers
R^D               set of vectors
X, Y, ...         set
A                 algorithm

operator   meaning
w^T        transpose
XY         matrix multiplication
X^{−1}     inverse
Learning Components
Credit Approval
• Suppose that a bank receives thousands of credit card applications every day, and it wants to automate the process of evaluating them.
• Applicant information:

  age                  23 years
  gender               male
  annual salary        $30,000
  years in residence   1 year
  years in job         1 year
  current debt         $15,000
  ...                  ...

• Approve credit?
Problem Statement
Formalization
• Input: x (customer application)
• Output: y (good/bad customer, i.e., {1, −1})
• Data: (x_1, y_1), (x_2, y_2), ..., (x_N, y_N) (historical records)
• Target function: f : X → Y (ideal credit approval formula)
Inductive Bias
An unbiased learner can never generalize.

Concept 1
The inductive bias of a learner is the set of assumptions the learner uses to predict outputs for inputs it has not yet encountered.
• Consider: arbitrarily wiggly functions or random truth tables.

  x_1  x_2  x_3  |  y
  0    0    0    |  0
  0    0    1    |  ?
  0    1    0    |  1
  0    1    1    |  1
  1    0    0    |  0
  1    0    1    |  ?
  1    1    0    |  1
  1    1    1    |  ?
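Without an inductive bias, nothing favors one completion of the "?" rows over another. A minimal sketch (Python; the variable names are illustrative) enumerating every truth table consistent with the observed rows:

```python
from itertools import product

# Observed rows of the truth table above: (x1, x2, x3) -> y
observed = {(0, 0, 0): 0, (0, 1, 0): 1, (0, 1, 1): 1,
            (1, 0, 0): 0, (1, 1, 0): 1}
unknown = [(0, 0, 1), (1, 0, 1), (1, 1, 1)]  # the "?" rows

# Every 0/1 assignment to the unknown rows fits the observed data
# perfectly, so the data alone cannot choose among 2**3 = 8 hypotheses.
consistent = []
for labels in product([0, 1], repeat=len(unknown)):
    h = dict(observed)
    h.update(zip(unknown, labels))
    consistent.append(h)

print(len(consistent))  # 8
```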
Inductive Bias (cont.)
Generalization is possible.
• If a machine performs well on most training data AND it is not too complex, it will probably do well on similar test data.
Components of Learning
[Figure: the components of learning — an UNKNOWN TARGET FUNCTION generates the TRAINING EXAMPLES; the LEARNING ALGORITHM searches the HYPOTHESIS SET and outputs the FINAL HYPOTHESIS.]
Learning Model
The two components are referred to as the learning model:
• The hypothesis set H is a set of functions that are potentially similar to f:

  H = {h_{θ_1}, h_{θ_2}, ...}

• The learning algorithm A uses the data D to select from H the best hypothesis g ≈ f.
What is a hypothesis set?
Concept 2
A hypothesis set is a set of potential functions, models, or solutions.

• A hypothesis set can be finite. For example:
  • {guilty, not guilty}
  • {accept, reject}
  • {happy, sad}
  • {1, 2, 3, 4, 5, 6}
What is a hypothesis set? (cont.)
• A hypothesis set can be infinite. For example, the sets of functions y = θ_0 + θ_1 x and y = θ_0 + θ_1 x + θ_2 x² + θ_3 x³.
Parameter representations
• Each element of the hypothesis set is often indexed by parameters or weights (θ or w).
• Two basic representations for parameters: factored and structured.
  1. Factored: a parameter set consists of a vector of attribute values; values can be boolean, real-valued, or one of a fixed set of symbols.
  2. Structured: a parameter set includes objects, each of which may have attributes of its own as well as relationships to other objects.
A Simple Learning Model
• Hypothesis Set
• Learning Algorithm
A Simple Hypothesis Set
We start with a simple model (the perceptron model):
• For input x = (x_1, ..., x_d) (attributes of a customer):

  Approve credit if  Σ_{i=1}^{d} w_i x_i ≥ threshold
A Simple Hypothesis Set (cont.)
• Set w_0 = −threshold:

  h(x) = sign( Σ_{i=1}^{d} w_i x_i + w_0 )

• Introduce an artificial coordinate x_0 = 1:

  h(x) = h_w(x) = sign( Σ_{i=0}^{d} w_i x_i )    (4)
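As a sanity check, a minimal sketch of the hypothesis of eq. (4) (Python with NumPy; the weights, input, and function name are illustrative choices):

```python
import numpy as np

def perceptron_hypothesis(w, x):
    """h_w(x) = sign(w . x), where x[0] = 1 is the artificial coordinate."""
    return np.sign(w @ x)

w = np.array([-1.0, 0.5, 0.5])   # w[0] = -threshold
x = np.array([1.0, 2.0, 1.0])    # x[0] = 1, then two customer attributes
print(perceptron_hypothesis(w, x))  # 1.0 -> approve
```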
2D Model Visualization
• Decision boundary: a line.
• Decision regions: the approve and deny regions.

[Figure: a 2D plot over Attribute 1 and Attribute 2, with a line separating the Approve region from the Deny region.]
A Simple Learning Algorithm
• The performance measure: the error rate.
• We use a simple learning algorithm (the perceptron learning algorithm, PLA) to find w:

  arg min_w E(h_w(x), y | D)    (6)
A Simple Learning Algorithm (cont.)
• Given the training set

  D = {(x_1, y_1), (x_2, y_2), ..., (x_N, y_N)}

1. Init w
2. Repeat until satisfied: pick a misclassified point,

     sign(w^T x_i) ≠ y_i    (7)

   and update

     w ← w + y_i x_i    (8)
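A minimal runnable sketch of the PLA loop of eqs. (7)–(8) (Python/NumPy; the toy data and the iteration cap are illustrative assumptions, not from the lecture):

```python
import numpy as np

def pla(X, y, max_iters=1000):
    """Perceptron learning algorithm.
    X: (N, d+1) matrix with x0 = 1 in the first column; y: labels in {+1, -1}.
    """
    w = np.zeros(X.shape[1])                 # 1. init w
    for _ in range(max_iters):               # 2. repeat until satisfied
        misclassified = np.sign(X @ w) != y  # eq. (7)
        if not misclassified.any():
            break                            # all points correct: done
        i = np.flatnonzero(misclassified)[0]
        w = w + y[i] * X[i]                  # eq. (8)
    return w

# Toy linearly separable data: approve iff x1 + x2 > 1
X = np.array([[1., 0., 0.], [1., 2., 1.], [1., 0.2, 0.3], [1., 1., 1.]])
y = np.array([-1., 1., -1., 1.])
w = pla(X, y)
print(np.sign(X @ w))  # matches y once PLA converges on separable data
```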
A Simple Explanation
[Figure: a simple explanation of the PLA update — the "incorrect" case, where the update moves w toward classifying x_i correctly, and the "correct" case, where no update is needed; the decision boundary is w^T x = 0.]
Is It a Learning Algorithm?
A Learning Puzzle
[Figure: a learning puzzle — two groups of training examples labeled y = −1 and y = +1, and a new example labeled y = ?]
Feasibility Of Learning
• Probability to the rescue
Feasibility Of Learning
The feasibility of learning is thus split into two questions:
1. Can we make the performance good enough?
   • Run our learning algorithm on the actual data D and see how good we can get.
2. Can we make sure that the performance inside of D is close enough to the performance outside of D?
   • Probability theory.
A Related Experiment - Bin Problem
• Consider a BIN with red and green marbles.

[Figure: a BIN of marbles and a SAMPLE of N marbles drawn from it.]

  P[picking a red marble] = µ = fraction of red marbles in the bin
  fraction of red marbles in the sample = ν
Does ν say anything about µ?
What does ν say about µ?
• In a big sample (large N), ν is probably close to µ (within ε).
• Formally,

  P[|ν − µ| > ε] ≤ 2e^{−2ε²N}  for any ε > 0    (9)

  This is called Hoeffding's Inequality.
• The bound does not depend on µ; tradeoff: N, ε, and the bound.
• We have ν ≈ µ ⟹ µ ≈ ν.
• In other words, the statement "µ = ν" is probably approximately correct (P.A.C.).
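A quick Monte Carlo sanity check of eq. (9) (a sketch; the bin fraction µ = 0.6, sample size N = 100, and tolerance ε = 0.1 are arbitrary choices):

```python
import numpy as np

rng = np.random.default_rng(0)
mu, N, eps, trials = 0.6, 100, 0.1, 100_000

# Draw `trials` samples of N marbles each; nu = sample fraction of red
nu = (rng.random((trials, N)) < mu).mean(axis=1)

empirical = np.mean(np.abs(nu - mu) > eps)
bound = 2 * np.exp(-2 * eps**2 * N)
print(f"P[|nu - mu| > {eps}] ~ {empirical:.4f} <= {bound:.4f}")  # bound holds
```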
Connection to Learning
  Bin problem                    Learning problem
  The unknown is a number µ      The unknown is a function f : X → Y
  a marble                       a point x ∈ X
  a green marble                 hypothesis got it right: h(x) = f(x)
  a red marble                   hypothesis got it wrong: h(x) ≠ f(x)
Connection to Learning (cont.)
• The error rate within the sample D, which corresponds to ν in the bin model, is called the in-sample error:

  Ein(h) = fraction of D where f and h disagree
         = (1/N) Σ_{n=1}^{N} I(h(x_n) ≠ f(x_n))

• The error rate outside of D, which corresponds to µ, is the out-of-sample error Eout(h).
• The Hoeffding inequality becomes:

  P[|Ein(h) − Eout(h)| > ε] ≤ 2e^{−2ε²N}  for any ε > 0    (10)

  In a big sample D, the performance inside of D is close enough to the performance outside of D.
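A small sketch of the in-sample error computation (Python/NumPy; the stand-in target f and hypothesis h are toy choices, not from the lecture):

```python
import numpy as np

def in_sample_error(h, f, X):
    """Ein(h) = fraction of the sample where h and f disagree."""
    return np.mean(h(X) != f(X))

# Toy 1-D example: f flips sign at 0.5, h flips at 0.4
X = np.linspace(0, 1, 20)
f = lambda x: np.sign(x - 0.5)
h = lambda x: np.sign(x - 0.4)
print(in_sample_error(h, f, X))  # fraction of disagreements: 0.1
```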
Risk and Empirical Risk
• Loss function
• Empirical risk
• Regularizer
Loss function
Concept 3
Given a hypothesis ŷ = h(x) ∈ H, a non-negative real-valued loss function ℓ(ŷ, y) measures how different the prediction ŷ of a hypothesis is from the true outcome y.
Loss Functions for Binary Classification
• Zero-one loss

  I(h(x) ≠ y)    (11)

• Log loss

  log(1 + e^{−h(x)y})    (12)
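Both losses as small Python functions (a sketch assuming labels in {+1, −1} and a real-valued score h(x); the names are illustrative):

```python
import numpy as np

def zero_one_loss(score, y):
    """Eq. (11): 1 if the predicted label disagrees with y, else 0."""
    return (np.sign(score) != y).astype(float)

def log_loss(score, y):
    """Eq. (12): log(1 + exp(-h(x) * y))."""
    return np.log1p(np.exp(-score * y))

score, y = np.array([2.0, -0.5]), np.array([1.0, 1.0])
print(zero_one_loss(score, y))  # [0. 1.]
print(log_loss(score, y))       # smooth surrogate: [0.127 0.974]
```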
Loss Functions for Regression
• Squared loss

  (h(x) − y)²    (14)

• Absolute loss

  |h(x) − y|    (15)
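And the two regression losses in the same sketch style (illustrative names and data):

```python
import numpy as np

def squared_loss(pred, y):
    """Eq. (14): (h(x) - y)^2 -- penalizes large errors heavily."""
    return (pred - y) ** 2

def absolute_loss(pred, y):
    """Eq. (15): |h(x) - y| -- more robust to outliers."""
    return np.abs(pred - y)

pred, y = np.array([3.0, 0.0]), np.array([1.0, 0.5])
print(squared_loss(pred, y))   # [4.   0.25]
print(absolute_loss(pred, y))  # [2.  0.5]
```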
Risk
Concept 4
The risk E associated with a hypothesis h(x) is defined as the expectation of the loss function:

  E(h) = E[ℓ(h(x), y)] = ∫ ℓ(h(x), y) dp(x, y)    (16)
Empirical Risk
Concept 5
The empirical risk Ê is the average of the loss function on the training set D = {(x_1, y_1), (x_2, y_2), ..., (x_N, y_N)}:

  Ê = (1/N) Σ_{i=1}^{N} ℓ(h_w(x_i), y_i)    (17)

Theorem 2
The empirical risk is an unbiased estimate of the risk.
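A simulation illustrating Theorem 2 (a sketch under an assumed toy distribution p(x, y); every name here is illustrative): averaging Ê over many independent training sets approaches the risk E.

```python
import numpy as np

rng = np.random.default_rng(1)
h = lambda x: 2.0 * x                      # fixed hypothesis
loss = lambda yhat, y: (yhat - y) ** 2     # squared loss

def empirical_risk(N):
    x = rng.random(N)
    y = 2.0 * x + rng.normal(0, 0.5, N)    # toy data distribution p(x, y)
    return np.mean(loss(h(x), y))          # eq. (17)

# Here the true risk E[(h(x) - y)^2] equals the noise variance 0.25,
# and the average of Ê over many training sets recovers it.
print(np.mean([empirical_risk(N=10) for _ in range(20_000)]))  # ~0.25
```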
Empirical Risk (cont.)
Concept 6
The empirical risk of a hypothesis h_w(x) with a loss function ℓ and a regularizer reg:

  Ê = (1/N) Σ_{i=1}^{N} ℓ(h_w(x_i), y_i) + λ·reg(w)    (18)

where the first term is the average loss and the second term is the regularizer.
The empirical risk minimization principle
Principle
The learning algorithm should choose a hypothesis h_w which minimizes the empirical risk:

  h_w = arg min_{h_w ∈ H} Ê(h_w | D)    (19)
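A minimal illustration of the ERM principle of eq. (19) over a small finite hypothesis set (a sketch; the candidate parameters and data are toy choices):

```python
import numpy as np

X = np.array([0.0, 0.5, 1.0])
y = np.array([0.1, 0.9, 2.1])

# Finite hypothesis set: h_theta(x) = theta * x for a few candidate thetas
thetas = [0.5, 1.0, 1.5, 2.0]
emp_risk = lambda t: np.mean((t * X - y) ** 2)  # eq. (17), squared loss

best = min(thetas, key=emp_risk)  # eq. (19): pick the ERM hypothesis
print(best, emp_risk(best))       # 2.0 minimizes Ê on this data
```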
Regularizers
Theorem 3
For each λ ≥ 0, there exists B ≥ 0 such that the two formulations are equivalent:

  arg min_w Σ_{i=1}^{N} ℓ(h_w(x_i), y_i) + λ·reg(w)    (20)

  arg min_w Σ_{i=1}^{N} ℓ(h_w(x_i), y_i)  subject to reg(w) ≤ B    (21)
Regularizers (cont.)
• L2-regularization

  reg(w) = w^T w = ‖w‖₂²    (22)

• L1-regularization

  reg(w) = ‖w‖₁    (23)
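A sketch combining eq. (18) with the regularizers of eqs. (22)–(23) (Python/NumPy; the linear hypothesis, data, and λ are illustrative assumptions):

```python
import numpy as np

def l2_reg(w):  # eq. (22)
    return w @ w

def l1_reg(w):  # eq. (23)
    return np.sum(np.abs(w))

def regularized_empirical_risk(w, X, y, lam, reg):
    """Eq. (18) for a linear hypothesis h_w(x) = w . x with squared loss."""
    preds = X @ w
    return np.mean((preds - y) ** 2) + lam * reg(w)

X = np.array([[1., 0.], [1., 1.], [1., 2.]])
y = np.array([0.1, 1.0, 2.1])
w = np.array([0.0, 1.0])
print(regularized_empirical_risk(w, X, y, lam=0.1, reg=l2_reg))
print(regularized_empirical_risk(w, X, y, lam=0.1, reg=l1_reg))
```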