
STAT6121-ML

Introduction
Machine learning
Machine learning refers to approaches to data analysis
that cannot be carried out without computers
In common with statistical modelling/analysis
• For prediction and classification
• Requires an optimisation procedure
• Obtains parameters or functions from observations
• Uncertainty of learning vs. prediction/classification
New elements or techniques
• Explanation or theoretical construct is not the emphasis
• Data can be ‘organic’, such as text or images
• Distinction between training and validation/test data
• Reliance on ready-made software for implementation

Some broad remarks
Supervised vs. unsupervised learning
• Is there a target outcome y given covariates/features x?
NB. log-linear models of contingency tables
• Can the learned result be applied to unseen units?
NB. principal components, clustering
Prediction vs. classification
• Best prediction of y is its expectation µx = E(y | x), since
E{(y − µ)² | x} = (µx − µ)² + E{(y − µx)² | x}
for any constant µ, which is minimised at µ = µx

• Best classification of categorical y is
y0 = arg max_{y′} Pr(y = y′ | x)
e.g. if y ∼ N(µ, σ²), then E(y) = µ but Pr(y = µ) = 0;
however, let z = I(y > µ − σ), then z0 = 1,
since Pr(z = 1) = Φ(1) ≈ 0.84 > 0.5 (see the sketch below)
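A minimal simulation sketch of this contrast (the values µ = 2, σ = 1 are
illustrative, not from the notes):

# best prediction vs. best classification for y ~ N(mu, sigma^2)
set.seed(1)
mu = 2; sigma = 1
y = rnorm(1e5, mu, sigma)
mean((y - mu)^2)          # approx sigma^2: the mean minimises squared error
mean((y - (mu + 0.5))^2)  # any other constant predictor does worse
z = as.integer(y > mu - sigma)
mean(z)                   # approx pnorm(1) = 0.84 > 0.5, hence z0 = 1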

Some broad remarks
Parametric vs. non-parametric models
• function/model f(x; θ) is fixed given θ, i.e. the parameters,
where f(x) = E(y | x) or f(x) = Pr(y | x)
• parametric if θ contains a fixed number of constants
NB. linear regression model as a typical example
• non-parametric if the no. of unknowns in θ grows with the
no. of observations, or if f is indeterminate in advance
Error vs. residual
• Given f(x) = E(y | x) or y0 = arg max_{y′} Pr(y = y′ | x), the error is
e = y − f(x) or e = I(y ≠ y0)
• Given fˆ or ŷ0 as the estimate of f or y0, the residual is
ê = y − fˆ(x) or ê = I(y ≠ ŷ0),
if (y, x) are used for obtaining fˆ or ŷ0 (illustrated below)
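A sketch of the distinction on simulated data, where the true f is known
by construction (the set-up is illustrative):

# error vs. residual in a simulated linear model
set.seed(2)
n = 100
x = seq(1, 10, length=n)
f = 0.5 + x                 # true f(x) = E(y | x)
y = f + rnorm(n)
fit = lm(y ~ x)
e = y - f                   # error: requires the true f
ehat = residuals(fit)       # residual: based on the estimated fhat
c(var(e), var(ehat))        # close, but not identical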
Bias-variance trade-off

Eq. (2.7), mean squared error (MSE) of fˆ(x) for y given x:

E{(y − fˆ(x))²} = E{(y − f(x) + f(x) − fˆ(x))²}
= E{(y − f(x))²} + E{(f(x) − fˆ(x))²} − 2E{(y − f(x))(f(x) − fˆ(x))}
= V(e(x)) + V(fˆ(x)) + Bias(fˆ(x))²

over fˆ(x) and y that are independent of each other,
so that the cross term vanishes
NB. V(e(x)) is unaffected by whichever fˆ

Q: Reduce V(fˆ(x)) and Bias(fˆ(x))² at the same time?
• to reduce V(fˆ(x)), let fˆ(x) be obtained based on many
observations, e.g. by using parametric f(x; θ)...
• to reduce Bias(fˆ(x))², let fˆ(x) only depend on close-by
observations, provided f is reasonably smooth...
• hence, the bias-variance trade-off (a Monte Carlo check is
sketched below)
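A possible Monte Carlo check of the decomposition at a single point
(the set-up is illustrative: true f(x) = 0.5 + log(x), N(0, 1) noise,
x0 = 5):

# MSE decomposition: compare a rigid and a flexible fit at x0
set.seed(3)
R = 2000; n = 100; x0 = 5
f = function(x) 0.5 + log(x)
fhat.lin = fhat.cub = numeric(R)
for (r in 1:R) {
  x = seq(1, 10, length=n)
  y = f(x) + rnorm(n)
  fhat.lin[r] = predict(lm(y ~ x), data.frame(x=x0))
  fhat.cub[r] = predict(lm(y ~ poly(x, 3)), data.frame(x=x0))
}
rbind(linear = c(bias2 = (mean(fhat.lin) - f(x0))^2, var = var(fhat.lin)),
      cubic  = c(bias2 = (mean(fhat.cub) - f(x0))^2, var = var(fhat.cub)))
# the rigid linear fit has the larger squared bias, the flexible cubic
# fit the larger variance; V(e(x0)) = 1 is common to both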
Ch. 3, exercise 4
Answer by ML, e.g. x ∈ (1, 10) and f(x) = β0 + β1 log(x) for (c)
get.dta <- function(n=100, beta=c(0.5,1), nonlnr=F)
{
  # n observations on an equally spaced grid over (1, 10)
  x = seq(1,10,length=n)
  if (nonlnr) { f = beta[1] + beta[2]*log(x) }  # f(x) = beta0 + beta1*log(x)
  else { f = beta[1] + beta[2]*x }              # f(x) = beta0 + beta1*x
  y = f + rnorm(n, 0, 1)                        # N(0,1) noise
  x2 = x^2; x3 = x^3                            # terms for the cubic fit
  data.frame(y,x,x2,x3,f)
}

main <- function(n=100, beta=c(0.5,1), nonlnr=F, vis=F)
{
  # generate data, then compare a linear and a cubic fit
  dta = get.dta(n=n, beta=beta, nonlnr=nonlnr)
  if (nonlnr) { cat("data generated under nonlinear model\n\n") }
  else { cat("data generated under linear model\n\n") }
  cat("fitting simple linear regression:\n")
  print(summary(lm(y ~ x, data=dta)))
  cat("fitting cubic (polynomial) regression:\n")
  print(summary(lm(y ~ x + x2 + x3, data=dta)))
  if (vis) { plot(dta$x, dta$y); lines(dta$x, dta$f) }  # data with true f overlaid
}
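For instance, part (c) might be reproduced with:

# data from the nonlinear model, with both fits and a visual check
main(n=100, beta=c(0.5,1), nonlnr=TRUE, vis=TRUE)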

Additional exercise

[Figure: scatter plot of y against equally spaced x, with fitted lines
fˆ1(x) = β̂x (solid) and fˆ2(x) = y (dashed)]


• What is V(fˆ(x)) at any given x, for fˆ = fˆ1 or fˆ2?
What can you say about Bias(fˆ(x))?
Consider the KNN predictor given K (a minimal implementation
is sketched below)
• How would you apply the method if x = 5 or 10?
• What about V(fˆ(x)) and Bias(fˆ(x)) in this case?
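A minimal KNN predictor one might use here (knn.predict is a
hypothetical helper, not from the notes):

# KNN prediction of y at x0: average the y-values of the K nearest x-values
knn.predict = function(x0, x, y, K) {
  nbr = order(abs(x - x0))[1:K]   # indices of the K nearest neighbours
  mean(y[nbr])
}
# at x0 = 5 the neighbours lie on both sides; at the boundary x0 = 10
# they all lie below, which tends to increase the bias there
x = seq(1, 10, length=100)
y = 0.5 + x + rnorm(100)
c(knn.predict(5, x, y, K=10), knn.predict(10, x, y, K=10))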
 

Additional exercise
β̂ = Σ_{i=1}^n x_i y_i / Σ_{i=1}^n x_i²

V(fˆ1(x)) = V(β̂x) = x² V(β̂)
= x² Σ_{i=1}^n x_i² V(y_i | x_i) / (Σ_{i=1}^n x_i²)² = x² V(y_i | x_i) / Σ_{i=1}^n x_i²

V̂(y_i | x_i) = Σ_{i=1}^n (y_i − β̂x_i)² / (n − 1)

V(fˆ2(x)) = V(y | x) = V(y_i | x_i)
NB. non-existent V̂(y_i | x_i), since there is only one observation at each x_i

For the KNN predictor:

fˆ(x) = Σ_{j=1}^K y_j(x) / K

V(fˆ(x)) = Σ_{j=1}^K V(y_j(x)) / K² = V(y | x) / K

V̂(y_j(x)) = Σ_{j=1}^K (y_j(x) − fˆ(x))² / (K − 1)   NB. from the K obs.

Assume unbiasedness in all the cases...
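The formula for V(fˆ1(x)) can be checked by simulation; a sketch under
an assumed set-up (y = x + N(0, 1) noise on an equally spaced grid):

# Monte Carlo check of V(fhat1(x)) = x^2 * V(y_i | x_i) / sum(x_i^2)
set.seed(4)
R = 5000; n = 100; x0 = 5
x = seq(1, 10, length=n)
bhat = replicate(R, {
  y = x + rnorm(n)                # true beta = 1, sigma = 1
  sum(x * y) / sum(x^2)           # least-squares slope through the origin
})
c(empirical = var(x0 * bhat), theory = x0^2 / sum(x^2))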
