Supervised and Unsupervised Learning

Given an input or feature vector x, one of the main goals of machine learning is to predict an output or response variable y. For example, x could be a digitized signature and y a binary
variable that indicates whether the signature is genuine or false. Another example is where x
represents the weight and smoking habits of an expecting mother and y the birth weight of the
baby. The data science attempt at this prediction is encoded in a mathematical function g, called the prediction function, which takes as an input x and outputs a guess g(x) for y (denoted by ŷ, for example). In a sense, g encompasses all the information about the relationship between the variables x and y, excluding the effects of chance and randomness in nature.

In regression problems, the response variable y can take any real value. In contrast, when y can only lie in a finite set, say y ∈ {0, . . . , c − 1}, then predicting y is conceptually the same as classifying the input x into one of c categories, and so prediction becomes a
classification problem.

We can measure the accuracy of a prediction ŷ with respect to a given response y by using some loss function Loss(y, ŷ). In a regression setting the usual choice is the squared-error loss (y − ŷ)². In the case of classification, the zero–one (also written 0–1) loss function Loss(y, ŷ) = 1{y ≠ ŷ} is often used, which incurs a loss of 1 whenever the predicted class ŷ is not equal to the class y. Later on in this book, we will encounter various other useful loss functions, such as the cross-entropy and hinge loss functions (see, e.g., Chapter 7).
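
As an illustration, both of these loss functions are straightforward to express in code. The following is a minimal Python/NumPy sketch; the function names are ours, chosen for this example only.

```python
import numpy as np

def squared_error_loss(y, y_hat):
    """Squared-error loss (y - y_hat)**2, the usual choice in regression."""
    return (y - y_hat) ** 2

def zero_one_loss(y, y_hat):
    """Zero-one loss 1{y != y_hat}, commonly used in classification."""
    return np.asarray(y != y_hat, dtype=float)

print(squared_error_loss(3.0, 2.5))                              # 0.25
print(zero_one_loss(np.array([0, 1, 2]), np.array([0, 2, 2])))   # [0. 1. 0.]
```
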
The word error is often used as a measure of distance between a “true” object y and some approximation ŷ thereof. If y is real-valued, the absolute error |y − ŷ| and the squared error (y − ŷ)² are both well-established error concepts, as are the norm ∥y − ŷ∥ and squared norm ∥y − ŷ∥² for vectors. The squared error (y − ŷ)² is just one example of a loss function.

It is unlikely that any mathematical function g will be able to make accurate predictions for all possible pairs (x, y) one may encounter in
Nature. One reason for this is that, even with the same input x, the output y may be different,
depending on chance circumstances or randomness. For this reason, we adopt a probabilistic
approach and assume that each pair (x, y) is the outcome of a random pair (X, Y) that has some joint
probability density f(x, y). We then assess the predictive performance via the expected loss, usually called the risk, for g:

ℓ(g) = E Loss(Y, g(X)).     (2.1)

For example, in the classification case with the zero–one loss function the risk is equal to the probability of incorrect classification: ℓ(g) = P[Y ≠ g(X)]. In this context, the prediction function g is called a classifier.

Given the distribution of (X, Y) and any loss function, we can in principle find the best possible g∗ := argmin_g E Loss(Y, g(X)) that yields the smallest risk ℓ∗ := ℓ(g∗). We will see in Chapter 7 that in the classification case with y ∈ {0, . . . , c − 1} and ℓ(g) = P[Y ≠ g(X)], we have

g∗(x) = argmax_{y ∈ {0,...,c−1}} f(y | x),

where f(y | x) = P[Y = y | X = x] is the conditional probability of Y = y given X = x.
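
To make this concrete, the sketch below uses a small made-up example (it is not from the text): we fix a discrete joint pmf f(x, y), form the conditional probabilities f(y | x), build the optimal classifier g∗(x) = argmax_y f(y | x), and evaluate its risk P[Y ≠ g∗(X)] both exactly and by Monte Carlo.

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical joint pmf f(x, y) for x in {0, 1, 2} and y in {0, 1};
# the whole table sums to 1 (values chosen only for illustration).
joint = np.array([[0.30, 0.05],
                  [0.10, 0.15],
                  [0.05, 0.35]])

# Conditional probabilities f(y | x) = f(x, y) / f(x).
cond = joint / joint.sum(axis=1, keepdims=True)

# Optimal classifier: g*(x) = argmax_y f(y | x).
g_star = cond.argmax(axis=1)

# Exact risk under zero-one loss: P[Y != g*(X)] = sum_x f(x) * (1 - max_y f(y | x)).
risk_exact = np.sum(joint.sum(axis=1) * (1 - cond.max(axis=1)))

# Monte Carlo check: draw (X, Y) pairs from the joint pmf and average the loss.
n = 100_000
flat = rng.choice(joint.size, size=n, p=joint.ravel())
X, Y = np.unravel_index(flat, joint.shape)
risk_mc = np.mean(Y != g_star[X])

print(g_star)               # predicted class for each x
print(risk_exact, risk_mc)  # the two risk values should be close
```
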
As already mentioned, for regression the most widely-used loss function is the squared-error loss. In this setting, the optimal prediction function g∗ is often called the regression function. The following theorem specifies its exact form.

Theorem 2.1: Optimal Prediction Function for Squared-Error Loss

For the squared-error loss Loss(y, ŷ) = (y − ŷ)², the optimal prediction function g∗ is equal to the conditional expectation of Y given X = x:

g∗(x) = E[Y | X = x].

Proof: Let g∗(x) = E[Y | X = x]. For any function g, the squared-error risk satisfies

E(Y − g(X))² = E[(Y − g∗(X) + g∗(X) − g(X))²]
             = E(Y − g∗(X))² + 2 E[(Y − g∗(X))(g∗(X) − g(X))] + E(g∗(X) − g(X))²
             ⩾ E(Y − g∗(X))² + 2 E[(Y − g∗(X))(g∗(X) − g(X))]
             = E(Y − g∗(X))² + 2 E{(g∗(X) − g(X)) E[Y − g∗(X) | X]},

where the inequality holds because E(g∗(X) − g(X))² ⩾ 0. In the last equation we used the tower property. By the definition of the conditional expectation, we have E[Y − g∗(X) | X] = 0. It follows that E(Y − g(X))² ⩾ E(Y − g∗(X))², showing that g∗ yields the smallest squared-error risk. □
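
Theorem 2.1 is also easy to check numerically. The sketch below uses a toy model of our own choosing (not from the text): X is uniform on [0, 2] and Y = sin(πX) + noise, so that g∗(x) = E[Y | X = x] = sin(πx). A Monte Carlo estimate of the squared-error risk is computed for g∗ and for a few other candidate prediction functions; g∗ should attain the smallest value, roughly Var ε = 0.09.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200_000

# Toy model (an assumption for illustration): X ~ Uniform(0, 2),
# Y = sin(pi * X) + eps with eps ~ N(0, 0.3^2), so g*(x) = sin(pi * x).
X = rng.uniform(0, 2, size=n)
Y = np.sin(np.pi * X) + rng.normal(0, 0.3, size=n)

candidates = {
    "g*(x) = sin(pi x)  (conditional mean)": lambda x: np.sin(np.pi * x),
    "g(x) = 0":                              lambda x: np.zeros_like(x),
    "g(x) = x - 1":                          lambda x: x - 1,
    "g(x) = 0.9 sin(pi x)":                  lambda x: 0.9 * np.sin(np.pi * x),
}

# Monte Carlo estimate of the squared-error risk E (Y - g(X))^2 for each candidate.
for name, g in candidates.items():
    risk = np.mean((Y - g(X)) ** 2)
    print(f"{name:40s} risk ≈ {risk:.4f}")
```
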
One consequence of Theorem 2.1 is that, conditional on X = x, the (random) response Y can be written as

Y = g∗(x) + ε(x),     (2.2)

where ε(x) can be viewed as the random deviation of the response from its conditional mean at x. This random deviation satisfies E ε(x) = 0. Further, the conditional variance of the response Y at x can be written as Var ε(x) = v²(x) for some unknown positive function v. Note that, in general, the probability distribution of ε(x) is unspecified.

Since the optimal prediction function g∗ depends on the typically unknown joint distribution of (X, Y), it is not available in practice. Instead, all that we have available is a finite number of (usually) independent realizations from the joint density f(x, y). We denote this sample by T = {(X1, Y1), . . . , (Xn, Yn)} and call it the training set (T is a mnemonic for training) with n examples. It will be important to distinguish between a random training set T and its (deterministic) outcome {(x1, y1), . . . , (xn, yn)}. We will use the notation τ for the latter. We will also add the subscript n in τn when we wish to emphasize the size of the training set.
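
In simulation studies, such a training set is typically produced by sampling directly from an assumed model of the form (2.2). A minimal sketch, with a toy g∗ and noise distribution of our own choosing:

```python
import numpy as np

def generate_training_set(n, rng):
    """Draw n iid pairs (x_i, y_i) from a toy joint density f(x, y):
    X ~ Uniform(0, 1) and Y = g*(X) + eps, as in (2.2)."""
    x = rng.uniform(0, 1, size=n)
    g_star = 2 * x ** 2                # conditional mean E[Y | X = x] (illustrative choice)
    eps = rng.normal(0, 0.1, size=n)   # random deviation with E eps(x) = 0
    return x, g_star + eps

rng = np.random.default_rng(42)
x_train, y_train = generate_training_set(n=100, rng=rng)  # one outcome tau_n of T
print(x_train[:3], y_train[:3])
```
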