Kernel
for some choice of weights w(x, xi). Indeed, both linear regression and k-nearest-neighbors are
special cases of this.
• Here we will examine another important linear smoother, called kernel smoothing or kernel
regression. We start by defining a kernel function K : R → R, satisfying
\[
\int K(x)\, dx = 1, \qquad K(x) = K(-x)
\]
Given a bandwidth h > 0, we use the weights w(x, xi) = K((xi − x)/h) / Σj K((xj − x)/h), i.e., the kernel weights normalized to sum to 1, in the linear smoother form (1). In other words, the kernel regression estimator is
\[
\hat{r}(x) = \frac{\sum_{i=1}^n K\left(\frac{x_i - x}{h}\right) y_i}{\sum_{i=1}^n K\left(\frac{x_i - x}{h}\right)}
\]
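• As a concrete illustration (not from the original notes; the function and variable names below are just for this sketch), here is a minimal Python version of the estimator above, assuming a Gaussian kernel:

```python
import numpy as np

def gaussian_kernel(u):
    """Standard Gaussian kernel: symmetric and integrates to 1."""
    return np.exp(-0.5 * u**2) / np.sqrt(2 * np.pi)

def kernel_regression(x0, x, y, h, K=gaussian_kernel):
    """Estimate r_hat(x0) as a weighted average of the y_i,
    with weights K((x_i - x0)/h) normalized to sum to 1."""
    w = K((x - x0) / h)               # one unnormalized weight per training point
    return np.sum(w * y) / np.sum(w)  # dividing by the total makes the weights sum to 1

# Tiny synthetic example (illustrative data, not from the notes)
rng = np.random.default_rng(0)
x = np.sort(rng.uniform(0, 1, 100))
y = np.sin(2 * np.pi * x) + rng.normal(scale=0.3, size=100)
print(kernel_regression(0.5, x, y, h=0.1))
```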
• What is this doing? This is a weighted average of the yi values. Think about laying down a
Gaussian kernel centered at a specific query point x, and evaluating its height at each xi in order
to determine the weight associated with yi
• Because these weights are smoothly varying with x, the kernel regression estimator r̂(x) itself
is also smoothly varying with x; compare this to k-nearest-neighbors regression
• What’s in the choice of kernel? Different kernels can give different results. But many of the
common kernels tend to produce similar estimators; e.g., Gaussian vs. Epanechnikov, there’s
not a huge difference
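• For reference, the two kernels just mentioned can be written down explicitly; the short Python sketch below (an illustration, not part of the notes) shows both, and either can be plugged into the estimator above:

```python
import numpy as np

def gaussian_kernel(u):
    # K(u) = exp(-u^2 / 2) / sqrt(2*pi): positive everywhere, decays smoothly
    return np.exp(-0.5 * u**2) / np.sqrt(2 * np.pi)

def epanechnikov_kernel(u):
    # K(u) = (3/4)(1 - u^2) for |u| <= 1, and 0 outside that interval
    return np.where(np.abs(u) <= 1, 0.75 * (1 - u**2), 0.0)
```

Both integrate to 1 and are symmetric; in practice the resulting fits are usually very close, which is the point made above.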
• A much bigger difference comes from choosing different bandwidth values h. What’s the
tradeoff present when we vary h? Hint: as we’ve mentioned before, you should always keep
these two quantities in mind ...
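• As a quick numerical illustration of this tradeoff (a sketch with made-up synthetic data, not from the notes), one can compare the error of kernel regression fits at a too-small, a moderate, and a too-large bandwidth:

```python
import numpy as np

def fit_curve(x_grid, x, y, h):
    """Gaussian-kernel regression fit evaluated over a grid of query points."""
    K = lambda u: np.exp(-0.5 * u**2)          # kernel constants cancel in the ratio
    W = K((x[None, :] - x_grid[:, None]) / h)  # (n_grid, n) matrix of unnormalized weights
    return (W @ y) / W.sum(axis=1)

rng = np.random.default_rng(1)
x = np.sort(rng.uniform(0, 1, 100))
y = np.sin(2 * np.pi * x) + rng.normal(scale=0.5, size=100)
grid = np.linspace(0.05, 0.95, 50)
for h in (0.005, 0.05, 0.5):  # small h: wiggly (high variance); large h: oversmoothed (high bias)
    err = np.mean((fit_curve(grid, x, y, h) - np.sin(2 * np.pi * grid))**2)
    print(h, err)
```

Typically the error is smallest at an intermediate bandwidth, which is exactly the bias-variance story developed next.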
• Fortunately, these can actually roughly be worked out theoretically, under some smoothness
assumptions on r (and other assumptions). We can show that
\[
\mathrm{Bias}(\hat{r}(x))^2 = \bigl( \mathbb{E}[\hat{r}(x)] - r(x) \bigr)^2 \leq C_1 h^2
\]
and
\[
\mathrm{Var}(\hat{r}(x)) \leq \frac{C_2}{n h},
\]
for some constants C1 and C2 . Does this make sense? What happens to the bias and variance
as h shrinks? As h grows?
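• A heuristic way to see where these bounds come from (a rough sketch under stated assumptions, not the formal argument): suppose the xi are fixed, the noise has variance σ², r is Lipschitz with constant L, and the kernel effectively gives weight only to the roughly n·h points within distance about h of x. Since the weights w(x, xi) = K((xi − x)/h) / Σj K((xj − x)/h) sum to 1,
\[
\bigl| \mathbb{E}[\hat{r}(x)] - r(x) \bigr|
= \Bigl| \sum_{i=1}^n w(x, x_i) \bigl( r(x_i) - r(x) \bigr) \Bigr|
\lesssim L \, h,
\qquad
\mathrm{Var}(\hat{r}(x)) = \sigma^2 \sum_{i=1}^n w(x, x_i)^2 \approx \frac{\sigma^2}{n h},
\]
because roughly n·h points share the weight, so Σi w(x, xi)² ≈ 1/(nh). Squaring the first display gives the h² bound on the squared bias.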
• This means that
\[
\mathbb{E}[\mathrm{TestErr}(\hat{r}(x))] \leq \sigma^2 + C_1 h^2 + \frac{C_2}{n h}.
\]
We can find the best bandwidth h, i.e., the one minimizing this bound on the test error, by differentiating and
setting equal to 0: this yields
\[
h = \left( \frac{C_2}{2 C_1 n} \right)^{1/3}.
\]
Is this a realistic choice for the bandwidth? The problem is that we don’t know C1 and C2!
(And even if we did, it may not be a good idea to use this ... why?)
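• For completeness, the calculus behind that choice (a one-line sketch, using only the bound above):
\[
\frac{d}{dh} \Bigl( C_1 h^2 + \frac{C_2}{n h} \Bigr) = 2 C_1 h - \frac{C_2}{n h^2} = 0
\quad \Longrightarrow \quad
h^3 = \frac{C_2}{2 C_1 n}
\quad \Longrightarrow \quad
h = \Bigl( \frac{C_2}{2 C_1 n} \Bigr)^{1/3},
\]
and plugging this h back in shows that the excess test error (everything beyond σ²) scales like n^{-2/3}.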
• In multiple dimensions, say, each xi ∈ Rp, we can still easily use kernels: we just replace xi − x in
the kernel argument by ‖xi − x‖2, so that the multivariate kernel regression estimator is
\[
\hat{r}(x) = \frac{\sum_{i=1}^n K\left(\frac{\|x_i - x\|_2}{h}\right) y_i}{\sum_{i=1}^n K\left(\frac{\|x_i - x\|_2}{h}\right)}
\]
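• A minimal multivariate sketch in Python (illustrative only; it assumes a Gaussian kernel, with the rows of X playing the role of the xi ∈ Rp):

```python
import numpy as np

def multivariate_kernel_regression(x0, X, y, h):
    """Kernel regression in R^p: the kernel is applied to ||x_i - x0||_2 / h.
    X has shape (n, p), x0 has shape (p,)."""
    dists = np.linalg.norm(X - x0, axis=1)   # Euclidean distances ||x_i - x0||_2
    w = np.exp(-0.5 * (dists / h)**2)        # Gaussian kernel of the scaled distances
    return np.sum(w * y) / np.sum(w)

# Illustrative data in p = 3 dimensions
rng = np.random.default_rng(2)
X = rng.uniform(0, 1, size=(500, 3))
y = X.sum(axis=1) + rng.normal(scale=0.1, size=500)
print(multivariate_kernel_regression(np.array([0.5, 0.5, 0.5]), X, y, h=0.3))
```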
• The same calculations as those that went into producing the bias and variance bounds above
can be done in this multivariate case, showing that
\[
\mathrm{Bias}(\hat{r}(x))^2 \leq \tilde{C}_1 h^2
\]
and
\[
\mathrm{Var}(\hat{r}(x)) \leq \frac{\tilde{C}_2}{n h^p}.
\]
Why is the variance so strongly affected now by the dimension p? What is the optimal h, now?
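• For intuition (a heuristic sketch, not from these notes): a ball of radius about h around x contains only on the order of n·h^p of the points, so far fewer observations get averaged for the same h, which is why the variance blows up with p. Repeating the earlier calculus with the new variance bound,
\[
\frac{d}{dh} \Bigl( \tilde{C}_1 h^2 + \frac{\tilde{C}_2}{n h^p} \Bigr)
= 2 \tilde{C}_1 h - \frac{p \, \tilde{C}_2}{n h^{p+1}} = 0
\quad \Longrightarrow \quad
h = \Bigl( \frac{p \, \tilde{C}_2}{2 \tilde{C}_1 n} \Bigr)^{1/(p+2)},
\]
and the resulting excess error scales like n^{-2/(p+2)}, which degrades rapidly as p grows.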
• A little later we’ll see an alternative extension to higher dimensions that doesn’t suffer nearly
the same variance problem; this is called an additive model