[Figure: simple kernel regression vs. locally weighted (linear) regression fits to the data, Y against X]
Locally weighted regression (also called local polynomial regression) is a form of nonparametric regression that addresses this boundary problem. Locally weighted regression uses weighted least squares (WLS) regression⁵ to fit a d-th degree polynomial to the data, where d is an integer; e.g., d = 1 is local linear regression, d = 2 is local quadratic regression, etc. The weights assigned to the observations are calculated via the kernel function, as above. These weights are then used to estimate the coefficients of a local polynomial fit. The simple kernel regression, as described previously, is just a special form of locally weighted regression, with d = 0. Apart from the boundary problem of kernel regression mentioned earlier, locally weighted regression also addresses the problem of potentially inflated bias and variance in the interior of the data set if the points are not uniformly densely distributed or if substantial curvature is present in the underlying, though unspecified, regression function. The figure above illustrates these points: the locally linear regression fit seems more accurate than the kernel regression fit, especially towards the boundaries and at points of curvature.

⁵ Weighted least squares is a method of regression, similar to least squares in that it minimizes the sum of squared residuals. However, instead of weighting all the residuals equally, they are weighted such that points with a greater weight contribute more to the sum.

We now sketch the procedure for locally weighted regression. Consider, as before, fitting y_j at the point x_j. First, the weights w_ij are obtained for the i = 1, 2, ..., m points in the memory set. This results in the vector of kernel weights $\mathbf{w}_j$:
$\mathbf{w}_j = \left( w_{1j} \;\; w_{2j} \;\; \cdots \;\; w_{mj} \right)$
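As a concrete illustration, here is a minimal sketch of this weight computation. The kernel and bandwidth are not restated in this section, so a Gaussian kernel with bandwidth h is assumed, and the weights are normalised to sum to one, as in the simple kernel regression estimate recalled below.

```python
import numpy as np

def kernel_weight_vector(x_query, x_mem, h=1.0):
    """Vector w_j = (w_1j, ..., w_mj) of kernel weights of the m memory
    points for a single query point x_j.

    Assumptions: Gaussian kernel, bandwidth h, weights normalised to sum to one.
    """
    u = (x_mem - x_query) / h
    k = np.exp(-0.5 * u ** 2)      # K((x_i - x_j) / h) for each memory point
    return k / k.sum()             # normalise so the weights sum to one

# Example: weights of five memory points for the query point x_j = 2.0
x_mem = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
w_j = kernel_weight_vector(2.0, x_mem, h=1.5)
```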
Recall that the simple (zero-order/Nadaraya-Watson) KR estimate of y_j is a weighted sum of the y_i's:

14. $\hat{y}_j = \hat{m}(x_j) = \sum_{i=1}^{m} w_{ij}\, y_i$
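Equation 14 is then just a weighted sum, i.e., a dot product of the weight vector with the memory responses. A minimal sketch under the same Gaussian-kernel assumption:

```python
import numpy as np

def nw_estimate(x_query, x_mem, y_mem, h=1.0):
    """Zero-order (Nadaraya-Watson) kernel regression fit at x_query (eq. 14)."""
    u = (x_mem - x_query) / h
    w = np.exp(-0.5 * u ** 2)      # Gaussian kernel weights (an assumption)
    w /= w.sum()                   # normalise: w_1j + ... + w_mj = 1
    return w @ y_mem               # y_hat_j = sum_i w_ij * y_i

x_mem = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
y_mem = np.array([1.0, 2.1, 3.2, 3.9, 5.1])
y_hat_j = nw_estimate(2.5, x_mem, y_mem, h=1.5)
```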
Following the procedure of weighted least squares, the estimated coefficients for the locally weighted regression fit at x_j are then found via:

17. $\hat{\boldsymbol{\beta}}_j = \left( \mathbf{X}'\mathbf{W}_j\mathbf{X} \right)^{-1} \mathbf{X}'\mathbf{W}_j\,\mathbf{y}$
where $\hat{\boldsymbol{\beta}}_j = \left( \hat{\beta}_{0j} \;\; \hat{\beta}_{1j} \;\; \cdots \;\; \hat{\beta}_{dj} \right)'$ is the column vector of regression coefficients, $\mathbf{W}_j$ is the m×m diagonal matrix with the weights $w_{1j}, \ldots, w_{mj}$ on its diagonal, and $\mathbf{X}$ is the matrix
$\mathbf{X} = \begin{pmatrix} 1 & x_1 & x_1^2 & \cdots & x_1^d \\ \vdots & \vdots & \vdots & & \vdots \\ 1 & x_m & x_m^2 & \cdots & x_m^d \end{pmatrix}$

for locally weighted regression, determined by the degree d of the polynomial. Note that a column of constants (1s) is the first column; this corresponds to the constant term $\hat{\beta}_{0j}$ in the equation below. Thus, provided $\left( \mathbf{X}'\mathbf{W}_j\mathbf{X} \right)^{-1}$ exists, the fit at x_j is obtained as:

19. $\hat{y}_j = \mathbf{x}_j \hat{\boldsymbol{\beta}}_j = \hat{\beta}_{0j} + \hat{\beta}_{1j} x_j + \hat{\beta}_{2j} x_j^{2} + \cdots + \hat{\beta}_{dj} x_j^{d}$
where $\mathbf{x}_j$ is the j-th row of the X matrix. Note that a separate regression on all the memory points has to be carried out for every query point, i.e., the coefficients have to be re-estimated for every x_j (though they are used to estimate y_j only for the j-th point). This makes local polynomial regression even more computationally intensive than simple kernel regression for sizeable memory and query sets. Authors generally agree that for the majority of cases a first-order fit (local linear regression, d = 1) is an adequate choice: local linear regression balances computational ease with enough flexibility to reproduce the patterns present in the data. Nonetheless, local linear regression may fail to capture sharp curvature if it is present in the data. In such cases, local quadratic regression (d = 2) may be needed to provide an adequate fit. Most authors agree there is usually no need for polynomials of order d > 2.
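To make the procedure concrete, here is a minimal sketch of the locally weighted polynomial fit at a single query point, implementing eqs. 17 and 19 and then looping over a set of query points, since the coefficients must be re-estimated for every x_j. The Gaussian kernel, the bandwidth h and the degree d used here are illustrative assumptions, not the note's prescriptions.

```python
import numpy as np

def local_poly_fit(x_query, x_mem, y_mem, d=1, h=1.0):
    """Locally weighted polynomial fit of degree d at one query point.

    Computes beta_hat_j = (X'W_jX)^(-1) X'W_j y (eq. 17) and evaluates the
    local polynomial at x_query (eq. 19). Gaussian kernel is an assumption.
    """
    u = (x_mem - x_query) / h
    w = np.exp(-0.5 * u ** 2)                        # kernel weights w_ij
    X = np.vander(x_mem, N=d + 1, increasing=True)   # columns: 1, x_i, ..., x_i^d
    W = np.diag(w)                                   # diagonal weight matrix W_j
    beta = np.linalg.solve(X.T @ W @ X, X.T @ W @ y_mem)   # eq. 17
    x_row = x_query ** np.arange(d + 1)              # (1, x_j, x_j^2, ..., x_j^d)
    return x_row @ beta                              # eq. 19

# A separate weighted regression is carried out for every query point:
x_mem = np.linspace(0.0, 14.0, 30)
y_mem = np.sin(x_mem / 2.0) + 0.1 * np.random.default_rng(0).normal(size=30)
x_queries = np.linspace(0.0, 14.0, 100)
y_fit = np.array([local_poly_fit(q, x_mem, y_mem, d=1, h=2.0) for q in x_queries])
```

Using np.linalg.solve rather than forming the inverse explicitly is numerically preferable, but it is mathematically equivalent to eq. 17.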
The data in memory will now take the form of pairs of vectors of values of the independent and dependent variables, $(\mathbf{X}_1, y_1), (\mathbf{X}_2, y_2), \ldots, (\mathbf{X}_m, y_m)$, where $\mathbf{X}_i$ is the i-th independent variable observation vector⁸, $\mathbf{X}_i = \left[ x_{1i} \;\; x_{2i} \;\; \cdots \;\; x_{ki} \right]'$, and the full matrix of independent-variable observations is

$\mathbf{X} = \begin{pmatrix} x_{11} & x_{21} & \cdots & x_{k1} \\ \vdots & \vdots & & \vdots \\ x_{1m} & x_{2m} & \cdots & x_{km} \end{pmatrix}$
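As a small illustration of this data layout (the array names and dimensions here are purely hypothetical), the memory can be held as an m-by-k matrix whose i-th row is $\mathbf{X}_i'$, alongside the vector of dependent-variable values:

```python
import numpy as np

m, k = 200, 3                       # m memory points, k independent variables
rng = np.random.default_rng(0)      # placeholder data for illustration only
X_mem = rng.normal(size=(m, k))     # row i of X_mem is X_i' = (x_1i, ..., x_ki)
y_mem = rng.normal(size=m)          # corresponding dependent-variable values y_i

X_1 = X_mem[0]                      # the first observation vector X_1
```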
⁷ Matrices and vectors are shown in bold type.
⁸ It is more convenient for later work to express this as a column vector, hence the transpose operator.