SMSP

This document provides an introduction to smoothing splines. It discusses linear and polynomial regression models and their limitations in modeling data. It introduces the concept of roughness penalties, which aim to balance goodness of fit with smoothness of the fitted curve. It describes cubic splines, interpolating splines, and smoothing splines. Smoothing splines are defined as the minimizers of a penalized sum of squares, trading off residual error and roughness of the fitted curve. Algorithms for fitting smoothing splines using natural cubic splines are provided.


Introduction to

Smoothing Splines
Tongtong Wu
Feb 29, 2004
Outline
 Introduction
 Linear and polynomial regression, and
interpolation
 Roughness penalties
 Interpolating and Smoothing splines
 Cubic splines
 Interpolating splines
 Smoothing splines
 Natural cubic splines
 Choosing the smoothing parameter
 Available software
Key Words
 roughness penalty
 penalized sum of squares
 natural cubic splines
Motivation
[Four slides of scatterplots of the y18 data (y18 vs. Index); the final slide overlays a spline fit, Spline(y18).]
Introduction
 Linear and polynomial regression:
 Each observation has global influence on the fitted curve
 The polynomial degree can only be increased in discrete steps and cannot be controlled continuously
 Interpolation
 Unsatisfactory as an explanation of the given data
Roughness penalty approach
 A method for relaxing the model
assumptions in classical linear regression
along lines a little different from
polynomial regression.
Roughness penalty approach
 Aims of curve fitting
 A good fit to the data
 To obtain a curve estimate that does not
display too much rapid fluctuation
 Basic idea: making a necessary
compromise between the two rather
different aims in curve estimation
Roughness penalty approach
 Quantifying the roughness of a curve
An intuitive way:
    \int_a^b g''(t)^2 \, dt
(g: a twice-differentiable curve)
 Motivation from a formalization of a mechanical device: if a thin piece of flexible wood, called a spline, is bent to the shape of the graph of g, then the leading term in the strain energy is proportional to \int g''(t)^2 \, dt
Roughness penalty approach
 Penalized sum of squares
    S(g) = \sum_{i=1}^n \{Y_i - g(t_i)\}^2 + \alpha \int_a^b g''(t)^2 \, dt
 g: any twice-differentiable function on [a,b]
 \alpha: smoothing parameter ('rate of exchange' between residual error and local variation)
 Penalized least squares estimator
    \hat{g} = \arg\min_g S(g)
Roughness penalty approach
[Figure: fitted curve to the y18 data (y18 vs. Index) for a large value of \alpha.]
Roughness penalty approach
[Figure: fitted curve to the y18 data (y18 vs. Index) for a small value of \alpha.]
Interpolating and Smoothing Splines
 Cubic splines
 Interpolating splines
 Smoothing splines
 Choosing the smoothing parameter
Cubic Splines
 Given a<t1<t2<…<tn<b, a function g is a
cubic spline if
1. On each interval (a,t1), (t1,t2), …, (tn,b), g is a
cubic polynomial
2. The polynomial pieces fit together at points ti
(called knots) s.t. g itself and its first and
second derivatives are continuous at each ti,
and hence on the whole [a,b]
Cubic Splines
 How to specify a cubic spline
    g(t) = d_i (t - t_i)^3 + c_i (t - t_i)^2 + b_i (t - t_i) + a_i   for t_i \le t \le t_{i+1}
 g is a natural cubic spline (NCS) if its second and third derivatives are zero at a and b, which implies d_0 = c_0 = d_n = c_n = 0, so that g is linear on the two extreme intervals [a,t_1] and [t_n,b].
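As a quick check of the definition (a sketch using SciPy, not part of the slides; knots and values are made up), the 'natural' boundary condition is exactly the vanishing second derivative at the extreme knots:

```python
import numpy as np
from scipy.interpolate import CubicSpline

# Hypothetical knots t_i and values
t = np.array([0.0, 1.0, 2.0, 3.5, 5.0])
z = np.array([1.0, 3.0, 2.0, 4.0, 3.0])

# bc_type='natural' imposes the NCS boundary conditions g''(t_1) = g''(t_n) = 0
ncs = CubicSpline(t, z, bc_type='natural')

# Second derivative at the two extreme knots
end_curvatures = np.array([ncs(t[0], 2), ncs(t[-1], 2)])
```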
Natural Cubic Splines
Value-second derivative representation
 We can specify a NCS by giving its value and second derivative at each knot t_i.
 Define
    g = (g_1, \ldots, g_n)',   where g_i = g(t_i)
    \gamma = (\gamma_2, \ldots, \gamma_{n-1})',   where \gamma_i = g''(t_i)
which together specify the curve g completely.
 However, not all possible vectors (g, \gamma) represent a natural cubic spline!
Natural Cubic Splines
Value-second derivative representation
 Theorem 2.1
The vectors g and \gamma specify a natural cubic spline g if and only if
    Q'g = R\gamma
Then the roughness penalty satisfies
    \int_a^b g''(t)^2 \, dt = \gamma' R \gamma = g' K g
Natural Cubic Splines
Value-second derivative representation
 h11 0  0  hi  ti 1  ti for i  1,, n
 1 1 
 h1  h2 h21  0 
 h21  h21  h31  0 
Q 
 0 h31  0 
     
 1

 0 0  hn 1  n( n  2 )

1 1 
( h
3 1 3  h ) h2  0 
6
 1 1 
R  h 2 (h2  h3 )  0 
 6 3 
     
 0 0 
1
(hn  2  hn 1 )
 3  ( n  2 )( n  2 )
Natural Cubic Splines
Value-second derivative representation
 R is strictly diagonally dominant, i.e. |r_{ii}| > \sum_{j \ne i} |r_{ij}| for all i
 Hence R is strictly positive definite, so we can define
    K = Q R^{-1} Q'
Interpolating Splines
 Goal: to find a smooth curve that interpolates (t_i, z_i), i.e. g(t_i) = z_i for all i.
 Theorem 2.2
Suppose n \ge 2 and t_1 < \ldots < t_n. Given any values z_1, \ldots, z_n, there is a unique natural cubic spline g with knots at the t_i satisfying
    g(t_i) = z_i   for i = 1, \ldots, n
Interpolating Splines
 The natural cubic spline interpolant is the unique minimizer of \int g''^2 over S_2[a,b] among functions that interpolate the data.
 Theorem 2.3
Suppose g is the natural cubic spline interpolant and \tilde{g} \in S_2[a,b] with \tilde{g}(t_i) = z_i for i = 1, \ldots, n. Then
    \int \tilde{g}''^2 \ge \int g''^2
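Theorem 2.3 can be checked numerically. The sketch below (SciPy, made-up data) compares the roughness of the natural interpolant against a clamped-spline interpolant of the same points, which is another twice-differentiable interpolant:

```python
import numpy as np
from scipy.interpolate import CubicSpline

# Hypothetical data to interpolate
t = np.array([0.0, 1.0, 2.0, 3.0, 4.0, 5.0])
z = np.array([1.0, 3.0, 2.0, 4.0, 1.0, 2.0])

natural = CubicSpline(t, z, bc_type='natural')
# Another interpolant of the same data (zero end-slopes)
clamped = CubicSpline(t, z, bc_type='clamped')

def roughness(s, a, b, m=20001):
    """Approximate int_a^b s''(x)^2 dx by the trapezoidal rule."""
    x = np.linspace(a, b, m)
    y = s(x, 2) ** 2
    return float(np.sum((y[1:] + y[:-1]) * np.diff(x)) / 2.0)

# The natural interpolant should be the rougher curve's lower bound
r_natural = roughness(natural, t[0], t[-1])
r_clamped = roughness(clamped, t[0], t[-1])
```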
Smoothing Splines
 Recall the penalized sum of squares
    S(g) = \sum_{i=1}^n \{Y_i - g(t_i)\}^2 + \alpha \int_a^b g''(t)^2 \, dt
and the penalized least squares estimator \hat{g} = \arg\min_g S(g).
Smoothing Splines
1. The curve estimator \hat{g} is necessarily a natural cubic spline with knots at t_i, for i = 1, \ldots, n.
Proof sketch: given any twice-differentiable candidate \tilde{g}, let g be the natural cubic spline interpolating the values \tilde{g}(t_i). Then
    \sum_{i=1}^n \{Y_i - g(t_i)\}^2 = \sum_{i=1}^n \{Y_i - \tilde{g}(t_i)\}^2
while, by Theorem 2.3,
    \int_a^b g''(t)^2 \, dt \le \int_a^b \tilde{g}''(t)^2 \, dt
so S(g) \le S(\tilde{g}).
Smoothing Splines
2. Existence and uniqueness
Let Y = (Y_1, \ldots, Y_n)'. For a natural cubic spline g with value vector g = (g(t_1), \ldots, g(t_n))',
    \sum_{i=1}^n \{Y_i - g(t_i)\}^2 = (Y - g)'(Y - g)
and the roughness can be expressed as \int g''^2 = g' K g, so
    S(g) = (Y - g)'(Y - g) + \alpha g' K g
         = g'(I + \alpha K)g - 2 Y'g + Y'Y
The minimum is achieved by setting g = (I + \alpha K)^{-1} Y.
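A sketch of this direct solution (Python, made-up data loosely echoing the y18 example; dense linear algebra is used for clarity rather than the O(n) Reinsch algorithm):

```python
import numpy as np

def QR_matrices(t):
    """Band matrices Q and R of the value/second-derivative representation
    (repeated here so the sketch is self-contained)."""
    n = len(t)
    h = np.diff(t)
    Q = np.zeros((n, n - 2))
    R = np.zeros((n - 2, n - 2))
    for j in range(1, n - 1):
        Q[j - 1, j - 1] = 1.0 / h[j - 1]
        Q[j,     j - 1] = -1.0 / h[j - 1] - 1.0 / h[j]
        Q[j + 1, j - 1] = 1.0 / h[j]
        R[j - 1, j - 1] = (h[j - 1] + h[j]) / 3.0
        if j < n - 2:
            R[j - 1, j] = R[j, j - 1] = h[j] / 6.0
    return Q, R

# Hypothetical noisy data at 18 design points
t = np.arange(1.0, 19.0)
rng = np.random.default_rng(1)
Y = 3.0 * np.log(t) + rng.standard_normal(t.size)

Q, R = QR_matrices(t)
K = Q @ np.linalg.solve(R, Q.T)

def fitted_values(alpha):
    """g_hat = (I + alpha*K)^{-1} Y: smoothing-spline values at the knots."""
    return np.linalg.solve(np.eye(len(t)) + alpha * K, Y)

g_rough  = fitted_values(1e-8)    # tiny alpha: nearly interpolates the data
g_smooth = fitted_values(100.0)   # large alpha: heavily smoothed
```

Larger \alpha trades larger residuals for a smaller roughness g'Kg, exactly the compromise built into S(g).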
Smoothing Splines
2. Theorem 2.4
Let \hat{g} be the natural cubic spline with knots at the t_i for which g = (I + \alpha K)^{-1} Y. Then for any g in S_2[a,b],
    S(\hat{g}) \le S(g)
Smoothing Splines
3. The Reinsch algorithm
    Y = (I + \alpha K)g = (I + \alpha Q R^{-1} Q')g
    \Rightarrow g = Y - \alpha Q R^{-1} Q' g = Y - \alpha Q \gamma   (since R^{-1}Q'g = \gamma)
    \Rightarrow Q'Y = (R + \alpha Q'Q)\gamma
The matrix R + \alpha Q'Q has bandwidth 5 and is symmetric and strictly positive definite, therefore it has a Cholesky decomposition
    R + \alpha Q'Q = L D L'
Smoothing Splines
3. The Reinsch algorithm for spline smoothing
Step 1: Evaluate the vector Q'Y.
Step 2: Find the non-zero diagonals of R + \alpha Q'Q, and hence the Cholesky decomposition factors L and D.
Step 3: Solve L D L' \gamma = Q'Y for \gamma by forward and back substitution.
Step 4: Find g by g = Y - \alpha Q \gamma.
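The four steps can be sketched with SciPy's banded symmetric solver standing in for the explicit LDL' factorization (it performs a banded Cholesky factorization internally); data are made up, and the result is checked against the dense formula g = (I + \alpha K)^{-1}Y:

```python
import numpy as np
from scipy.linalg import solveh_banded

def QR_matrices(t):
    # Band matrices of the value/second-derivative representation
    n = len(t)
    h = np.diff(t)
    Q = np.zeros((n, n - 2))
    R = np.zeros((n - 2, n - 2))
    for j in range(1, n - 1):
        Q[j - 1, j - 1] = 1.0 / h[j - 1]
        Q[j,     j - 1] = -1.0 / h[j - 1] - 1.0 / h[j]
        Q[j + 1, j - 1] = 1.0 / h[j]
        R[j - 1, j - 1] = (h[j - 1] + h[j]) / 3.0
        if j < n - 2:
            R[j - 1, j] = R[j, j - 1] = h[j] / 6.0
    return Q, R

def reinsch(t, Y, alpha):
    """Steps 1-4: solve (R + alpha Q'Q) gamma = Q'Y with a banded SPD
    solver, then recover g = Y - alpha Q gamma."""
    Q, R = QR_matrices(t)
    M = R + alpha * (Q.T @ Q)            # pentadiagonal, bandwidth 5
    ab = np.zeros((3, M.shape[0]))       # upper banded storage for solveh_banded
    for k in range(3):                   # k-th superdiagonal
        ab[2 - k, k:] = np.diagonal(M, k)
    gamma = solveh_banded(ab, Q.T @ Y)   # Steps 2-3 (banded Cholesky solve)
    return Y - alpha * (Q @ gamma)       # Step 4

# Hypothetical data; compare banded and dense solutions
t = np.arange(1.0, 19.0)
rng = np.random.default_rng(2)
Y = 3.0 * np.log(t) + rng.standard_normal(t.size)
alpha = 2.0

g_band = reinsch(t, Y, alpha)
Q, R = QR_matrices(t)
K = Q @ np.linalg.solve(R, Q.T)
g_dense = np.linalg.solve(np.eye(len(t)) + alpha * K, Y)
```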
Smoothing Splines
4. Some concluding remarks
 The minimizing curve \hat{g} essentially does not depend on a and b, as long as all the data points lie between a and b.
 If n = 2, then for any \alpha, setting \hat{g} to be the straight line through the two points (t_1,Y_1) and (t_2,Y_2) reduces S(g) to zero.
 If n = 1, the minimizer is no longer unique, since any straight line through (t_1,Y_1) yields a zero value of S(g).
Choosing the Smoothing Parameter
 Two different philosophical
approaches
 Subjective choice
 Automatic method – chosen by data
 Cross-validation
 Generalized cross-validation
Choosing the Smoothing Parameter
 Cross-validation
    CV(\alpha) = n^{-1} \sum_{i=1}^n \{Y_i - \hat{g}^{(-i)}(t_i; \alpha)\}^2
               = n^{-1} \sum_{i=1}^n \left\{ \frac{Y_i - \hat{g}(t_i)}{1 - A_{ii}(\alpha)} \right\}^2
where \hat{g}^{(-i)} is the spline smoother computed from all the data except (t_i, Y_i), \hat{g} is the spline smoother with parameter \alpha, and A(\alpha) is the hat matrix; \alpha is chosen to minimize CV(\alpha).
 Generalized cross-validation
    GCV(\alpha) = \frac{n^{-1} \sum_{i=1}^n \{Y_i - \hat{g}(t_i)\}^2}{\{1 - n^{-1}\,\mathrm{tr}\,A(\alpha)\}^2}
                = \frac{n \times (\text{residual sum of squares})}{(\text{equivalent df})^2}
where the equivalent degrees of freedom are \mathrm{tr}(I - A(\alpha)); \alpha is chosen to minimize GCV(\alpha).
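A sketch of choosing \alpha by GCV on a grid (Python, made-up data; the hat matrix A(\alpha) = (I + \alpha K)^{-1} is formed densely for clarity):

```python
import numpy as np

def QR_matrices(t):
    # Band matrices of the value/second-derivative representation
    n = len(t)
    h = np.diff(t)
    Q = np.zeros((n, n - 2))
    R = np.zeros((n - 2, n - 2))
    for j in range(1, n - 1):
        Q[j - 1, j - 1] = 1.0 / h[j - 1]
        Q[j,     j - 1] = -1.0 / h[j - 1] - 1.0 / h[j]
        Q[j + 1, j - 1] = 1.0 / h[j]
        R[j - 1, j - 1] = (h[j - 1] + h[j]) / 3.0
        if j < n - 2:
            R[j - 1, j] = R[j, j - 1] = h[j] / 6.0
    return Q, R

# Hypothetical noisy data
t = np.arange(1.0, 19.0)
rng = np.random.default_rng(3)
Y = 3.0 * np.log(t) + rng.standard_normal(t.size)
n = len(t)

Q, R = QR_matrices(t)
K = Q @ np.linalg.solve(R, Q.T)

def gcv_score(alpha):
    """GCV(alpha) = n^{-1} RSS / {1 - n^{-1} tr A(alpha)}^2,
    with hat matrix A(alpha) = (I + alpha K)^{-1}."""
    A = np.linalg.inv(np.eye(n) + alpha * K)
    rss = float(np.sum((Y - A @ Y) ** 2))
    return (rss / n) / (1.0 - np.trace(A) / n) ** 2

# Minimize GCV over a logarithmic grid of smoothing parameters
alphas = 10.0 ** np.linspace(-3, 3, 25)
scores = np.array([gcv_score(a) for a in alphas])
best_alpha = alphas[np.argmin(scores)]
```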
Available Software
smooth.spline in R
 Description:
Fits a cubic smoothing spline to the supplied data.
 Usage:
plot(speed, dist)
cars.spl <- smooth.spline(speed, dist)
cars.spl2 <- smooth.spline(speed, dist, df=10)
lines(cars.spl, col = "blue")
lines(cars.spl2, lty=2, col = "red")
Available Software
Example 1
library(modreg)  # in modern R, smooth.spline lives in the standard 'stats' package
y18 <- c(1:3, 5, 4, 7:3, 2*(2:5), rep(10, 4))
xx <- seq(1, length(y18), len = 201)
(s2  <- smooth.spline(y18))             # smoothing parameter chosen by GCV
(s02 <- smooth.spline(y18, spar = 0.2))
plot(y18, main = deparse(s2$call), col.main = 2)
lines(s2,  col = "blue")
lines(s02, col = "orange")
lines(predict(s2,  xx), col = 2)
lines(predict(s02, xx), col = 3)
mtext(deparse(s02$call), col = 3)
Available Software
Example 1
[Figure: plot produced by the Example 1 code above.]
Available Software
Example 2
data(cars)   ## N=50, n (# of distinct x) = 19
attach(cars)
plot(speed, dist, main = "data(cars) & smoothing splines")
cars.spl <- smooth.spline(speed, dist)
cars.spl2 <- smooth.spline(speed, dist, df = 10)
lines(cars.spl, col = "blue")
lines(cars.spl2, lty = 2, col = "red")
lines(smooth.spline(cars, spar = 0.1))
## spar: smoothing parameter (alpha) in (0,1]
legend(5, 120,
       c(paste("default [C.V.] => df =", round(cars.spl$df, 1)),
         "s( * , df = 10)"),
       col = c("blue", "red"), lty = 1:2, bg = "bisque")
detach()
Available Software
Example 2
[Figure: plot produced by the Example 2 code above.]
Extensions of
Roughness penalty approach
 Semiparametric modeling: a simple application to multiple regression
    Y = g(t) + x'\beta + \epsilon
 Generalized linear models (GLM)
 Allowing all the explanatory variables to enter nonlinearly:
    Y = g(t) + \epsilon
 Additive model approach
    Y = \sum_{j=1}^d g_j(t_j) + \epsilon
Reference
 P.J. Green and B.W. Silverman (1994)
Nonparametric Regression and Generalized
Linear Models. London: Chapman & Hall