Process Convolutions

Process convolutions provide a convenient representation of Gaussian processes. A Gaussian process can be defined by convolving a kernel function with white noise. This results in a stationary process whose covariance is related to the Fourier transform of the kernel. Discrete approximations of the kernel convolution can be used for practical applications and allow for non-stationary processes by using location-dependent kernels or latent processes. The model can be fitted by treating it as a linear model with a specially structured design matrix defined by the kernel. Generalizations allow for non-Gaussian latent processes and spatially-varying kernels.

Uploaded by

Felipe Leiva

Process Convolutions

A convenient representation of a Gaussian process is given by process convolutions. Consider a kernel function $k(s)$ and a white noise process $w(s)$, with $E(w(s)) = 0$, $\mathrm{var}(w(s)) = \sigma^2$ and $\mathrm{cov}(w(s), w(s')) = 0$ for $s \neq s'$. Then define

$$x(s) = \int_S k(s - u)\, w(u)\, du$$

More formally we define the process as

$$x(s) = \int_S k(s - u)\, dB(u), \quad \text{where } \int_A dB(u) = B(A) \sim N(0, \sigma^2 |A|)$$

and $\mathrm{cov}(B(A), B(C)) = \sigma^2 |A \cap C|$, which corresponds to a Brownian motion.
Process Convolutions
We have that $E(x(s)) = 0$,

$$\mathrm{var}(x(s)) = \sigma^2 \int k^2(s - u)\, du$$

and

$$\mathrm{cov}(x(s), x(s')) = \sigma^2 \int k(s - u)\, k(s' - u)\, du = \sigma^2 \int k(t)\, k(t - d)\, dt$$

where $d = s - s'$ and $t = s - u$. Thus the process $x(s)$ is stationary.

The Fourier transform of the covariance of $x(s)$ is, up to the factor $\sigma^2$, the square of the Fourier transform of $k$ (for a symmetric kernel). Thus, for a given covariance $C$, the corresponding kernel is the inverse transform of the square root of the spectrum of $C$:

$$k = \mathrm{IFT}\left(\sqrt{\mathrm{FT}(C)}\right)$$
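The relation between the kernel and the spectrum of the covariance can be checked numerically. The following is a minimal sketch, assuming a one-dimensional process, $\sigma^2 = 1$ and a Gaussian correlation $C(d) = \exp(-d^2/2)$; the grid size `n`, the spacing `dx` and all variable names are illustrative choices, not part of the slides. For this particular $C$ the kernel is known in closed form, $k(s) = (2/\pi)^{1/4} \exp(-s^2)$, which gives something to compare against.

```python
import numpy as np

n, dx = 1024, 0.05                               # assumed grid size and spacing
# Lags in FFT order: 0, dx, 2dx, ..., -2dx, -dx
lags = dx * np.fft.fftfreq(n, d=1.0 / n)

C = np.exp(-0.5 * lags**2)                       # Gaussian correlation, unit length scale

# Discrete approximation of the continuous Fourier transform of C.
# C is symmetric, so the transform is real; clip tiny negative round-off.
spectrum = np.clip(np.fft.fft(C).real * dx, 0.0, None)

# k = IFT( sqrt( FT(C) ) ), rescaled back to the continuous convention
k = np.fft.ifft(np.sqrt(spectrum)).real / dx

# Closed-form kernel for this C (Gaussian with half the variance)
k_exact = (2.0 / np.pi) ** 0.25 * np.exp(-lags**2)
max_abs_error = np.max(np.abs(k - k_exact))
```

The recovered kernel is again Gaussian but narrower than $C$, in line with the statement that the Gaussian correlation corresponds to a Gaussian kernel.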
Process Convolutions Examples
The Gaussian correlation corresponds to a Gaussian kernel.

The Matérn correlation corresponds to the kernel

$$k(s) = \left|\frac{s}{\phi}\right|^{\nu/2 - 1/2} K_{\nu/2 - 1/2}\!\left(\left|\frac{s}{\phi}\right|\right), \quad \phi > 0,\ \nu > 1$$

The exponential correlation corresponds to a spike.

The covariance has twice the number of derivatives that the kernel has.

A kernel offering varying degrees of smoothing and compact support is the Bezier kernel

$$k(s) = \begin{cases} (1 - \|s\|^2)^{\nu} & \text{if } \|s\| < 1 \\ 0 & \text{otherwise,} \end{cases} \qquad \nu > 0$$
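The Bezier kernel is simple to evaluate directly. The sketch below assumes the exponent is called `nu` and that locations are passed as arrays whose last axis holds the coordinates; the function name `bezier_kernel` is hypothetical.

```python
import numpy as np

def bezier_kernel(s, nu=1.0):
    """Evaluate (1 - ||s||^2)^nu inside the unit ball and 0 outside.

    s has shape (..., dim); the result has shape (...).
    """
    r2 = np.sum(np.asarray(s, dtype=float) ** 2, axis=-1)
    # Clip before taking the power so points outside the ball never
    # produce a negative base; np.where then sets them to zero.
    return np.where(r2 < 1.0, np.clip(1.0 - r2, 0.0, None) ** nu, 0.0)

pts = np.array([[0.0, 0.0], [0.5, 0.0], [1.2, 0.0]])
values = bezier_kernel(pts, nu=2.0)   # centre, inside the support, outside the support
```

Larger values of `nu` give a kernel that decays more smoothly to zero at the edge of its support.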
Discrete Approximations

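In practice we consider a grid of points in $S$ to approximate the kernel convolution. So, for $u_1, \ldots, u_p$ we have that

$$x(s) = \sum_{j=1}^{p} k(s - u_j)\, w(u_j)$$

We observe that

$$\mathrm{var}(x(s)) = \sigma^2 \sum_{j=1}^{p} k^2(s - u_j)$$

and

$$\mathrm{cov}(x(s), x(s')) = \sigma^2 \sum_{j=1}^{p} k(s - u_j)\, k(s' - u_j)$$

which implies that the resulting covariance is NOT stationary: it depends on where $s$ and $s'$ lie relative to the grid, not only on $s - s'$.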
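The loss of stationarity under discretization can be seen by evaluating the discrete covariance for two pairs of points with the same lag. This is a minimal sketch, assuming a one-dimensional domain, a Gaussian kernel with an arbitrary scale of 0.5 and $\sigma^2 = 1$; the function names are hypothetical.

```python
import numpy as np

def gauss_kernel(d, scale=0.5):
    return np.exp(-0.5 * (d / scale) ** 2)

def discrete_cov(s, s_prime, knots, sigma2=1.0):
    """sigma^2 * sum_j k(s - u_j) * k(s' - u_j) for scalar locations."""
    return sigma2 * np.sum(gauss_kernel(s - knots) * gauss_kernel(s_prime - knots))

coarse = np.linspace(0.0, 10.0, 6)     # knots 0, 2, ..., 10
dense = np.linspace(0.0, 10.0, 101)    # knots 0, 0.1, ..., 10

# Same lag (0.5) at two positions relative to the grid
c_at_knot = discrete_cov(2.0, 2.5, coarse)    # s sits on a knot
c_between = discrete_cov(3.0, 3.5, coarse)    # s sits between two knots

# With a dense grid the two covariances nearly coincide
d_at_knot = discrete_cov(2.0, 2.5, dense)
d_between = discrete_cov(3.0, 3.5, dense)
```

On the coarse grid the two values differ markedly, while on the dense grid they agree closely, so the discrete process approaches the stationary continuous one as the grid is refined.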
Non-Stationarity

The continuous version of the kernel convolution can be used to obtain non-stationary Gaussian processes by:
- Kernel dependent on location-varying parameters

$$x(s) = \int_S k(s - u; \theta(u))\, w(u)\, du$$

- Convolving process with covariance function dependent on location-varying parameters

$$x(s) = \int_S k(s - u)\, w_{\theta(s)}(u)\, du$$
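The first construction can be illustrated with a discretised convolution whose kernel bandwidth changes with the knot location. The sketch below is only an illustration under stated assumptions: a one-dimensional domain, a Gaussian kernel, and a made-up linear bandwidth function `scale` standing in for the location-varying parameter. The white-noise variance cancels in the correlation, so it is omitted.

```python
import numpy as np

knots = np.linspace(0.0, 10.0, 101)      # grid u_1, ..., u_p
scale = 0.2 + 0.1 * knots                # hypothetical bandwidth, growing with u
du = knots[1] - knots[0]

def kernel_row(s):
    """k(s - u_j; theta(u_j)) for every knot, with a Gaussian kernel."""
    return np.exp(-0.5 * ((s - knots) / scale) ** 2)

def corr(s, s_prime):
    """Correlation of x(s) and x(s') implied by the discretised convolution."""
    ks, kp = kernel_row(s), kernel_row(s_prime)
    cov = np.sum(ks * kp) * du
    return cov / np.sqrt(np.sum(ks**2) * du * np.sum(kp**2) * du)

corr_left = corr(2.0, 2.5)     # narrow kernels near u = 2
corr_right = corr(8.0, 8.5)    # wide kernels near u = 8
```

The same lag gives a weaker correlation where the kernels are narrow and a stronger one where they are wide, which is the non-stationary behaviour the construction is meant to produce.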
Fitting the Model
Given a set of observations $y_1, \ldots, y_m$ at locations $s_1, \ldots, s_m$ we fit the model

$$y_i = \mu(s_i) + \sum_{j=1}^{p} k(s_i - u_j; \phi)\, w_j + \varepsilon_i, \qquad \varepsilon_i \sim N(0, \tau^2)$$

with priors $w_j \sim N(0, \sigma^2)$, $p(\sigma^2)$, $p(\phi)$ and $p(\tau^2)$.

This is a hierarchical linear model where the design matrix is defined by the kernel:

$$Y = \mu + K(\phi)\, w + \varepsilon$$

The dimension of $w$ is $p$, which corresponds to the size of the grid that is used for $w(s)$. As $p$ is much smaller than $m$, this results in an important reduction in the dimension of the problem for computational purposes. Usually $\mu(s) = z^T(s)\beta$.
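Because the model is linear in $w$, the conditional posterior of $w$ given the variances and the kernel parameter is Gaussian and available in closed form. The following is a minimal sketch of that single step, not a full fitting procedure. It assumes a one-dimensional domain, a Gaussian kernel, a zero mean function, and known values for the kernel scale `phi`, the prior variance `sigma2` and the error variance `tau2`; all names and values are illustrative, and the data are simulated.

```python
import numpy as np

rng = np.random.default_rng(0)

def design_matrix(s, knots, phi):
    """K[i, j] = k(s_i - u_j; phi) for a Gaussian kernel with scale phi."""
    d = s[:, None] - knots[None, :]
    return np.exp(-0.5 * (d / phi) ** 2)

m, p = 200, 15                                  # p is much smaller than m
s = np.sort(rng.uniform(0.0, 10.0, m))          # observation locations
knots = np.linspace(0.0, 10.0, p)               # grid for the latent process
phi, sigma2, tau2 = 1.0, 1.0, 0.05              # treated as known in this sketch

K = design_matrix(s, knots, phi)
w_true = rng.normal(0.0, np.sqrt(sigma2), p)
y = K @ w_true + rng.normal(0.0, np.sqrt(tau2), m)   # simulated data, mu(s) = 0

# w | y, sigma2, tau2, phi ~ N(w_mean, w_cov)
precision = K.T @ K / tau2 + np.eye(p) / sigma2
w_cov = np.linalg.inv(precision)
w_mean = np.linalg.solve(precision, K.T @ y / tau2)

fitted = K @ w_mean
rmse = np.sqrt(np.mean((y - fitted) ** 2))
```

Only a $p \times p$ system is solved, whatever the number of observations $m$. In a full analysis this step would sit inside a sampler that also updates the variances and the kernel parameter.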
Non-Gaussian Processes

In many situations we need to consider spatial processes that are not Gaussian. Here are several options:
- Generalized linear models. Use a Gaussian process to model the mean function of the observations transformed using the link function.
- Non-linear transformations and clipping. This is useful, for example, for binary data.
- Process convolutions for non-Gaussian latent processes. The $w(u_i)$ can be taken as realizations of distributions other than Gaussian. For example, a positive distribution will provide a positive-valued random field.
Non-Gaussian Process Convolutions
Elaborating on the previous slide, let

$$x(s) = \sum_{j=1}^{p} k(s - u_j; \phi)\, w_j, \qquad w_j \sim F$$

If $F$ is a distribution with support in $\mathbb{R}^+$ and the kernel is non-negative, then $x(s) \geq 0$, $\forall s$.

Consider the normalized kernel

$$\tilde{k}(s - u_j; \phi) = \frac{k(s - u_j; \phi)}{\sum_{l} k(s - u_l; \phi)}$$

then

$$x(s) = \sum_{j=1}^{p} \tilde{k}(s - u_j; \phi)\, w_j, \qquad w_j \sim F$$

If $F$ has $(0, 1)$ support, like a beta distribution, then $x(s) \in (0, 1)$, $\forall s$, as $x(s)$ is a convex combination of the $w_j$.
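Both constructions can be sketched in a few lines. The example assumes a one-dimensional domain, a Gaussian kernel with unit scale, gamma-distributed weights for the positive field and beta-distributed weights for the bounded field; the distribution parameters and variable names are arbitrary illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(1)

knots = np.linspace(0.0, 10.0, 12)     # grid u_1, ..., u_p
s = np.linspace(0.0, 10.0, 200)        # locations where x(s) is evaluated

# K[i, j] = k(s_i - u_j), strictly positive for a Gaussian kernel
K = np.exp(-0.5 * (s[:, None] - knots[None, :]) ** 2)

# Normalized kernel: each row sums to one
K_tilde = K / K.sum(axis=1, keepdims=True)

w_pos = rng.gamma(shape=2.0, scale=1.0, size=knots.size)   # support on R+
w_unit = rng.beta(2.0, 5.0, size=knots.size)               # support on (0, 1)

x_pos = K @ w_pos            # non-negative random field
x_unit = K_tilde @ w_unit    # convex combination of the w_j at every location
```

Since each row of `K_tilde` holds non-negative weights that sum to one, `x_unit` can never leave the range spanned by the smallest and largest `w_unit`.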
Spatially-Varying Kernels

A process with spatially varying kernel can be written as

$$x(s) = \sum_{i=1}^{p} b(s - u_i; \theta(s))\, w_i$$

where

$$b(s - u_j; \theta) \propto \begin{cases} (1 - \|s - u_j\|^2)^{\theta_1} & \text{if } \|s - u_j\| < 1 \\ 0 & \text{otherwise} \end{cases}$$

and $\theta = (\theta_1, \ldots, \theta_4)$. The distance is given as

$$\|s - u_j\| = \sqrt{\big((x_s - x_j), (y_s - y_j)\big)\, \Sigma^{-1}\, \big((x_s - x_j), (y_s - y_j)\big)^T}$$
Bezier Kernels

[Figure: panels a and b, plotted against longitude and latitude, with a scale for K(., w) running from 0.0 to 1.0.]
Spatially-Varying Kernels
The ellipsoidal shape is controlled by the parameters in

$$\Sigma^{-1} = \begin{pmatrix} \alpha_1 + \alpha_2 \cos 2\theta_4 & \alpha_2 \sin 2\theta_4 \\ \alpha_2 \sin 2\theta_4 & \alpha_1 - \alpha_2 \cos 2\theta_4 \end{pmatrix}$$

$$(\alpha_1, \alpha_2) = \frac{1}{2}\left(\frac{1}{a^2} + \frac{1}{A^2},\ \frac{1}{a^2} - \frac{1}{A^2}\right),$$

$$a = L + \theta_2 (U - L), \quad A = a + \theta_3 (U - a), \quad \theta_2, \theta_3 \in (0, 1)$$

So the semi-minor and semi-major axes $a$ and $A$ belong to $(L, U)$.

The spatial variation of $\theta$ is obtained, with a normalized $b$, as

$$\theta(s) = \sum_{i=1}^{p} b(s - u_i; \theta^{*})\, \theta(u_i), \qquad \theta^{*} = (2, 1, 0, 0)$$

with appropriate uniform priors on each $\theta_k(u_i)$, $k = 1, \ldots, 4$.
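The mapping from the parameters to the ellipse can be sketched as follows. The code assumes the parameterization in which the matrix has eigenvalues $1/a^2$ and $1/A^2$, so that the unit ball of the distance is an ellipse with semi-axes $a$ and $A$ rotated by the angle $\theta_4$. The bounds `L` and `U`, the intermediate names `alpha1` and `alpha2`, and the function names are illustrative assumptions.

```python
import numpy as np

def sigma_inv(theta2, theta3, theta4, L=0.5, U=3.0):
    """Return (Sigma^{-1}, a, A) for the elliptical kernel distance."""
    a = L + theta2 * (U - L)          # semi-minor axis, in (L, U)
    A = a + theta3 * (U - a)          # semi-major axis, in (a, U)
    alpha1 = 0.5 * (1.0 / a**2 + 1.0 / A**2)
    alpha2 = 0.5 * (1.0 / a**2 - 1.0 / A**2)
    c, s = np.cos(2.0 * theta4), np.sin(2.0 * theta4)
    S_inv = np.array([[alpha1 + alpha2 * c, alpha2 * s],
                      [alpha2 * s, alpha1 - alpha2 * c]])
    return S_inv, a, A

def elliptical_distance(s, u, S_inv):
    """sqrt((s - u) Sigma^{-1} (s - u)^T) for two points in the plane."""
    d = np.asarray(s, dtype=float) - np.asarray(u, dtype=float)
    return np.sqrt(d @ S_inv @ d)

theta4 = 0.3
S_inv, a, A = sigma_inv(theta2=0.2, theta3=0.5, theta4=theta4)
eigenvalues = np.linalg.eigvalsh(S_inv)          # ascending order

# A point at Euclidean distance a from the knot, in the direction theta4,
# sits exactly on the boundary of the kernel support.
edge = a * np.array([np.cos(theta4), np.sin(theta4)])
dist_edge = elliptical_distance(edge, [0.0, 0.0], S_inv)
```

With these values `a` is 1.0 and `A` is 2.0, and the eigenvalues of the matrix are the reciprocals of their squares.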
Scallops Data

[Figure: four maps plotted against longitude and latitude.]

Using the DPC with fixed-parameter Bezier kernels on the scallops data. We use a fixed ellipsoidal kernel that follows the coastline.
Space-Varying Kernels

[Figure: four maps plotted against longitude and latitude.]

Letting the four parameters be DPCs with spherical kernels we have space-varying ellipsoidal shapes and smoothness. Lower panels: eccentricity (left) and smoothness (right).
Modeling issues

In the previous model we add measurement error and usually a linear drift.
- We need to specify a prior for $w$. Usually $p(w) \propto 1$ works fine.
- The compact support of $b(\cdot)$ makes the design matrix of the linear model sparse. This can be used to speed up calculations.
- How many knots do we use? Where do we put them? There are no good answers to these questions. We can use model comparison methods to choose between different configurations. Nevertheless, intuitively, the space-varying nature of the support should compensate for small or sparse grids.
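The sparsity produced by a compactly supported kernel is easy to quantify. The sketch below assumes a regular 11 by 11 grid of knots, 500 random observation locations, a circular Bezier kernel with exponent 2 and a support radius of 1.5; all of these values and names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(2)

def bezier(r2, nu=2.0):
    """Bezier kernel as a function of the squared, rescaled distance r2."""
    return np.where(r2 < 1.0, np.clip(1.0 - r2, 0.0, None) ** nu, 0.0)

gx, gy = np.meshgrid(np.linspace(0.0, 10.0, 11), np.linspace(0.0, 10.0, 11))
knots = np.column_stack([gx.ravel(), gy.ravel()])    # p = 121 knots
obs = rng.uniform(0.0, 10.0, size=(500, 2))          # m = 500 locations

radius = 1.5                                         # assumed support radius
diff = obs[:, None, :] - knots[None, :, :]
r2 = np.sum(diff**2, axis=-1) / radius**2

B = bezier(r2)                                       # m x p design matrix
nonzero_fraction = np.count_nonzero(B) / B.size
```

Only a small fraction of the entries of `B` are non-zero, so a sparse matrix format and sparse linear algebra routines could store and process the design matrix far more cheaply than a dense array.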
