5 3-2 Spatial Environmental Data Model Selection Long-Range Dependencies
2. Model Selection
…to fit Gaussian processes on a variety of data without even that much prior knowledge. It's still good to know what these different kernels do, so that you can already come up with a good set of candidate kernels. But other than that, you can actually fit the rest to the data at hand.
Initially, let's recall our setup, where we have a pair of multivariate Gaussian random variables $\mathbf{X}_1 \in \mathbb{R}^{d}$ and $\mathbf{X}_2 \in \mathbb{R}^{N-d}$. These two random variables represent the temperature at two sets of cities: $\mathbf{X}_1$ corresponds to the cities for which we do not have temperature measurements, and $\mathbf{X}_2$ to the cities for which we do have temperature measurements. In addition, we also have access to the means of both of these random variables, denoted by $\mu_1$ and $\mu_2$ respectively; these are the mean temperatures at each of the cities.
The random variables are associated with physical locations represented by the variables $\mathbf{Z}_1 \in \mathbb{R}^{M \times d}$ and $\mathbf{Z}_2 \in \mathbb{R}^{M \times (N-d)}$, where $M$ is the dimension of the spatial data. Further, we have selected a covariance function $k(z_i, z_j)$ that serves as a proxy for the relation between two random variables as a function of their spatial locations. We use this kernel function to construct a covariance matrix, so that $\Sigma_{ij} = \mathrm{cov}(X_i, X_j) = k(z_i, z_j)$. Thus, we build the matrix

$$\Sigma = \begin{pmatrix} \Sigma_{11} & \Sigma_{12} \\ \Sigma_{21} & \Sigma_{22} \end{pmatrix},$$

whose blocks collect the kernel values within and across the two sets of locations.
In the previous sections we have shown that the distribution of the random variable $\mathbf{X}_1$ conditioned on $\mathbf{X}_2 = \mathbf{x}_2$ is again Gaussian, with mean and covariance

$$\mu_{\mathbf{X}_1 \mid \mathbf{X}_2} = \mu_1 + \Sigma_{12} \Sigma_{22}^{-1} (\mathbf{x}_2 - \mu_2),$$

$$\Sigma_{\mathbf{X}_1 \mid \mathbf{X}_2} = \Sigma_{11} - \Sigma_{12} \Sigma_{22}^{-1} \Sigma_{21}.$$
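As a concrete illustration, the conditional mean and covariance can be computed directly with NumPy. Everything below — the locations, temperatures, prior means, and the choice of a squared-exponential kernel with $\ell = 1$ — is a made-up toy example, not data from the course:

```python
import numpy as np

def rbf(a, b, ell=1.0):
    """Squared-exponential kernel k(z_i, z_j) = exp(-|z_i - z_j|^2 / (2 ell^2))."""
    d = a[:, None] - b[None, :]
    return np.exp(-d**2 / (2 * ell**2))

# Toy 1-D locations and temperatures (hypothetical values).
z2 = np.array([0.0, 1.0, 2.0])      # observed locations
x2 = np.array([15.0, 17.0, 16.0])   # observed temperatures
z1 = np.array([0.5, 1.5])           # unobserved locations
mu1 = np.full(len(z1), 16.0)        # prior mean at unobserved cities
mu2 = np.full(len(z2), 16.0)        # prior mean at observed cities

# Blocks of the covariance matrix built from the kernel.
S11 = rbf(z1, z1)
S12 = rbf(z1, z2)
S22 = rbf(z2, z2) + 1e-8 * np.eye(len(z2))  # small jitter for numerical stability

# mu_{X1|X2} = mu1 + S12 S22^{-1} (x2 - mu2)
mu_cond = mu1 + S12 @ np.linalg.solve(S22, x2 - mu2)
# Sigma_{X1|X2} = S11 - S12 S22^{-1} S21
S_cond = S11 - S12 @ np.linalg.solve(S22, S12.T)
```

Note that the conditional variances on the diagonal of `S_cond` are smaller than the prior variance of 1: observing nearby cities reduces our uncertainty about the unobserved ones.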
The main running assumption in this process is to model the variables to be measured, like temperature, as jointly normally distributed random variables, with correlations determined as a function of location through the kernel function $k(z_i, z_j)$. Once the means have been specified, we may predict the unobserved random variables using the conditional formulas above. The remaining modeling choice is the kernel itself; consider, for example, the squared-exponential kernel with length scale $\ell$:

$$k(y_i, y_j) = \exp\left( -\frac{\|y_i - y_j\|^2}{2\ell^2} \right).$$
We can say that $\theta = \{\ell\}$, and our objective is to find the “best” $\theta$ in some particular sense that will be defined later.
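To build intuition for what $\ell$ controls, here is a small sketch (pure NumPy, with toy numbers chosen for illustration) showing that a larger length scale keeps points at a fixed distance more strongly correlated:

```python
import numpy as np

def rbf(yi, yj, ell):
    # k(y_i, y_j) = exp(-|y_i - y_j|^2 / (2 ell^2))
    return np.exp(-np.abs(yi - yj)**2 / (2 * ell**2))

# Correlation between two points 1 unit apart, for several length scales.
for ell in (0.5, 1.0, 2.0):
    print(f"ell = {ell}: k = {rbf(0.0, 1.0, ell):.4f}")
# prints k ≈ 0.1353, 0.6065, 0.8825
```

Small $\ell$ makes correlations decay quickly with distance (wiggly functions); large $\ell$ makes distant points nearly perfectly correlated (smooth, slowly varying functions).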
The two approaches we will explore are:
1. Estimate the generalization error: cross-validation, leave-one-out, or k-fold. This defines a “good model” as one that best predicts data that we have not seen before, i.e., one that generalizes. This approach corresponds to the classical tension between having a model that fits the data well and, at the same time, generalizes to unobserved data.
2. Maximize the log marginal likelihood of the data, $p(y \mid X, \theta)$, with respect to $\theta$. Here we assume we have a probabilistic model, where we compute how likely the data we have seen is under the chosen model; in short, how well the model fits the data as measured by a normalized probability. This approach balances fitting power against the simplicity of the model.
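Both selection criteria can be sketched in a few lines of NumPy. The dataset, the candidate grid of length scales, and the kernel below are illustrative assumptions rather than the course's data; only the two criteria themselves follow the text above:

```python
import numpy as np

def rbf(a, b, ell):
    d = a[:, None] - b[None, :]
    return np.exp(-d**2 / (2 * ell**2))

def log_marginal_likelihood(z, x, mu, ell, jitter=1e-6):
    """log p(x | z, ell) for x ~ N(mu, K), with K built from the RBF kernel."""
    n = len(z)
    K = rbf(z, z, ell) + jitter * np.eye(n)
    r = x - mu
    _, logdet = np.linalg.slogdet(K)
    return -0.5 * r @ np.linalg.solve(K, r) - 0.5 * logdet - 0.5 * n * np.log(2 * np.pi)

def loo_error(z, x, mu, ell, jitter=1e-6):
    """Mean squared leave-one-out prediction error, using the GP conditional mean."""
    errs = []
    for i in range(len(z)):
        keep = np.arange(len(z)) != i
        K22 = rbf(z[keep], z[keep], ell) + jitter * np.eye(keep.sum())
        k12 = rbf(z[i:i + 1], z[keep], ell)
        pred = mu[i] + k12 @ np.linalg.solve(K22, x[keep] - mu[keep])
        errs.append((pred[0] - x[i]) ** 2)
    return float(np.mean(errs))

# Toy data: a smooth signal plus a little noise (illustrative only).
rng = np.random.default_rng(0)
z = np.linspace(0.0, 4.0, 9)
x = 16.0 + np.sin(z) + 0.1 * rng.standard_normal(len(z))
mu = np.full(len(z), 16.0)

candidates = [0.1, 0.5, 1.0, 2.0, 5.0]
best_by_lml = max(candidates, key=lambda ell: log_marginal_likelihood(z, x, mu, ell))
best_by_loo = min(candidates, key=lambda ell: loo_error(z, x, mu, ell))
```

In practice one would rarely grid-search by hand: libraries such as scikit-learn (`GaussianProcessRegressor`) fit kernel hyperparameters by maximizing the log marginal likelihood with a gradient-based optimizer.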
https://learning.edx.org/course/course-v1:MITx+6.419x+1T2021/block-v1:MITx+6.419x+1T2021+type@sequential+block@gp_lec3/block-v1:MITx+6.419x+1T2021+type@vertical+block@gp_lec3-tab2 2/3
5/16/2021 Sensing and Analyzing global patterns of dependence | Module 5: Environmental Data and Gaussian Processes | Data Analysis: Statistical Modeling and Computation in Applications | edX