
Fundamentals of Statistical Learning

In this chapter we present the statistical fundamentals needed to understand the concepts in the topics that follow.

1 Expected Value
Let X = [x_1 \cdots x_p]^T be a random p-dimensional vector.

Definition. The expected value of X is defined as

E(X) = µ = \begin{pmatrix} E(x_1) \\ \vdots \\ E(x_p) \end{pmatrix}.
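As a quick numerical illustration, the following minimal Python/NumPy sketch estimates E(X) componentwise by the sample mean (the dimension, distribution, and sample size here are our own illustrative choices, not part of the text):

    import numpy as np

    rng = np.random.default_rng(0)
    mu = np.array([1.0, -2.0, 0.5])                       # true mean vector (illustrative)
    X = rng.normal(loc=mu, scale=1.0, size=(100_000, 3))  # 100k samples of a 3-dimensional X

    # The sample mean approximates E(X) = mu componentwise.
    print(X.mean(axis=0))   # close to [1.0, -2.0, 0.5]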

Properties
i If a is a constant vector, then E(a) = a.

Proof. E(a) = \begin{pmatrix} E(a_1) \\ \vdots \\ E(a_p) \end{pmatrix} = \begin{pmatrix} a_1 \\ \vdots \\ a_p \end{pmatrix} = a.

ii E(a + X) = a + E(X).

Proof.

E(a + X) = \begin{pmatrix} E(a_1 + x_1) \\ \vdots \\ E(a_p + x_p) \end{pmatrix} = \begin{pmatrix} a_1 + E(x_1) \\ \vdots \\ a_p + E(x_p) \end{pmatrix} = \begin{pmatrix} a_1 \\ \vdots \\ a_p \end{pmatrix} + \begin{pmatrix} E(x_1) \\ \vdots \\ E(x_p) \end{pmatrix} = a + E(X).

iii If A is a matrix (or a vector) such that the products AX and XA exist, then

a) E(AX) = AE(X),
b) E(XA) = E(X)A.

Proof.

a) E(AX) = E \begin{pmatrix} \sum_{j=1}^{p} a_{1j} x_j \\ \vdots \\ \sum_{j=1}^{p} a_{pj} x_j \end{pmatrix} = \begin{pmatrix} E(\sum_{j=1}^{p} a_{1j} x_j) \\ \vdots \\ E(\sum_{j=1}^{p} a_{pj} x_j) \end{pmatrix} = \begin{pmatrix} \sum_{j=1}^{p} a_{1j} E(x_j) \\ \vdots \\ \sum_{j=1}^{p} a_{pj} E(x_j) \end{pmatrix} = A E(X).

Part b) is proved analogously.

iv E(X + Y) = E(X) + E(Y).

Proof.

E(X + Y) = E \begin{pmatrix} x_1 + y_1 \\ \vdots \\ x_p + y_p \end{pmatrix} = \begin{pmatrix} E(x_1) + E(y_1) \\ \vdots \\ E(x_p) + E(y_p) \end{pmatrix} = \begin{pmatrix} E(x_1) \\ \vdots \\ E(x_p) \end{pmatrix} + \begin{pmatrix} E(y_1) \\ \vdots \\ E(y_p) \end{pmatrix} = E(X) + E(Y).

v If X and Y are independent, then E(XY) = E(X)E(Y).

Proof. We know that two random variables are independent if f(X, Y) = f(X)f(Y), or equivalently f(X|Y) = f(X). Then

E(XY) = \iint XY f(X, Y) \, dX \, dY
      = \iint XY f(X) f(Y) \, dX \, dY
      = \int X f(X) \, dX \int Y f(Y) \, dY = E(X)E(Y).
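The properties above can be checked empirically. A minimal Monte Carlo sketch in Python/NumPy (the matrix A, the means, and the distributions are arbitrary illustrative choices, not part of the text):

    import numpy as np

    rng = np.random.default_rng(1)
    n = 200_000
    mu = np.array([1.0, 2.0, 3.0])
    X = rng.normal(size=(n, 3)) + mu           # samples with E(X) = mu
    A = np.array([[1.0, 0.0, 2.0],
                  [0.5, -1.0, 0.0]])

    # Property iii a): E(AX) = A E(X) = A mu; estimate E(AX) from transformed samples.
    print((X @ A.T).mean(axis=0))              # close to A @ mu = [7.0, -1.5]
    print(A @ mu)

    # Property v): for independent x, y, E(xy) = E(x)E(y).
    x = rng.normal(2.0, 1.0, n)
    y = rng.normal(-1.0, 1.0, n)
    print((x * y).mean(), x.mean() * y.mean())  # both close to -2.0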

2 Dispersion matrix
Definition. The variance-covariance matrix of X, or dispersion matrix, denoted by Σ, is defined by

D(X) = Σ = E[(X − µ)(X − µ)^T].

Note that

(X − µ)(X − µ)^T = \begin{pmatrix} (x_1 − µ_1)^2 & \cdots & (x_1 − µ_1)(x_p − µ_p) \\ \vdots & \ddots & \vdots \\ (x_p − µ_p)(x_1 − µ_1) & \cdots & (x_p − µ_p)^2 \end{pmatrix},

therefore

Σ = \begin{pmatrix} V(x_1) & \cdots & cov(x_1, x_p) \\ \vdots & \ddots & \vdots \\ cov(x_p, x_1) & \cdots & V(x_p) \end{pmatrix}.
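A minimal Python/NumPy sketch of this definition (the covariance values below are illustrative assumptions): the sample dispersion matrix is the average of the outer products (X − µ)(X − µ)^T, with µ estimated by the sample mean.

    import numpy as np

    rng = np.random.default_rng(2)
    Sigma_true = np.array([[2.0, 0.5],
                           [0.5, 1.0]])
    X = rng.multivariate_normal(mean=[0.0, 0.0], cov=Sigma_true, size=100_000)

    # Average of outer products (X - mu)(X - mu)^T over the samples.
    Xc = X - X.mean(axis=0)
    Sigma_hat = (Xc.T @ Xc) / len(X)
    print(Sigma_hat)                    # close to Sigma_true
    print(np.cov(X, rowvar=False))      # same estimate (up to the n-1 divisor)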

Theorem. The variance-covariance matrix is positive-semidefinite.

Proof. A square symmetric real matrix A is positive-semidefinite if

y^T A y ≥ 0   ∀ y ∈ R^p.

We now show that the covariance matrix satisfies this condition.

y^T Σ y = y^T E[(X − µ)(X − µ)^T] y
        = E[y^T (X − µ) (X − µ)^T y]            (each factor y^T(X − µ) and (X − µ)^T y is 1×1)
        = E[((X − µ)^T y)^T ((X − µ)^T y)]
        = E[((X − µ)^T y)((X − µ)^T y)]          (a^T = a if a ∈ R)
        = E[((X − µ)^T y)^2] ≥ 0.
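A minimal numerical check of the theorem in Python/NumPy (the random data below is an illustrative assumption): a sample covariance matrix has nonnegative eigenvalues, and y^T Σ y ≥ 0 for an arbitrary y, as in the proof.

    import numpy as np

    rng = np.random.default_rng(3)
    X = rng.normal(size=(1_000, 4))      # 1000 samples of a 4-dimensional vector
    Sigma = np.cov(X, rowvar=False)

    # Positive-semidefinite: all eigenvalues of the symmetric matrix Sigma are >= 0.
    print(np.linalg.eigvalsh(Sigma))     # all nonnegative

    # And y^T Sigma y >= 0 for any y.
    y = rng.normal(size=4)
    print(y @ Sigma @ y >= 0)            # True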

Properties
i If b ∈ R^p is a constant vector, then D(b) = 0_{p×p}.

Proof.

D(b) = E \begin{pmatrix} (b_1 − b_1)(b_1 − b_1) & \cdots & (b_1 − b_1)(b_p − b_p) \\ \vdots & \ddots & \vdots \\ (b_p − b_p)(b_1 − b_1) & \cdots & (b_p − b_p)(b_p − b_p) \end{pmatrix} = E \begin{pmatrix} 0 & \cdots & 0 \\ \vdots & \ddots & \vdots \\ 0 & \cdots & 0 \end{pmatrix} = 0_{p×p}.

ii If b ∈ R^p is a constant vector, then D(b + X) = D(X).

Proof. Since the constant b cancels inside each entry,

D(b + X) = E \begin{pmatrix} (b_1 + x_1 − b_1 − µ_1)^2 & \cdots & (b_1 + x_1 − b_1 − µ_1)(b_p + x_p − b_p − µ_p) \\ \vdots & \ddots & \vdots \\ (b_p + x_p − b_p − µ_p)(b_1 + x_1 − b_1 − µ_1) & \cdots & (b_p + x_p − b_p − µ_p)^2 \end{pmatrix} = D(X).

iii If A is a constant matrix, or a vector, such that AX exists, then

D(AX) = AΣAT .

Proof.

D(AX) = E[(AX − Aµ)(AX − Aµ)^T]
      = E[A(X − µ)(A(X − µ))^T]
      = E[A(X − µ)(X − µ)^T A^T]
      = A E[(X − µ)(X − µ)^T] A^T = A Σ A^T.
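Property iii can also be verified numerically. A minimal Python/NumPy sketch (the matrix A and the covariance Σ are arbitrary illustrative choices):

    import numpy as np

    rng = np.random.default_rng(4)
    Sigma = np.array([[1.0, 0.3, 0.0],
                      [0.3, 2.0, 0.5],
                      [0.0, 0.5, 1.5]])
    X = rng.multivariate_normal(mean=np.zeros(3), cov=Sigma, size=200_000)
    A = np.array([[1.0, -1.0, 0.0],
                  [0.0,  2.0, 1.0]])

    # D(AX) estimated from the transformed samples vs. A Sigma A^T.
    D_AX = np.cov(X @ A.T, rowvar=False)
    print(np.allclose(D_AX, A @ Sigma @ A.T, atol=0.2))   # True up to Monte Carlo error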

iv If X and Y are independent random vectors, then

D(X + Y) = D(X) + D(Y).

Proof. Before proving this property, we first show that if X and Y are independent, then x_i and y_j are also independent for all (i, j) ∈ {1, …, p}^2; that is, the components of X and Y are also independent. Let X and Y be two independent random vectors, so that

f(X, Y) = f(X)f(Y).

For simplicity of notation, assume that X = [x_1, x_2]^T and Y = [y_1, y_2]^T. Let us prove that f(x_1 | y_1) = f(x_1):

f(x_1 | y_1) = f(x_1, y_1) / f(y_1)
             = (1 / f(y_1)) \iint f(x_1, x_2, y_1, y_2) \, dx_2 \, dy_2        (marginal probability)
             = (1 / f(y_1)) \iint f(x_1, x_2) f(y_1, y_2) \, dx_2 \, dy_2      (independence)
             = (1 / f(y_1)) \int f(x_1, x_2) \, dx_2 \int f(y_1, y_2) \, dy_2
             = f(x_1) f(y_1) / f(y_1) = f(x_1).
Then cov(x_i, y_j) = 0 for all (i, j) ∈ {1, …, p}^2. Now we have

D(X + Y) = \begin{pmatrix} cov(x_1 + y_1, x_1 + y_1) & \cdots & cov(x_1 + y_1, x_p + y_p) \\ \vdots & \ddots & \vdots \\ cov(x_p + y_p, x_1 + y_1) & \cdots & cov(x_p + y_p, x_p + y_p) \end{pmatrix}.

Then we only have to note that, since cov(x_i, y_j) = cov(y_i, x_j) = 0,

cov(x_i + y_i, x_j + y_j) = cov(x_i, x_j) + cov(x_i, y_j) + cov(y_i, x_j) + cov(y_i, y_j)
                          = cov(x_i, x_j) + cov(y_i, y_j).

Therefore D(X + Y) = D(X) + D(Y).
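A minimal numerical sketch of property iv in Python/NumPy (independence is enforced by drawing X and Y separately; the covariance matrices are illustrative assumptions):

    import numpy as np

    rng = np.random.default_rng(5)
    n = 200_000
    X = rng.multivariate_normal([0, 0], [[1.0, 0.2], [0.2, 1.0]], size=n)
    Y = rng.multivariate_normal([0, 0], [[2.0, -0.5], [-0.5, 1.0]], size=n)  # independent of X

    lhs = np.cov(X + Y, rowvar=False)
    rhs = np.cov(X, rowvar=False) + np.cov(Y, rowvar=False)
    print(np.allclose(lhs, rhs, atol=0.05))   # True: D(X + Y) = D(X) + D(Y)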

3 Covariance matrix
Let X = [x_1 \cdots x_p]^T and Y = [y_1 \cdots y_q]^T be two random vectors of dimension p and q, respectively, with mean values µ and ν.
Definition. The covariance matrix between X and Y is defined by

C(X, Y) = E[(X − µ)(Y − ν)^T] = \begin{pmatrix} cov(x_1, y_1) & \cdots & cov(x_1, y_q) \\ \vdots & \ddots & \vdots \\ cov(x_p, y_1) & \cdots & cov(x_p, y_q) \end{pmatrix}.
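A minimal Python/NumPy sketch of this definition (the joint distribution below is an illustrative assumption): the sample cross-covariance is the average of the outer products (X − µ)(Y − ν)^T, a p × q matrix.

    import numpy as np

    rng = np.random.default_rng(6)
    n = 100_000
    # Draw a 5-dimensional vector and split it into X (p = 3) and Y (q = 2).
    Z = rng.multivariate_normal(np.zeros(5),
                                np.eye(5) + 0.3 * np.ones((5, 5)), size=n)
    X, Y = Z[:, :3], Z[:, 3:]

    # C(X, Y) = E[(X - mu)(Y - nu)^T], estimated with sample means.
    Xc, Yc = X - X.mean(axis=0), Y - Y.mean(axis=0)
    C_hat = (Xc.T @ Yc) / n
    print(C_hat)                  # each entry close to 0.3, the true cross-covariance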

Properties
i C(X, X) = D(X).

ii C(X, Y) = C(Y, X)^T.

iii If A_{n×p} and B_{m×q} are real matrices, then C(AX, BY) = A C(X, Y) B^T.

Proof.

C(AX, BY) = E[(AX − Aµ)(BY − Bν)^T]
          = E[A(X − µ)(Y − ν)^T B^T]
          = A E[(X − µ)(Y − ν)^T] B^T
          = A C(X, Y) B^T.

iv Let U and V be two p- and q-dimensional random vectors with means γ and δ, respectively. Then

a) C(U + X, Y) = C(U, Y) + C(X, Y),
b) C(U, V + Y) = C(U, V) + C(U, Y).

Proof.

C(U + X, Y) = E[(U + X − γ − µ)(Y − ν)^T]
            = E[((U − γ) + (X − µ))(Y − ν)^T]
            = E[(U − γ)(Y − ν)^T + (X − µ)(Y − ν)^T]
            = C(U, Y) + C(X, Y).

Part b) is proved analogously.

v D(X + U) = D(X) + D(U) + C(X, U) + C(U, X).

Proof.

D(X + U) = C(X + U, X + U)
         = C(X, X) + C(X, U) + C(U, X) + C(U, U)
         = D(X) + D(U) + C(X, U) + C(U, X).
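Property v can be checked numerically when X and U are correlated, so that the cross-covariance terms do not vanish. A minimal Python/NumPy sketch (the joint distribution is an illustrative assumption):

    import numpy as np

    rng = np.random.default_rng(7)
    n = 200_000
    # Draw (X, U) jointly so that C(X, U) is nonzero.
    Z = rng.multivariate_normal(np.zeros(4),
                                np.eye(4) + 0.4 * np.ones((4, 4)), size=n)
    X, U = Z[:, :2], Z[:, 2:]

    Xc, Uc = X - X.mean(axis=0), U - U.mean(axis=0)
    C_XU = (Xc.T @ Uc) / n        # C(X, U); C(U, X) is its transpose
    lhs = np.cov(X + U, rowvar=False)
    rhs = np.cov(X, rowvar=False) + np.cov(U, rowvar=False) + C_XU + C_XU.T
    print(np.allclose(lhs, rhs, atol=0.1))   # True: the two cross terms are needed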
