Theory of Integration in Statistics: Courtesy: An Unknown Writer
Theory of Integration in Statistics: Courtesy: An Unknown Writer
Mohammed Nasser
Department of Statistics
Courtesy:
An unknown writer
Probability measures
Probability of an event
Lp space
Covariance, Correlation
Distributional Convergence
In Stochastic Process --
Pn P
f (w)dPn f (w)dP f C (),
Probability Theory
What is Integration?
f,a function
S
if ix
What is Integration?
f,a function
R2
R/R2
What is Integration?
f,a function
Rn
What is Integration?
f,a function
, any set
Integration =Integrate+ion
How long will the patient survive?
What percent of people earn more than 50,000 Tk
permonth?
What is Integration?
f,a function
B,
B-space
Bochner Integral
...
1 at irrationals
f
0 at rationals
Height
If we can measure the size of superlevel sets, we can integrate a lot of function!
-Algebras??
Measure theory is about measuring the size of things.
In math we measure the size of sets.
-Algebras??
Size should have the following property:
The size of the union of disjoint sets should equal the
sum of the sizes of the individual sets.
Makes sense.... but there is a problem with this...
Consider the interval [0,1]. It is the disjoint union of all
real numbers between 0 and 1. Therefore, according to
above, the size of [0,1] should be the sum of the sizes of a
single real number.
If the size of a singleton is 0 then the size of [0,1] is 0.
If the size of a singleton is non-zero, then the size of [0,1] is
infinity!
This shows we have to be a bit more careful.
1) F
C
A
A
F
2)
3) A1 , A2 F A1 A2 F
4) A1 , A2 , A3 ,... F
A F
i
i 1
An algebra of sets is
closed under finite set
operations. A -algebra
is closed under
countable set operations.
In mathematics, often
refers to countable.
In applications 2 is a metric space and F2 ,, Borel -algebra , the algebra generated by open sets.
How can we test measurability of a f?
The theorem that helps: f-1((S))=(f-1(S))
If f 1(S) F1
That implies f is F1 -F2 measurable.
1A1
1A2
A1
A2
A1 A2
11A1 21A2
A1 A2
Superlevel Sets
A1
A2
A1 A2
11A1 21A2
A1 A2
Superlevel Sets
A1
A2
Simple Functions
Simple functions are finite linear combinations of of
indicator functions. n
1
i 1
i Ai
Measurable Functions
Given a Measurable Space (,F)
We say that a function:
respect to F if:
f :
y f ( )
is measurable with
f ({ y }) F
1
[0,1)
F { , [0,1), [0, 12 ), [ 12 ,1)}
What do measurable functions look like?
Yes
Yes
1
2
1
2
No
1
2
Intuitively speaking, Measurable functions are constant on the sets in the -alg.
More accurately, they are limits of functions that are constant on the sets in the
-algebra. The in -algebra gives us this.
1
2
Definition of a measure:
Given a Measurable Space (,F),
A measure is a function
satisfying two properties:
:F
1) ( ) 0
2) If A1 , A2 , A3 ,... are disjoint
i 1
i 1
(Ai ) ( Ai )
(2) is known as countable additivity. However, it can
be interpreted as linearity and left continuity for sets.
Different Measures
Infinite different measures are available on a sample space
Counting measure: Let (A)= n if A contains n number
of elements, otherwise.
Discrete measure: Let ={x1, x2, -- - ,xn,-,- } .
p
i 1
xi
1, px 0, i
(A)
xi A
xi
Different Measures
Lebesgue measure ) : There is a unique measure m on
( R ,B ) that satisfies
m ( [a , b] ) = b - a
for every finite interval [a , b] , . This is called the Lebesgue
measure . If we restrict m to measurable space ( [0 , 1] ,
B) , then m is a probability measure
Lebesgue Stieltjes measure ) :Let F be a nondecreasing and right-continuous function from from R to
R.There is a unique measure m on ( R ,B ) that satisfies
m ( [a , b] ) = F(b) F(a)
for every finite interval [a , b] . This is called the LebesgueStieltjes measure . If we take bounded F, then m is a
Measure-Probability Measure-Cumulative
Distribution Function
Bounded measure
Probability
measure
Cumulative
Distribution
Function
F(x)=Pr{X<=x}
Point
Function
Set
Function
No easy
connection
with
measure
Continuity of Functions
For all sn
f(s )
f(s)
f: <S1,d1>
<S2,d2>
is continuous at s in S1.
Ai A then lim ( Ai ) ( A)
Ai A
( Ai )
then lim ( Ai ) ( A)
lim ( Ai ) ( A)
then
Ai A
Let
Continuity above-property
of Measure
f(A)
<S,B>
f-1(B)
1
i 1
n
i Ai
d ( A )
i 1
Positive Functions:
Since positive measurable functions can be written as the
increasing pointwise limit of simple functions, we define the
integral as
d
fd sup
f
For a general measurable function write
fd f d f d
fd
A
f
This is a set! and the
integral is measuring
the size of this set!
Measures are:
Left Continuous
Ai A lim ( Ai ) ( A)
Integrals are:
Left Continuous
f i f lim f i d fd
(Monotone Convergence Thm.)
Ai A and ( Ai )
lim ( Ai ) ( A)
f i f and f i d
lim f i d fd
(Bounded Convergence Thm.)
(Fatous Lemma, etc...)
etc...
etc...
The Lp Spaces
p
|
f
|
d
A function is in L (,F,) if
p
| f | d
p
1/ p
f , g fgd
1 p
(1 x )
2
; x
xf (x)dx
Now,
x
0 1 x2 dx
x
dx
2
1 x
1
1
x
dx x
dx
2
2
1 x
1 x
1
1
_
x
dx
x
dx
0 1 x2
1 x2
x
1 1 x2 dx
x
dx
2
1 x
[1, )
1 x2 z
2 xdx dz
n,n 1
1
log z
1x
2 n1
1
1 (n 1)2
log
2 n1
1 n2
put
1(n 1)2
2
n+1
1+(n+1)2
1+n2
1 (n 1) 2
1 (n 1) 2
log
log
2
1 n
1 2n 2
1 (n 1) 2 1
lim log
2
n
1 n
2
dx
2
does
1 x
0
E( X )
not
exist
does not exist
x
lim
dx 0;
2
a
a 1 x
a
x
But lim
dx
2
a
2a 1 x
1a
1
lim log z
a
1 4a
2
1
1 a2
lim log
a
2
1 4a2
1
2
E (X )
x
Since 2
1 x
function
does not
exist.
is an odd
x
2
2
2
x
x
1 x
lim
1 0
2
2
x
1
1 x
1 x
x
x
dx
2
1
x
0
E (X )
Measure-Probability Measure-Cumulative
Distribution Function
Bounded measure
Probability
measure
Cumulative
Distribution
Function
Discrete P(A)=1,
#(A)=finite or
Probability
Measures
Continuous
P{x}=0 for
all x
Absolutely
Continous w.r.t.
Lebesgue measure
Non A.C.
On Euclidean sample
space
Broad Categories of
Probability Measures
Expectation
A random variable is a measurable function.
X ( )
The expectation of a random variable is its integral:
E ( X ) XdP
A density function is the Radon-Nikodym derivative wrt
Lebesgue measure:
dP
fX
E ( X ) XdP xf X ( x)dx
dx
Expectation
Change of variables. Let f be measurable from (, F , v)
to (,) and g be Borel on (,). Then
(1)
i.e. , if either integral exists, then so does the other, and the
two are the same.
Note that integration domains are indicated on both
sides of 1. This result extends the change of variable
formula for Reimann integrals, i.e. ,
Expectation
In statistics we will talk about expectations with respect
to different measures.
E P ( X ) XdP
E Q ( X ) XdQ
dP
or dP dQ
where
dQ