Damiano Rossello: DEB University of Catania
Damiano Rossello: DEB University of Catania
Lecture 5
Damiano Rossello
DEB
University of Catania
Outline
Example
Let (Ω, F ) be an event space and take a fixed event A ∈ F . Define a Bernoulli rv by
X (ω) = 1 if ω ∈ A and X (ω) = 0 otherwise. The probability distribution PX of such a
rv is given by two masses p := P(X = 1) and 1 − p := P(X = 0), where 0 < p < 1.
Now, you have the following gamble: win € 1000 if the event {X = 1} occurs,
otherwise loose € 900 if event {X = 0} occurs. This may be modelled by a new rv
Y (ω) which takes on this two monetary values with probabilities p and 1 − p,
respectively. Note that X and Y share the same distribution.
To forecast the win/loss you may compute X := p · 1000 + (−900) · (1 − p). E.g.,
assuming p = 21 you expect to win X = 50 euro. A physical interpretation of X is that
it represents the centre of gravity of the numbers −900 and 1000 in R, on which you
put two probability weights, i.e. PX (B) = pδ1000 (B) + (1 − p)δ−900 (B) for every Borel
set B ⊂ R and δx (B) being the Dirac-delta measure. Observe that X is a number
lying between −900 and 1000, ∀p ∈ (0, 1).
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
Figure: The Bernoulli distribution PX is discrete with total mass 1 and X is its
centre of gravity, with p = 0.65
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
Remark
The number X in the previous Example is based on the concept of a convex
combination of two real numbers. Let x, y ∈ R be such that x < y and define:
xλ := λ x + (1 − λ) y , for every 0 6 λ 6 1,
where xλ ∈ [x, y ]. The weights λ and (1 − λ) may be thought of as probability masses,
since their sum is 1. Now, choose P the numbers x1 , . . . P , xn ∈ [x, y ] and n positive
weights λ1 , . . . , λn such that ni=1 λi = 1. Then S = ni=1 λi xi ∈ [x,Py ], where S is
the average of the xi ’s and satisfies P x 6 S 6 y . Indeed, assume that ni=2 λi xi ∈ [x, y ]
for n − 1 numbers. From condition ni=1 λi = 1 deduce 1 − λ1 > λi . This implies
λi λi
< 1, each i = 2, 3, . . . , n. We can find z = ni=2 1−λ
P
0 < 1−λ xi within [x, y ], so
1 Pn 1
the convex combination i=1 λi xi = λ1 x1 + (1 − λ1 )z belongs to [x, y ]. Note that S
lies between min{x1 , . . . , xn } and max{x1 , . . . , xn }.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
Indicator Functions
0 1
IA 1
A c
0 1
A IA 1
A
Ac
IA 1
0 1
IA 1
Borel set B 0 1
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
Some Properties
Exercise
Show Properties 1 to 6 for indicator functions. Observe that Properties 1
to 6 are considered pointwise, i.e. they are verified comparing IA (ω) and
IB (ω) for each ω ∈ Ω. Show that 1 becomes IA∪B = IA + IB if A and B
are disjoint. Show that 3 is simply IA − IB whenever B ⊂ A. Property 6
refers to symmetric difference between sets.
Pn
We wish X = i=1 xi · IAi being a rv. The affirmative answer comes
from a series of properties concerning measurability of random functions
and composition of maps. We start with the following preliminary result:
Let (Ω, F) be an event space and let (R, B) be the real measurable
space. Define G collecting Borel sets B ⊂ R that generate B, i.e.
B = σ(G). A map X : Ω → R is a rv (a measurable map) iff
X −1 (B) ∈ F for every B ∈ G.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
Proof.
(⇒) Trivial. (⇐) Let us assume X −1 (B) ∈ F for every B ∈ G. To show
that this implies measurability on the whole Borel σ-algebra B, we will
use a squeezing argument. By assumption G ⊂ B, so it contains some of
the Borel sets B ∈ B whose inverse images are in F. The collection
E = {B : X −1 (B) ∈ F} is of course greater, G ⊂ E, and is a σ-algebra
(see Lecture 3). Moreover, E ⊂ B and by hypotheses σ(G) = B. This
implies B = E and then every X −1 (B) belong to F for every B ∈ B,
which entails the required result.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
Recall that (see Lecture 3) the collection of all half-rays (−∞, x],
together with ∅, generates the Borel σ-algebra B for every x ∈ R.
Now do:
Exercise
Prove the following Theorem:
Let (Ω, F ) be an event space and let X , Y be two rv’s defined here. Then:
1 X + Y is a rv
2 α · X is a rv, for any α ∈ R
X
3
Y
is a rv, for any ω ∈ Ω such that Y (ω) 6= 0
4 the maximum and minimum between rv’s, X ∨ Y := max{X , Y } and
X ∧ Y := min{X , Y }, is again a rv.
Hint: for 1 show that {X + Y 6 x} is an event in F for every x ∈ R, and similarly for
2,3. For the maximum and minimum of rv’s, show that for every x ∈ R the set
{X ∨ Y 6 x} equals {X 6 x} ∩ {Y 6 x} and the set {X ∧ Y 6 x} equals
{X 6 x} ∪ {Y 6 x}.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
Simple Rv’s
Observe that any simple rv is a bounded function, i.e. we can find C > 0
such that |X (ω)| 6 C for every ω ∈ Ω. A simple rv might be defined
through different choices of the partition, but the standard one is always
Ai = {X = xi }.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
X I A 1 I A 2 I A3 I A4
A1 A1 A41
A4 A43
IA41 IA42 IA43 A42
A2 A2
A3 A3
X
1 2 3 4
The ith event is further decomposed into Ai1 ∪ · · · ∪ Aim . Thus, in the original
representation of X the ith term must be replaced xi IAi = xi I∪m Aij = m
P
j=1 j=1 xi IAij .
The Figure above illustrates this with i = 4 and m = 3. It would be possible to replace
all the events Ai with others Bj that form a different partition, but leaving the
coefficients xi unaffected.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
Expectation: Step 1
Pn
Let X = i=1 xi IAi be a simple rv. The expectation of X is:
Exercise
Let S be the collection of all simple rv’s defined over (Ω, F, P). Show
that:
1 The constant unit function 1 := IΩ belongs to S; the constant zero
function 0 := I∅ is also in S.
2 The product c · X between a number c ∈ R and a simple rv X is an
element of S.
3 The sum X + Y of two simple rv’s is again a simple rv in S.
4 The product of any pair X , Y of simple rv’s is in S.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
Show that:
1 0 + X = X , for simple rv’s 0, X ∈ S.
2 There exists a simple rv −X ∈ S such that X + (−X ) = 0.
3 1 · X = X , for simple rv’s 1, X ∈ S.
4 a · (b · X ) = (a · b) · X , for a simple rv X ∈ S and constants a, b ∈ R.
5 (a + b) · X = a · X + b · X , for a simple rv X ∈ S and constants
a, b ∈ R.
6 a · (X + Y ) = a · X + a · Y , for simple rv’s X , Y ∈ S and a constant
a ∈ R.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
Example
Pn
Let X = i=1 xi I{X =xi } be a nonnegative simple rv, i.e. xi > 0, and let
c > 0. By linearity we have E(c X ) = c E(X ). Namely, expectation is
positive homogeneous.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
Let A1 , A2 , . . . , An be events.
Qn
1 Prove I∪ni=1 Ai = 1 − i=1 (1 − IAi ).
2 Prove Inclusion-Exclusion Formula by applying Step 1 in the
definition of expectation to the identity in 1.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
Expectation: Step 2
Indeed, Step 1 gives each E(Z ) with respect to nonnegative simple rv’s
Z ∈ S+ that approximate X from
below. The set
E(Z ) : 0 6 Z 6 X , Z simple is of the form [0, x) or [0, x], where
x = ∞ is allowed.
The supremum above always exists, since it is taken over a subset of
increasing real numbers. It may be infinite even if X is not.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
Lemma
Suppose X > 0 is a nonnegative rv. Then, there exists an increasing
sequence X1 6 X2 6 X3 6 · · · of simple rv’s which are nonnegative and
converge pointwise to X .
Remark
We can define a measurable map Xn : Ω → R for every n ∈ N, then obtaining a
sequence (Xn )n∈N of rv’s. For each ω ∈ Ω the set
{X1 (ω), X2 (ω), . . .} = {x1 , x2 , . . .}
is an infinite sequence of real numbers. If limn→∞ xn = x, then the number x
corresponds to the given ω. It might happen that for any other ω the above limit
exists (but in general it is not equal to the same number x). Thus, we have a function
that to each ω attaches a number given by a limit. In this case the sequence (Xn )n∈N
is said to converge pointwise to the limit function X : Ω → R. Namely, ∀ ω ∈ Ω and
∀ > 0, there exists a positive integer N such that n > N implies |Xn (ω) − X (ω)| < .
It can be shown that the random function X is a rv too (see later).
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
Example
Let (Ω, F) be an event space with Ω = [0, 1] and F = B ∩ [0, 1], the
trace σ-algebra of Borel sets in the unit interval. Define the sequence of
functions Xn (ω) = ω n for every n ∈ N. Each Xn is measurable (since it is
a continuous function, see later), thus it is a rv. We have limn Xn (ω) = 0
if ω ∈ [0, 1) and limn Xn (ω) = 1 if ω = 1. Thus the sequence (Xn )n∈N
just defined converges pointwise to the limit function
(
0 if ω ∈ [0, 1)
X (ω) =
1 if ω = 1.
The convergence is very rapid to X (ω) = 0 for ω near zero, but very slow for ω near 1.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
Xn X
1 1
1 2
0 1 0 1
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
Exercise
Assume X ∈ S+ . Show that E(X ) given by formula (4) equals that
of formula (2).
Let X , Y > 0. Show that the expectation of nonnegative rv’s is
linear and order preserving as for rv’s in S.
Let X > 0 and assume E(X ) < ∞. Show that P(X < ∞) = 1.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
Exercise
a.s.
Let X be an a.s. degenerate rv, i.e. X = c. Show that E(X ) = c.
In general, if X = c then is distribution function its FX (x) = 1 for every x > c and
zero otherwise. We have:
FX (c−) = P(X < c) = lim P(X 6 c − h) = 0,
h↓0
Remark
We can write X + = X · I{X >0} and X − = −X · I{X 60} . The absolute
value |X | = X + + X − is the sum of two nonnegative rv’s. Then, by 1 of
Exercise 2 it is a rv. Clearly, X + 6 |X | and X − 6 |X |, for every outcome
ω ∈ Ω.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s
Remark
There is an equivalent condition for the existence of E(X ), obtained
through the decomposition |X | = X + + X − . Assume
E(|X |) = E(X + ) + E(X − ) is finite and take the expectation of both sides
in X + 6 |X | and in X − 6 |X |. It follows E(X + ) < ∞ and E(X − ) < ∞.
Thus, the finiteness condition in equation (5) is fulfilled. On the other
hand, if we assume that both E(X + ) and E(X − ) are finite, then by
summing and using linearity we get E(|X |) < ∞.
Exercise
Prove the important inequality:
(6) |E(X )| 6 E(|X |).
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
Proof.
Assuming pointwise convergence Xn → X , we have X is a rv. Furthermore,
E(X1 ) 6 E(X2 ) 6 · · · 6 E(X ) by assumption. Hence, limn E(Xn ) 6 E(X ). It suffices to
show the converse inequality. The case E(X1 ) = ∞ is trivial, then we assume
E(X1 ) < ∞. By formula (4)P we can approximate E(X ) by the expectation of a
nonnegative simple rv Z = m i=1 zi IAi such that Z 6 X . The latter
P condition means
that zi 6 X (ω) for every ω ∈ Ai . Let us show that limn E(Xn ) > m i=1 zi P(Ai ). Pick
> 0, define Ain = {ω P ∈ Ai : Xn (ω) > zi − } and observe that Ain ↑ Ai as n → ∞.
Furthermore, E(Xn ) > m i=1 (zi − )P(Ain ). By continuity from below we have
Remark
In the proof of Theorem 2 we use the following fact: Let a, b ∈ R such
that a 6 b + , and > 0, then a 6 b. The proof runs by contradiction:
assume a 6 b + and ¬(a 6 b) = a > b. But choosing = a−b 2 > 0 we
arrive to a > b + , which contradicts the assumption.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
Exercise
Let X , Y > 0. Give an alternative proof that the expectation is linear,
using the Monotone Convergence Theorem.
Hint
Find sequences (Xn )n∈N and (Yn )n∈N of simple rv’s such that Xn → X
and Yn → Y pointwise. Then take the sequence of sums (Xn + Yn )n∈N .
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
Exercise
Let X and Y rv’s defined on the same probability space and such that
a.s.
X = Y . Show that E(X ) = E(Y ).
Hint
Use lemma 1 and the Monotone Convergence Theorem.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
Exercise
Let X be a nonnegative discrete rv, i.e. X (Ω) = {x1 , x2 , . . .} with xi > 0
each i. Show that its expectation is:
∞
X
(7) xi P(X = xi ), where events {X = xi } decompose Ω.
i=1
Hint
Define the random variable
(
X if X ∈ {x1 , x2 , . . . , xn }
Yn =
0 if X ∈ {xn+1 , xn+2 , . . .},
and observe that Yn < X , for every ω ∈ Ω and some n ∈ N. Then, use formula (2) to
write E(Yn ). Eventually, use Lemma 1 and the Mon. Conv. Th.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
Exercise
Show that
E(X1 + X2 + · · · ) = E(X1 ) + E(X2 ) + · · ·
Hint Pn
for n ∈ N. Then apply the Monotone
Write the partial sum Sn = i=1 Xi P
∞
Convergence Theorem to the series i=1 Xi = limn→∞ Sn .
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
Remark
By Exercise 12, to guarantee that the expectation of a discrete rv X is
finite we must impose:
∞
X
(8) |xi |P(X = xi ) < ∞.
i=1
Example
Let X be a rv with X (Ω) = {1, 2, . . .}, and let PX be given by masses
1
P(Xi = i) = i(i+1)
∈ (0, 1), i = 1, 2, . . . .
Pn 1
Observe that sn = i=1 i(i+1) may be written as:
1 1 1
sn = 1·2
+ 2·3
+ ··· + n(n+1)
1 1 1 1 1 1 1
= 1− 2
+ 2
− 3
+ ··· + n−1
− n
+ n
− n+1
1
= 1− n+1
.
Thus the total mass is limn→∞ sn = 1. Unfortunately, the expectation of X is infinite
for the condition (8) is not fulfilled:
n
X ∞
X
1 1
lim |i| · i(i+1)
= i+1
= ∞.
n→∞
i=1 i=1
We drop the absolute signs and consider the second sum as the harmonic series, which
is known to be divergent.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
Exercise
Expectation as an Integral
R
We are going to show that E(X ) = Ω X dP, i.e. expectation is an
integral. More precisely, it is a Lebesgue type integral. A fundamental
argument is the following:
Theorem
A Remark
A measure m on (R, B) such that m((a, b]) < ∞, for finite a, b ∈ R (viz.
Lebesgue-Stieltjes measure, see later in this Lecture), gives raise to a
càdlàg function as follows:
m((0, x]), if x > 0
F (x) =
−m((x, 0]), if x 6 0.
It can be shown (exercise for the student) that m((a, b]) = F (b) − F (a)
and F is indeed right-continuous and nondecreasing.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
Exercise
Show the following.
1 The Lebesgue measure ` is not finite, i.e. `(R) = ∞, but it is
σ-finite.
2 ` is diffusive.
3 `([a, b)) = `((a, b)) = `([a, b]), for finite a < b.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
Lebesgue Integral in R
Let ((0, 1], B(0,1] ) be the event space for drawing at random a number
within the unit interval. Now, take the Lebesgue measure ` restricted to
B(0,1] .
`(B) is the probability that a number 0 < x 6 1 lies exactly in the Borel
set B ∈ B(0,1] . Clearly `((0, 1]) = 1.
Example
With the above in mind we arein position to show that there exists a rv X < ∞ but
E(X ) = ∞. Let (0, 1], B(0,1] , ` be a continuous uniform probability space, where ` is
meant as the restricted Lebesgue measure. Define X = ∞ i
P
i=1 2 I(2−i ,2−(i−1) ] > 0,
−i −(i−1)
where the dyadic intervals 2 , 2 decompose (0, 1]. This is a discrete rv
becausePits range X ((0, 1]) is countable. Now, choose n ∈ N and consider the simple
n
rv Z = i=1 2i I(2−i ,2−(i−1) ] > 0. We have Z 6 X for every ω ∈ (0, 1] and by formula
(2):
n
X n
X
2i · ` 2−i , 2−(i−1) 2i · (2−(i−1) − 2−i )
E(Z ) = =
i=1 i=1
Xn
= 2i · 2−i = n.
i=1
Thus by formula (4) E(X ) > E(Z ) for every n ∈ N so that E(X ) = ∞.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
n
X
UPn = U(Pn , f ) := Mi (xi − xi−1 ), Mi = sup f (x) : x ∈ [xi−1 , xi ] ,
i=1
In fact, the set {LPn : Pn be a partition} is bounded above and the set
{UPn : Pn be a partition} is bounded below. It can be shown that f is
Riemann-integrable iff
(Z )
Z b b
f (x)dx = sup g (x)dx : 0 6 g 6 f
a a
Example
Let f (x) = IQ∩[a,b] (x) be 1 if x is a rational number lying in [a, b] and zero otherwise.
This function is not Riemann-integrable. Indeed, any lower sum LPn is zero and any
upper sum UPn is 1. Thus, taking finer partitions and sending the mesh to zero as
n → ∞ we have LPn < UPn .
Example
Let (fn (x))n∈N be a sequence of functions all defined on [a, b]. Assume pointwise
convergence to the limit function f (x), i.e. fn (x) → f (x) as n → ∞ for every
x ∈ [a, b]. Then, it can be shown that
Z b Z b Z b
lim fn (x)dx = lim fn (x)dx = f (x)dx
n→∞ a a n→∞ a
To intergate more general functions than the bounded ones, let f : [a, b] → R be a
nonnegative Borel-measurable map. Then fix a partition Pn = {y0 < y1 < . . . < yn }
of the codomain [0, ∞) of f > 0, such that Ji = [yi−1 , yi ) are pairwise disjoint
right-open intervals. Observe that every Ji contains some f (x), for x ∈ [a, b].
Next, take inverse images f −1 (Ji ) = {yi−1 6 f < yi } = Ai and define a nonnegative
step function g (x) = c1 IA1 + · · · + cn IAn , where clearly the Borel sets Ai ’s decompose
P x ∈ [a, b]. Furthermore,
[a, b] and 0 6 ci = inf Ji . This entailsR g (x) 6 f (x), each
define the Lebesgue integral of g by [a,b] g (x)d` := ni=1 ci `(Ji ). We say that f is
Lebesgue-integrable iff Z
lim LePn = f (x)d`,
n→∞ [a,b]
R
where LePn is nothing but the lower-Lebesgue sum equals to [a,b] g (x)d`.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
y = g(x)
c4
c3
c2
c1
c5
a x1 x2 x3 b
P5
Figure: Step function g (x) = i=1 ci IAi (x), with
A1 = [a, x1 ), A2 = {x1 }, A3 = (x1 , x2 ), A4 = [x2 , x3 ), A5 = [x3 , b]
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
It is easily seen that LePn 6 LePn+1 . So to state the integrability condition we write:
Z b Z b
f (x)d` = sup g (x)d` : 0 6 g 6 f , g step function.
a a
c5 y = f (x)
c4
c2
c3
c1
y = g (x)
a x1 x2 x3 x4 b
P5
Figure: Bounded nonnegative function f (x). The step function g (x) = i=1 ci IAi (x),
approximates f from below, with
A1 = [a,x1 ), A2 = [x1 ,
x2 ), A3 = [x2 , x3 ), A4 = [x3 , x4 ), A5 = [x4 , b] and
ci = inf f (x) : x ∈ Ai = inf Ji
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
For more general domains than [a, b], such as Borel sets B ⊂ R, we always can define:
Z
f (x)d`, with f > 0.
B
For more general Borel measurable functions f we apply the integration process
Example
R
The function f (x) = IQ∩[a,b] (x) has Lebesgue integral [a,b]
f (x)d` = 0.
Theorem
Recall that f is a.e. continuous over a Borel subset B ⊂ R whenever the set
{x ∈ B : f (x) is not continuous} =: {f is not continuous}
has zero Lebesgue measure. As a corollary we get: The Riemann integral of every
continuous f coincides with the Lebesgue integral.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
With all the above in mind we can rephrase the definition of a continuous probability
distribution on (R, B) induced by a rv X . WeR need the density % : R → [0, ∞) to be a
nonnegative Borel-measurable function with R %(x)d` = 1.
The probability of the Borel set B ⊂ R translates to the probability of the event
{X ∈ B} and is written:
Z
PX (B) = P(X ∈ B) = %(x)d`.
B
R
In particular, for B = (a, b] we have (a,b] %(x)d`. Clearly, if % is in addition a
continuous function the Lebesgue integral used in defining these probabilities is equal
to the Riemann integral is in addition B is an interval.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
Remark
R
To show that the set function B 7→ B %(x)d` defined over the Borel σ-algebra B is
actually a probability measure we only P need a pairwise disjoint sequences B1 , B2 , . . . of
Borel sets and the property I∪∞ n=1 Bn
= ∞ n=1 IBn of indicator functions, together with
property (v) of the Lebesgue integral:
Z
PX (∪∞n=1 B n ) = I∪∞
n=1 Bn
%(x)d`
R
∞
Z X
= IBn %(x)d`
R n=1
∞ Z
X ∞
X
= IBn %(x)d` = PX (Bn ).
n=1 R n=1
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
In the previous Remark we used the following fact. For a rv X take its
R A ∈ F, then write its expected value
truncated version X IA for any event
E(X IA ) as the Lebesgue integral A X dP. Observe that X IA = X for
every ω ∈ A.
Exercise
R
Show that E(X IA ) is in fact equal to A X dP.
Hint
Take the restriction of X to the event A, then define the expectation above over the
restricted probability space (A, FA ), with FA the trace σ-algebra of subsets A ∩ B for
P(A∩B)
all B ∈ F . Use the probability measure PA (B) := P(B) for every B ∈ FA and all the
steps defining E( · ).
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
Taking expectation
R is meant as
R∞the step by step procedure studied so far.
Note that R may be written −∞ .
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
Then, the composed map IB ◦ X = IB (X ) equals the indicator of the event {X ∈ B}.
Theorem
Let (Ω, F ) be an event space and let (R, B) be the real measurable space. Define the
rv X : Ω → R and the Borel-measurable function g : R → R. Then the composed map
Z = g ◦ X : Ω → R is a rv.
Proof.
First
R we check Rformula (9) for indicators g (X ) = IB (X ), where B ∈RB. We have
Ω IB (X )dP = Ω I{X ∈B} dP = E[I{X ∈B} ] = P(X ∈ B) = PX (B) = R IB (x)dPX . By
linearity, formula (9) holds for g (X ) ∈ S+ . If g (X ) is any rv, then we apply the
Monotone Convergence Theorem to both g (X )+ and g (X )− as follows:
Z Z
g (X )dP = lim(gn+ (X ) − gn+ (X ))dP
Ω Ω n
Z Z
= lim gn+ (X )dP − lim gn− (X )dP
n Ω n Ω
Z Z
= lim gn+ (x)dPX − lim gn− (x)dPX
n R n R
Z
= lim(gn+ (x) − gn+ (x))dPX
R n
Z Z
= (g + (x) − g − (x))dPX = g (x)dPX ,
R R
Consequences
whenever g : R → R is Borel-measurable.
When both sides of equation (10) are finite, Corollary 1 says that the
Borel function g (x)%(x) is integrable.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
Example
Exercise
Let X , Y be rv’s with finite expectation and let g : R → R be a
Borel-measurable function such that g (X ), g (Y ) have finite expectation
a.s.
too. Prove in an alternative way that if X = Y then
E(g (X )) = E(g (Y )).
Hint
Use the Change of Variable Formula.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
Exercise
Let X be a rv overPa probability space (Ω, F, P). Define the nonnegative
n
simple rv g (X ) = i=1 ai I{X ∈Bi } , where ai > 0 and ∪ni=1 Bi = R is a
finite partition of R. The function g : R → R is clearly Borel-measurable.
R
Assume E(g (X )) < ∞ and show that the expectation equals R g (x)dPX .
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
Exercise
Assume X = c for every ω ∈ Ω. Use the Change of Variable Formula to
compute E(g (X )) as:
Z
(11) g (X )dP = g (c).
Ω
Hint
Use the Dirac-delta measure PX (B) = δc (B), for Borel sets B ⊂ R and
c ∈ R.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
Summary
Lebesgue measure and integration provides us with a powerful tool for manipulating
(symbolically or numerically) expectations. From now on we will write dx in the
Lebesgue integral, if no confusion arises on what type of integral is involved.
The probability of a Borel subset B over (R, B) can be computed by integrating dFX
here.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
Lebesgue-Stieltjes Integral
R∞
The integral −∞
xdFX defining E(X ) is of a special kind. It originates
as follows.
1 Define a Lebesgue-Stieltjes measure on (R, B), i.e. a countably
additive set function m generated by a càdlàg function F as
m((a, b]) = F (b) − F (a) for finite reals a < b, and such that m(I ) is
finite for bounded intervals I ⊂ R.
2 Define a Borel-measurable function g : R → R.
RBuild the Lebesgue-type integral of g step by step to get
3
∞
−∞
g (x)dF .
4 This is called the Lebesgue-Stieltjes integral.
5 To get the expectation set g (x) = x and consider only càdlàg
functions F such that F (−∞) = 0 and F (∞) = 1, i.e. df’s.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type
Riemann-Stieltjes Integral
R∞
g (x)dF using the Riemann approach. If ab g (x)dx exists,
R
Now let us construct −∞
Pn
then it is approximately i=1 g (ci )(xi − xi−1 ) where ci ∈ (xi−1 , xi ] and [a, b] is
partitioned by points {a = x0 < x1 < · · · < xn = n}. Here the ‘representative value’
g (ci ) of g over each (xi−1 , xi ] is weighted by `((xi−1 , xi ]). Recall that ` is generated
by the càdlàg function F (x) = x.
Instead,
Pn we can use a different ‘weight function’ F (x) and form the sum
i=1 g (ci )(F (xi ) − F (xi−1 )). If for n → ∞ the sum converges to a limit, we call it
the Riemann-Stieltjes integral of g with respect to F . It is crucial that g (x) and F (x)
have not discontinuities at the same points x.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type