0% found this document useful (0 votes)

64 views

Damiano Rossello: DEB University of Catania

The document discusses expected value and two important theorems. It introduces expected value using a Bernoulli random variable example. It then discusses indicator functions and how they can be used to define random variables. It also discusses how random variables can be written as linear combinations of indicator functions.

Uploaded by

Kai Chung Tam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

64 views

Damiano Rossello: DEB University of Catania

Uploaded by

Kai Chung Tam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 78

Expected Value: The Starting Point

Two Important Theorems

Lecture 5

Damiano Rossello

DEB
University of Catania

Probability For Finance

Department of Economics and Business
Fall 2015 - Catania, Italy
Expected Value: The Starting Point
Two Important Theorems

Outline

1 Expected Value: The Starting Point

A Motivating Example
Simple Rv’s
Nonnegative rv’s
General rv’s

2 Two Important Theorems

Monotone Convergence Theorem
Change of Variable Formula
Integrals of the Stieltjes Type
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Example

Let (Ω, F ) be an event space and take a fixed event A ∈ F . Define a Bernoulli rv by
X (ω) = 1 if ω ∈ A and X (ω) = 0 otherwise. The probability distribution PX of such a
rv is given by two masses p := P(X = 1) and 1 − p := P(X = 0), where 0 < p < 1.
Now, you have the following gamble: win € 1000 if the event {X = 1} occurs,
otherwise loose € 900 if event {X = 0} occurs. This may be modelled by a new rv
Y (ω) which takes on this two monetary values with probabilities p and 1 − p,
respectively. Note that X and Y share the same distribution.

To forecast the win/loss you may compute X := p · 1000 + (−900) · (1 − p). E.g.,
assuming p = 21 you expect to win X = 50 euro. A physical interpretation of X is that
it represents the centre of gravity of the numbers −900 and 1000 in R, on which you
put two probability weights, i.e. PX (B) = pδ1000 (B) + (1 − p)δ−900 (B) for every Borel
set B ⊂ R and δx (B) being the Dirac-delta measure. Observe that X is a number
lying between −900 and 1000, ∀p ∈ (0, 1).
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

0.35 900 0.65 1000

900 335 1000

Figure: The Bernoulli distribution PX is discrete with total mass 1 and X is its
centre of gravity, with p = 0.65
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Remark
The number X in the previous Example is based on the concept of a convex
combination of two real numbers. Let x, y ∈ R be such that x < y and define:
xλ := λ x + (1 − λ) y , for every 0 6 λ 6 1,
where xλ ∈ [x, y ]. The weights λ and (1 − λ) may be thought of as probability masses,
since their sum is 1. Now, choose P the numbers x1 , . . . P , xn ∈ [x, y ] and n positive
weights λ1 , . . . , λn such that ni=1 λi = 1. Then S = ni=1 λi xi ∈ [x,Py ], where S is
the average of the xi ’s and satisfies P x 6 S 6 y . Indeed, assume that ni=2 λi xi ∈ [x, y ]
for n − 1 numbers. From condition ni=1 λi = 1 deduce 1 − λ1 > λi . This implies
λi λi
< 1, each i = 2, 3, . . . , n. We can find z = ni=2 1−λ
P
0 < 1−λ xi within [x, y ], so
1 Pn 1
the convex combination i=1 λi xi = λ1 x1 + (1 − λ1 )z belongs to [x, y ]. Note that S
lies between min{x1 , . . . , xn } and max{x1 , . . . , xn }.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Indicator Functions

We can extend the calculation of X to more elaborated rv’s. The key

idea in constructing complicated rv’s X is that one can start from simpler
structures to approximate the X ’s. Let Ω be nonempty and take A ⊂ Ω.
The indicator function of A is the map defined on Ω by:
(
1 if ω ∈ A
IA (ω) =
0 if ω ∈ Ac .

The random function IA is a rv iff A ∈ F (exercise for the student).

A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

0 1
IA 1
A c

0 1
A IA 1
A
Ac
IA 1

0 1
IA 1

Borel set B 0 1
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Some Properties

Essentially, indicator functions establish a 1-to-1 correspondence between

measurable maps (rv’s) and measurable sets (events): set operations can
be translated into (algebraic) operations on functions. For any pair of
events A, B ∈ F it holds:
1 IA∪B = IA + IB − IA∩B = max{IA , IB };
2 IA∩B = IA − IA\B = IA · IB = min{IA , IB };
3 IA\B = max{0, IA − IB };
4 IAc = 1 − IA ;
5 IA 6 IB iff A ⊂ B;
6 IA∆B = |IA − IB |.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Exercise
Show Properties 1 to 6 for indicator functions. Observe that Properties 1
to 6 are considered pointwise, i.e. they are verified comparing IA (ω) and
IB (ω) for each ω ∈ Ω. Show that 1 becomes IA∪B = IA + IB if A and B
are disjoint. Show that 3 is simply IA − IB whenever B ⊂ A. Property 6
refers to symmetric difference between sets.

Note that the rv X in Example 1 may be defined as X = IA , with

P(IA ∈ B) = p if 1 ∈ B and B ⊂ R is a Borel set.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Linear Combinations of Indicators

Any constant random function induces the partition {∅, Ω} of Ω by

means of inverse images (see Lecture 3). More generally, by Exercise 1
any indicator function gives a partition of Ω through {∅, Ω, A, Ac }.

Recall that (see Lecture 3) a discrete rv may take on a finite number of

values {x1 , . . . , xn }. This latter set is the range X (Ω) of a random
function obtained as follows:

(1) X = x1 · IA1 + · · · + xn · IAn ,

where Ai = X −1 ({xi }) = {X = xi }, i = 1, . . . , n. Here the partition is

due to {∅, A1 , . . . P
, An , Ω}. In the reverse direction, define the linear
n
combination X = i=1 xi · IAi of indicators such that events A1 , . . . , An
form a partition of Ω.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Intermezzo 1: Preserving Measurability

Pn
We wish X = i=1 xi · IAi being a rv. The affirmative answer comes
from a series of properties concerning measurability of random functions
and composition of maps. We start with the following preliminary result:

Theorem (Measurability on a Generator)

Let (Ω, F) be an event space and let (R, B) be the real measurable
space. Define G collecting Borel sets B ⊂ R that generate B, i.e.
B = σ(G). A map X : Ω → R is a rv (a measurable map) iff
X −1 (B) ∈ F for every B ∈ G.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Proof.
(⇒) Trivial. (⇐) Let us assume X −1 (B) ∈ F for every B ∈ G. To show
that this implies measurability on the whole Borel σ-algebra B, we will
use a squeezing argument. By assumption G ⊂ B, so it contains some of
the Borel sets B ∈ B whose inverse images are in F. The collection
E = {B : X −1 (B) ∈ F} is of course greater, G ⊂ E, and is a σ-algebra
(see Lecture 3). Moreover, E ⊂ B and by hypotheses σ(G) = B. This
implies B = E and then every X −1 (B) belong to F for every B ∈ B,
which entails the required result.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Recall that (see Lecture 3) the collection of all half-rays (−∞, x],
together with ∅, generates the Borel σ-algebra B for every x ∈ R.

Thus by Theorem 1 and the above, to verify that a random function

X : Ω → R is actually a rv we need only to check that {X 6 x} is an
event in F, for every real number x.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Now do:
Exercise
Prove the following Theorem:
Let (Ω, F ) be an event space and let X , Y be two rv’s defined here. Then:
1 X + Y is a rv
2 α · X is a rv, for any α ∈ R
X
3
Y
is a rv, for any ω ∈ Ω such that Y (ω) 6= 0
4 the maximum and minimum between rv’s, X ∨ Y := max{X , Y } and
X ∧ Y := min{X , Y }, is again a rv.
Hint: for 1 show that {X + Y 6 x} is an event in F for every x ∈ R, and similarly for
2,3. For the maximum and minimum of rv’s, show that for every x ∈ R the set
{X ∨ Y 6 x} equals {X 6 x} ∩ {Y 6 x} and the set {X ∧ Y 6 x} equals
{X 6 x} ∪ {Y 6 x}.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Simple Rv’s

By Formula (1), combining n indicator functions over a partition

∪ni=1 Ai = Ω of the sample space and using coefficients xi ∈ R amounts to
iterate the sum of rv’s a finitePnumber of times. By the aforementioned
n
discussion we have that X = i=1 xi IAi is thus a rv called simple.

Observe that any simple rv is a bounded function, i.e. we can find C > 0
such that |X (ω)| 6 C for every ω ∈ Ω. A simple rv might be defined
through different choices of the partition, but the standard one is always
Ai = {X = xi }.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

X I A 1 I A 2 I A3 I A4
A1 A1 A41
A4 A43
IA41 IA42 IA43 A42
A2 A2
A3 A3

X
1 2 3 4

The ith event is further decomposed into Ai1 ∪ · · · ∪ Aim . Thus, in the original
representation of X the ith term must be replaced xi IAi = xi I∪m Aij = m
P
j=1 j=1 xi IAij .
The Figure above illustrates this with i = 4 and m = 3. It would be possible to replace
all the events Ai with others Bj that form a different partition, but leaving the
coefficients xi unaffected.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Expectation: Step 1

Pn
Let X = i=1 xi IAi be a simple rv. The expectation of X is:

(2) E(X ) := x1 P(A1 ) + · · · + xn P(An ) > 0.

This is well-defined and finite. As a special case we have

(3) E(IA ) := 1 · P(A) + 0 · P(Ac ) = P(A)

for X = IA , so the expectation coincides with the probability of A.

A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Exercise
Let S be the collection of all simple rv’s defined over (Ω, F, P). Show
that:
1 The constant unit function 1 := IΩ belongs to S; the constant zero
function 0 := I∅ is also in S.
2 The product c · X between a number c ∈ R and a simple rv X is an
element of S.
3 The sum X + Y of two simple rv’s is again a simple rv in S.
4 The product of any pair X , Y of simple rv’s is in S.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Exercise (The Set S is a Vector Space)

Show that:
1 0 + X = X , for simple rv’s 0, X ∈ S.
2 There exists a simple rv −X ∈ S such that X + (−X ) = 0.
3 1 · X = X , for simple rv’s 1, X ∈ S.
4 a · (b · X ) = (a · b) · X , for a simple rv X ∈ S and constants a, b ∈ R.
5 (a + b) · X = a · X + b · X , for a simple rv X ∈ S and constants
a, b ∈ R.
6 a · (X + Y ) = a · X + a · Y , for simple rv’s X , Y ∈ S and a constant
a ∈ R.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Exercise (Properties of E(X ))

Show that:
1 If a rv X is degenerate, i.e. X (ω) = c for every ω ∈ Ω, then
E(X ) = c.
2 Let X , Y ∈ S be simple rv’s. Then E(aX + bY ) = aE(X ) + bE(Y ),
for any a, b ∈ R. Expectation is linear.
3 Let X , Y ∈ S be simple rv’s such that X (ω) 6 Y (ω) for every
ω ∈ Ω, then E(X ) 6 E(Y ). Expectation preserves ordering.
Hint: by Exercise 4 the collection S of simple rv’s is a vector space, i.e. is closed
under summing simplePrv’s and underPthe product of scalars a ∈ R by simple rv’s.
Then, whenever X = i xi IAi , Y = j yj IBj and a, b ∈ R it holds
P P
Z = aX + bY = i j (axi + byj )IAi ∩Bj is a rv Z ∈ S, taking values axi + byj on
events Ai ∩ Bj ; these still form a partition of Ω.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Example
Pn
Let X = i=1 xi I{X =xi } be a nonnegative simple rv, i.e. xi > 0, and let
c > 0. By linearity we have E(c X ) = c E(X ). Namely, expectation is
positive homogeneous.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Exercise (Inclusion-Exclusion Formula Revisited)

Let A1 , A2 , . . . , An be events.
Qn
1 Prove I∪ni=1 Ai = 1 − i=1 (1 − IAi ).
2 Prove Inclusion-Exclusion Formula by applying Step 1 in the
definition of expectation to the identity in 1.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Expectation: Step 2

Let S+ ⊂ S be the subset of nonnegative simple rv’s. The expectation of

a nonnegative rv X > 0 is:

(4) E(X ) := sup E(Z ) : 0 6 Z 6 X , Z ∈ S+ .

Indeed, Step 1 gives each E(Z ) with respect to nonnegative simple rv’s
Z ∈ S+ that approximate X from
below. The set
E(Z ) : 0 6 Z 6 X , Z simple is of the form [0, x) or [0, x], where
x = ∞ is allowed.
The supremum above always exists, since it is taken over a subset of
increasing real numbers. It may be infinite even if X is not.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

The correctness of formula (4) is based on the following result:

Lemma
Suppose X > 0 is a nonnegative rv. Then, there exists an increasing
sequence X1 6 X2 6 X3 6 · · · of simple rv’s which are nonnegative and
converge pointwise to X .

Whence, the definition of E(X ) for a nonnegative rv X can be restated as:

lim E(Xn ) = E lim Xn = E(X ),
n→∞ n→∞

Expectation and limits can be exchanged.

A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Remark
We can define a measurable map Xn : Ω → R for every n ∈ N, then obtaining a
sequence (Xn )n∈N of rv’s. For each ω ∈ Ω the set
{X1 (ω), X2 (ω), . . .} = {x1 , x2 , . . .}
is an infinite sequence of real numbers. If limn→∞ xn = x, then the number x
corresponds to the given ω. It might happen that for any other ω the above limit
exists (but in general it is not equal to the same number x). Thus, we have a function
that to each ω attaches a number given by a limit. In this case the sequence (Xn )n∈N
is said to converge pointwise to the limit function X : Ω → R. Namely, ∀ ω ∈ Ω and
∀ > 0, there exists a positive integer N such that n > N implies |Xn (ω) − X (ω)| < .
It can be shown that the random function X is a rv too (see later).
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Example
Let (Ω, F) be an event space with Ω = [0, 1] and F = B ∩ [0, 1], the
trace σ-algebra of Borel sets in the unit interval. Define the sequence of
functions Xn (ω) = ω n for every n ∈ N. Each Xn is measurable (since it is
a continuous function, see later), thus it is a rv. We have limn Xn (ω) = 0
if ω ∈ [0, 1) and limn Xn (ω) = 1 if ω = 1. Thus the sequence (Xn )n∈N
just defined converges pointwise to the limit function
(
0 if ω ∈ [0, 1)
X (ω) =
1 if ω = 1.

The convergence is very rapid to X (ω) = 0 for ω near zero, but very slow for ω near 1.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Xn X

1 1

1 2

0 1 0 1
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Exercise
Assume X ∈ S+ . Show that E(X ) given by formula (4) equals that
of formula (2).
Let X , Y > 0. Show that the expectation of nonnegative rv’s is
linear and order preserving as for rv’s in S.
Let X > 0 and assume E(X ) < ∞. Show that P(X < ∞) = 1.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Remark (Almost Sure Comparisons)

Let X , Y be any pair of rv’s. If X (ω) 6= Y (ω), except on those ω ∈ Ω for
which {ω : X (ω) = Y (ω)} has probability 1, then P(X 6= Y ) = 0. We
a.s.
write X = Y and say that X is almost surely, briefly a.s., equal to Y .
This can be extended to other cases than equality, such as X 6 Y a.s.,
a.s.
X > Y a.s., and so on. Observe that X = Y implies equality in
distribution, i.e. PX (B) = PY (B) for any Borel set B ∈ R, but the
converse is not true.
a.s.
A rv is said to be a.s. degenerate, written X = c, if for a constant
c ∈ R we have P(X = c) = 1 but X can differs from the number c on a
P-null event. Thus, P(X 6= c) = 0.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Exercise
a.s.
Let X be an a.s. degenerate rv, i.e. X = c. Show that E(X ) = c.
In general, if X = c then is distribution function its FX (x) = 1 for every x > c and
zero otherwise. We have:
FX (c−) = P(X < c) = lim P(X 6 c − h) = 0,
h↓0

while FX (c) = P(X 6 c) = 1. This implies P(X = c) = FX (c) − FX (c−) = 1 and X

acts like a constant, i.e. its distribution is actually the Dirac-delta δc (B) for any Borel
set B ⊂ R.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Expectation: Final Step

Let X , Y be any pair of rv’s. By 4 of Exercise 2 and by choosing Y = 0
as a degenerate rv, we have:

X + := max{0, X }, X − := max{0, −X } = − min{0, X }.

Observe that X + , X − > 0. We call X + the positive part of X and X −

its negative part. Clearly, X = X + − X − . We may define E(X + ) and
E(X − ) by formula (4) in Step 2. Consequently, if both expectation are
finite we can let:

(5) E(X ) = E(X + ) − E(X − ) < ∞.

All the properties of expectation studied for nonnegative rv’s hold true. When the
expectation of either X + or X − is infinite we have E(X ) = ±∞; when both equal
infinity we no longer can define E(X ).
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Remark
We can write X + = X · I{X >0} and X − = −X · I{X 60} . The absolute
value |X | = X + + X − is the sum of two nonnegative rv’s. Then, by 1 of
Exercise 2 it is a rv. Clearly, X + 6 |X | and X − 6 |X |, for every outcome
ω ∈ Ω.
A Motivating Example
Expected Value: The Starting Point Simple Rv’s
Two Important Theorems Nonnegative rv’s
General rv’s

Remark
There is an equivalent condition for the existence of E(X ), obtained
through the decomposition |X | = X + + X − . Assume
E(|X |) = E(X + ) + E(X − ) is finite and take the expectation of both sides
in X + 6 |X | and in X − 6 |X |. It follows E(X + ) < ∞ and E(X − ) < ∞.
Thus, the finiteness condition in equation (5) is fulfilled. On the other
hand, if we assume that both E(X + ) and E(X − ) are finite, then by
summing and using linearity we get E(|X |) < ∞.

Exercise
Prove the important inequality:
(6) |E(X )| 6 E(|X |).
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Manipulating Expectation to the Limit

The following theorem is of key importance. Comparing with Lemma 1,

now we do not need simple rv’s.
Theorem
Let (Xn )n∈N be a sequence of nonnegative rv’s and let X be any rv such
that 0 6 X1 6 X2 6 · · · 6 X . If the sequence converges pointwise to X ,
then
lim E(Xn ) = E(X ).
n→∞
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Proof.
Assuming pointwise convergence Xn → X , we have X is a rv. Furthermore,
E(X1 ) 6 E(X2 ) 6 · · · 6 E(X ) by assumption. Hence, limn E(Xn ) 6 E(X ). It suffices to
show the converse inequality. The case E(X1 ) = ∞ is trivial, then we assume
E(X1 ) < ∞. By formula (4)P we can approximate E(X ) by the expectation of a
nonnegative simple rv Z = m i=1 zi IAi such that Z 6 X . The latter
P condition means
that zi 6 X (ω) for every ω ∈ Ai . Let us show that limn E(Xn ) > m i=1 zi P(Ai ). Pick
> 0, define Ain = {ω P ∈ Ai : Xn (ω) > zi − } and observe that Ain ↑ Ai as n → ∞.
Furthermore, E(Xn ) > m i=1 (zi − )P(Ain ). By continuity from below we have

lim (zi − )P(Ain ) = (zi − )P(Ai ),

n→∞

which implies limn→∞ E(Xn ) > m (zi − )P(Ai ) = m

P P
Pi=1 i=1 zi P(Ai ) − . Since > 0 is
m
arbitrary it follows limn E(Xn ) > i=1 zi P(Ai ) and we are done.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Remark
In the proof of Theorem 2 we use the following fact: Let a, b ∈ R such
that a 6 b + , and > 0, then a 6 b. The proof runs by contradiction:
assume a 6 b + and ¬(a 6 b) = a > b. But choosing = a−b 2 > 0 we
arrive to a > b + , which contradicts the assumption.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Exercise
Let X , Y > 0. Give an alternative proof that the expectation is linear,
using the Monotone Convergence Theorem.

Hint
Find sequences (Xn )n∈N and (Yn )n∈N of simple rv’s such that Xn → X
and Yn → Y pointwise. Then take the sequence of sums (Xn + Yn )n∈N .
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Exercise
Let X and Y rv’s defined on the same probability space and such that
a.s.
X = Y . Show that E(X ) = E(Y ).

Hint
Use lemma 1 and the Monotone Convergence Theorem.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Exercise
Let X be a nonnegative discrete rv, i.e. X (Ω) = {x1 , x2 , . . .} with xi > 0
each i. Show that its expectation is:
∞
X
(7) xi P(X = xi ), where events {X = xi } decompose Ω.
i=1

Hint
Define the random variable
(
X if X ∈ {x1 , x2 , . . . , xn }
Yn =
0 if X ∈ {xn+1 , xn+2 , . . .},

and observe that Yn < X , for every ω ∈ Ω and some n ∈ N. Then, use formula (2) to
write E(Yn ). Eventually, use Lemma 1 and the Mon. Conv. Th.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

So far we seen that expectation is linear. But what happen if we try to

compute E(X1 + X2 + · · · ), for an infinite sequence of nonnegative rv’s?

Exercise
Show that
E(X1 + X2 + · · · ) = E(X1 ) + E(X2 ) + · · ·
Hint Pn
for n ∈ N. Then apply the Monotone
Write the partial sum Sn = i=1 Xi P
∞
Convergence Theorem to the series i=1 Xi = limn→∞ Sn .
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Remark
By Exercise 12, to guarantee that the expectation of a discrete rv X is
finite we must impose:
∞
X
(8) |xi |P(X = xi ) < ∞.
i=1

Indeed, X is not necessarily > 0 so we can decompose it by

X = X + − X − , where the positive and negative parts are nonnegative
discrete rv’s. Condition (8) resembles that of absolute convergence of
numerical series (note that |a b| = |a| |b|, with a = xi and
b = P(X = x) > 0).
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Example
Let X be a rv with X (Ω) = {1, 2, . . .}, and let PX be given by masses
1
P(Xi = i) = i(i+1)
∈ (0, 1), i = 1, 2, . . . .
Pn 1
Observe that sn = i=1 i(i+1) may be written as:
1 1 1
sn = 1·2
+ 2·3
+ ··· + n(n+1)

1 1 1 1 1 1 1

= 1− 2
+ 2
− 3
+ ··· + n−1
− n
+ n
− n+1
1
= 1− n+1
.
Thus the total mass is limn→∞ sn = 1. Unfortunately, the expectation of X is infinite
for the condition (8) is not fulfilled:
n
X ∞
X
1 1
lim |i| · i(i+1)
= i+1
= ∞.
n→∞
i=1 i=1

We drop the absolute signs and consider the second sum as the harmonic series, which
is known to be divergent.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Example (Expectation of Binomial Rv’s)

Let XP∼ Bin(n, p) be a Binomial rv. Then E(X ) equals np. Indeed,
X = ni=1 i I{X =i} and we have:
n n n
X X n!
E(X ) = i p i (1 − p)n−i = i p i (1 − p)n−i
i=0 |
i i=0
i!(n − i)!
{z }
=P(X =i)
n
X (n − 1)!
= np p i−1 (1 − p)n−i
i=0
(i − 1)!(n − i)!
n−1
X n − 1 k
= np p (1 − p)n−1−k
k=0
k

= np(p + (1 − p))n−1 = np.

Exercise

Let X ∼ N(µ, σ 2 ) be a Gaussian rv, Y ∼ U(a, b) be a uniform rv and Z ∼ Esp(λ) be

an exponential rv. Show that µ = E(X ), E(Y ) = a+b
2
and E(Z ) = λ1 .
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Expectation as an Integral
R
We are going to show that E(X ) = Ω X dP, i.e. expectation is an
integral. More precisely, it is a Lebesgue type integral. A fundamental
argument is the following:
Theorem

There exists a unique measure ` : B → [0, ∞] called the Lebesgue

measure, which assigns to each interval (a, b] ∈ B its length
`((a, b]) = b − a.
Recall that there is a 1-to-1 correspondence between probability measures P and df’s
F . This framework can be made more general simply by dropping normalization
P(Ω) = 1, so that P is a nonnegative measure (a countable additive set functions),
and by requiring only càdlàg functions, i.e. giving up the asymptotic behavior of F
and then boundedness in [0, 1]. Thus, a proof of Theorem 3 might be based on the
choice F (x) = x for the corresponding càdlàg function with P = `.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

A Remark

A measure m on (R, B) such that m((a, b]) < ∞, for finite a, b ∈ R (viz.
Lebesgue-Stieltjes measure, see later in this Lecture), gives raise to a
càdlàg function as follows:

m((0, x]), if x > 0
F (x) =
−m((x, 0]), if x 6 0.

It can be shown (exercise for the student) that m((a, b]) = F (b) − F (a)
and F is indeed right-continuous and nondecreasing.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Exercise
Show the following.
1 The Lebesgue measure ` is not finite, i.e. `(R) = ∞, but it is
σ-finite.
2 ` is diffusive.
3 `([a, b)) = `((a, b)) = `([a, b]), for finite a < b.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Lebesgue Integral in R

Another important concept is that of a Borel-measurable function

f : R → R, i.e. f −1 ((−∞, x]) =: {f 6 x} ∈ B for every x ∈ R. Indeed,
such a function is defined between the measurable space (R, B) and itself.

We have to build a more general integration process starting from

nonnegative real-valued functions on [a, b]. Thus, we need a quick review
of Riemann integration.
In the sequel we will also need the analogous of an indicator rv. An indicator of Borel
sets is simply IB (x) = 1 iff x ∈ B, and zero otherwise. In particular, B may be an
interval.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Intermezzo 2: Uniform Probability Spaces

Let ((0, 1], B(0,1] ) be the event space for drawing at random a number
within the unit interval. Now, take the Lebesgue measure ` restricted to
B(0,1] .

`(B) is the probability that a number 0 < x 6 1 lies exactly in the Borel
set B ∈ B(0,1] . Clearly `((0, 1]) = 1.

If B = {q1 , q2 , . . .} is a countable subset of (0, 1] we then have `(B) = 0.

Of course, ((0, 1], B(0,1] , `) is a uniform continuous probability space, i.e.
a nonzero probability is put only on Borel sets greater than a singleton
{x} such as intervals [a, b] ⊂ (0, 1].
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Example
With the above in mind we arein position to show that there exists a rv X < ∞ but
E(X ) = ∞. Let (0, 1], B(0,1] , ` be a continuous uniform probability space, where ` is
meant as the restricted Lebesgue measure. Define X = ∞ i
P
i=1 2 I(2−i ,2−(i−1) ] > 0,
−i −(i−1)

where the dyadic intervals 2 , 2 decompose (0, 1]. This is a discrete rv
becausePits range X ((0, 1]) is countable. Now, choose n ∈ N and consider the simple
n
rv Z = i=1 2i I(2−i ,2−(i−1) ] > 0. We have Z 6 X for every ω ∈ (0, 1] and by formula
(2):
n
X n
X
2i · ` 2−i , 2−(i−1) 2i · (2−(i−1) − 2−i )

E(Z ) = =
i=1 i=1
Xn
= 2i · 2−i = n.
i=1

Thus by formula (4) E(X ) > E(Z ) for every n ∈ N so that E(X ) = ∞.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Riemann vs Lebesgue Integration /1 *

Let f : [a, b] → R be a nonnegative bounded function. Recall that when
f is Riemann-integrable the lower sums and the upper sums, respectively
n
X
LPn = L(Pn , f ) := mi (xi − xi−1 ), mi = inf f (x) : x ∈ [xi−1 , xi ]
i=1

n
X
UPn = U(Pn , f ) := Mi (xi − xi−1 ), Mi = sup f (x) : x ∈ [xi−1 , xi ] ,
i=1

must coincide as partitions Pn = {a = x0 < x1 < . . . < xn = b} refine

each other, i.e. they get finer and finer, Pn ⊂ Pn+1 , and the mesh
kPn k = max{(xi − xi−1 ) : i = 1, . . . , n} goes to zero for large n ∈ N.
Thus: Z b
lim LPn = lim UPn = f (x)dx.
n→∞ n→∞ a
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

In fact, the set {LPn : Pn be a partition} is bounded above and the set
{UPn : Pn be a partition} is bounded below. It can be shown that f is
Riemann-integrable iff
(Z )
Z b b
f (x)dx = sup g (x)dx : 0 6 g 6 f
a a

where g is a step function, i.e. g (x) = ci for x ∈ (xi−1 , xi ) so it is

Rb Pn
constant over subintervals in [a, b], and a g (x)dx := i=1 ci (xi − xi−1 ).
Rb
E.g., choosing ci = mi we have that the integral a g (x)dx of a step
function equals the lower sum LPn .
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Riemann Integration suffers Pathologies

Example
Let f (x) = IQ∩[a,b] (x) be 1 if x is a rational number lying in [a, b] and zero otherwise.
This function is not Riemann-integrable. Indeed, any lower sum LPn is zero and any
upper sum UPn is 1. Thus, taking finer partitions and sending the mesh to zero as
n → ∞ we have LPn < UPn .

Example
Let (fn (x))n∈N be a sequence of functions all defined on [a, b]. Assume pointwise
convergence to the limit function f (x), i.e. fn (x) → f (x) as n → ∞ for every
x ∈ [a, b]. Then, it can be shown that
Z b Z b Z b
lim fn (x)dx = lim fn (x)dx = f (x)dx
n→∞ a a n→∞ a

does not always hold.

Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Riemann vs Lebesgue Integration /2 *

To intergate more general functions than the bounded ones, let f : [a, b] → R be a
nonnegative Borel-measurable map. Then fix a partition Pn = {y0 < y1 < . . . < yn }
of the codomain [0, ∞) of f > 0, such that Ji = [yi−1 , yi ) are pairwise disjoint
right-open intervals. Observe that every Ji contains some f (x), for x ∈ [a, b].

Next, take inverse images f −1 (Ji ) = {yi−1 6 f < yi } = Ai and define a nonnegative
step function g (x) = c1 IA1 + · · · + cn IAn , where clearly the Borel sets Ai ’s decompose
P x ∈ [a, b]. Furthermore,
[a, b] and 0 6 ci = inf Ji . This entailsR g (x) 6 f (x), each
define the Lebesgue integral of g by [a,b] g (x)d` := ni=1 ci `(Ji ). We say that f is
Lebesgue-integrable iff Z
lim LePn = f (x)d`,
n→∞ [a,b]
R
where LePn is nothing but the lower-Lebesgue sum equals to [a,b] g (x)d`.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

y = g(x)
c4

c2
c1
c5

a x1 x2 x3 b

P5
Figure: Step function g (x) = i=1 ci IAi (x), with
A1 = [a, x1 ), A2 = {x1 }, A3 = (x1 , x2 ), A4 = [x2 , x3 ), A5 = [x3 , b]
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Riemann vs Lebesgue Integration /3 *

The limit above is understood as n → ∞ and the mesh

kPn k = max{(yi − yi−1 ) : i = 1, . . . , n} goes to zero. If each Ai is an interval we
clearly have LPn = LePn . Step functions are obviously Borel-measurable.

It is easily seen that LePn 6 LePn+1 . So to state the integrability condition we write:
Z b Z b
f (x)d` = sup g (x)d` : 0 6 g 6 f , g step function.
a a

The supremum might be ∞.

Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

c5 y = f (x)

c4
c2
c3

c1
y = g (x)

a x1 x2 x3 x4 b

P5
Figure: Bounded nonnegative function f (x). The step function g (x) = i=1 ci IAi (x),
approximates f from below, with
A1 = [a,x1 ), A2 = [x1 ,
x2 ), A3 = [x2 , x3 ), A4 = [x3 , x4 ), A5 = [x4 , b] and
ci = inf f (x) : x ∈ Ai = inf Ji
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Riemann vs Lebesgue Integration /4 *

For more general domains than [a, b], such as Borel sets B ⊂ R, we always can define:
Z
f (x)d`, with f > 0.
B

For more general Borel measurable functions f we apply the integration process

separately to the positive part f + := max{0, f } and to the negative part

f − := max{0, −f } = − min{0, f } and then take the difference
Z Z Z
f (x)d` = f + (x)d` − f − (x)d`.
B B B
R
A function f is Lebesgue integrable iff B |f (x)|d` < ∞, where as usual
+ −
|f | = f + f . We often write dx instead of d`.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Example
R
The function f (x) = IQ∩[a,b] (x) has Lebesgue integral [a,b]
f (x)d` = 0.

Write f = 1 · IQ∩[a,b] + 0 · I(Q∩[a,b])c (x). Then, compute the integral as

1 · `(Q ∩ [a, b]) + 0 · `((Q ∩ [a, b])c ) which is equal to zero!

Indeed, Q ∩ [a, b] = {q1 , q2 , . . .} is the countable set of all rational points

within [a, b].
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Riemann vs Lebesgue Integration /5

Indeed, we have the following result:

Theorem

Let f : [a, b] → R with −∞ < a < b < ∞.

(i) If f is Riemann-integrable over [a, b], then it is Lebesgue-integrable and both the
integrals coincide.
(ii) If f is bounded in [a, b], then it is Riemann-integrable iff it is almost everywhere
(a.e.) continuous.

Recall that f is a.e. continuous over a Borel subset B ⊂ R whenever the set
{x ∈ B : f (x) is not continuous} =: {f is not continuous}
has zero Lebesgue measure. As a corollary we get: The Riemann integral of every
continuous f coincides with the Lebesgue integral.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Some Properties of the Lebesgue Integral

For Borel-measurable functions f , g : R → R the following properties hold:
R
(i) If f (x) = c for every x, then it is Lebesgue integrable and R f (x)d` = c.
(ii) If
R both f , g are Lebesgue integrable
R R b ∈ R, then
and a,
R {a f (x) + b g (x)}d` = a R f (x)d` + b R g (x)d`.
(iii) R
If f (x) 6 g (x)
R for every x, and both functions are Lebesgue integrable, then
R f (x)d` 6 R g (x)d`.
(iv) R f (x)d` = ∞
R P R
i=1 Bi f (x)d` provided that B1 , B2 , . . . form a partition of R, i.e.
Bi ∩ Bj = ∅ for i 6= j and ∪∞ i=1 Bi = R.
(v) For every sequence of nonnegative Borel-measurable functions f1 , f2 , . . . it holds
R P ∞ P∞ R
R n=1 fn d` = n=1 R fn d`.
R R
(vi) R f (x)d` 6 R |f (x)|d`
Properties (i) and (iii) can be stated in the a.e. sense, with the Lebesgue measure
being zero over the sets {f 6= c} and {f > g }.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

The Probabilistic Connection /1

The steps in the construction of E(X ) are the same as in the definition of a Lebesgue
integral.
Start with nonnegative simple rv’s P
(measurable random functions taking on a
finite number of values) and build ni=1 zi P(Ai ), where zi stands for ci ’s of step
functions; ` is replaced by P.
This is nothing but the ‘lower sum’ of simple rv’s Z , which are less than or equal
to a rv X > 0.
Take the supremum of such ni=1 zi P(Ai ) over all ‘partition’ of Ω, leading to
P
sequences of simple rv’s Zn that better approximate X (viz. converging
pointwise to X ).

It is apparent that the further advantage of Lebesgue integration is its applicability to

measurable functions defined over abstract domains (such as rv’s). Thus,
Z Z Z
E(X ) = X dP = X + dP − X − dP.
Ω Ω Ω
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

The Probabilistic Connection /2

With all the above in mind we can rephrase the definition of a continuous probability
distribution on (R, B) induced by a rv X . WeR need the density % : R → [0, ∞) to be a
nonnegative Borel-measurable function with R %(x)d` = 1.

The probability of the Borel set B ⊂ R translates to the probability of the event
{X ∈ B} and is written:
Z
PX (B) = P(X ∈ B) = %(x)d`.
B
R
In particular, for B = (a, b] we have (a,b] %(x)d`. Clearly, if % is in addition a
continuous function the Lebesgue integral used in defining these probabilities is equal
to the Riemann integral is in addition B is an interval.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Remark
R
To show that the set function B 7→ B %(x)d` defined over the Borel σ-algebra B is
actually a probability measure we only P need a pairwise disjoint sequences B1 , B2 , . . . of
Borel sets and the property I∪∞ n=1 Bn
= ∞ n=1 IBn of indicator functions, together with
property (v) of the Lebesgue integral:
Z
PX (∪∞n=1 B n ) = I∪∞
n=1 Bn
%(x)d`
R
∞
Z X
= IBn %(x)d`
R n=1
∞ Z
X ∞
X
= IBn %(x)d` = PX (Bn ).
n=1 R n=1
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Integrating over Subsets

In the previous Remark we used the following fact. For a rv X take its
R A ∈ F, then write its expected value
truncated version X IA for any event
E(X IA ) as the Lebesgue integral A X dP. Observe that X IA = X for
every ω ∈ A.
Exercise
R
Show that E(X IA ) is in fact equal to A X dP.

Hint
Take the restriction of X to the event A, then define the expectation above over the
restricted probability space (A, FA ), with FA the trace σ-algebra of subsets A ∩ B for
P(A∩B)
all B ∈ F . Use the probability measure PA (B) := P(B) for every B ∈ FA and all the
steps defining E( · ).
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Change of Variable in the Integral

Finally we arrive to:

Theorem
Let X be a rv with distribution PX and let g : R → R be a
Borel-measurable function. Then we have
Z
(9) E(g (X )) = g (x)dPX ,
R

provided that either side is well-defined.

Taking expectation
R is meant as
R∞the step by step procedure studied so far.
Note that R may be written −∞ .
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Before to prove the Theorem above let you do the following:

Exercise
Let X : Ω → R be a rv over the event space (Ω, F ). Let:
(
1 if x ∈ B
IB (x) = for every Borel set B ∈ B.
0 if x ∈ B c ,

Then, the composed map IB ◦ X = IB (X ) equals the indicator of the event {X ∈ B}.

The key idea in the above Exercise is:

Theorem

Let (Ω, F ) be an event space and let (R, B) be the real measurable space. Define the
rv X : Ω → R and the Borel-measurable function g : R → R. Then the composed map
Z = g ◦ X : Ω → R is a rv.

The proof is left as an exercise.

Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Proof.
First
R we check Rformula (9) for indicators g (X ) = IB (X ), where B ∈RB. We have
Ω IB (X )dP = Ω I{X ∈B} dP = E[I{X ∈B} ] = P(X ∈ B) = PX (B) = R IB (x)dPX . By
linearity, formula (9) holds for g (X ) ∈ S+ . If g (X ) is any rv, then we apply the
Monotone Convergence Theorem to both g (X )+ and g (X )− as follows:
Z Z
g (X )dP = lim(gn+ (X ) − gn+ (X ))dP
Ω Ω n
Z Z
= lim gn+ (X )dP − lim gn− (X )dP
n Ω n Ω
Z Z
= lim gn+ (x)dPX − lim gn− (x)dPX
n R n R
Z
= lim(gn+ (x) − gn+ (x))dPX
R n
Z Z
= (g + (x) − g − (x))dPX = g (x)dPX ,
R R

where gn+ (X ) ↑ g + (X ), gn− (X ) ↑ g − (X ) pointwise as n → ∞ and gn+ (X ), gn− (X ) are

taken from S+ .
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Consequences

Theorem 5 enable us to compute the expectation of a rv X by means of

a numerical Rintegral over the real line. Finding E(g (X )) should requires
to consider Ω g (X )dPg (X ) , since in general g (X ) is a new rv with
different distribution Pg (X ) . Formula (9) is instead based on the old PX .
Moreover, by the 1-to-1 correspondence between probability measures
and df’s we can safely replace dPX by dFX .
R∞
Taking the identity function g (x) = x we clearly get E(X ) = −∞ xdFX .
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

If the probability distribution PX is continuous we get the following result.

Corollary

Let X be a random variable with continuous distribution PX having

density %(x). Then, the expectation of X is the Lebesgue integral
Z
(10) E(g (X )) = g (x)%(x)d`
R

whenever g : R → R is Borel-measurable.

When both sides of equation (10) are finite, Corollary 1 says that the
Borel function g (x)%(x) is integrable.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Example

Let X ∼ N(0, 1) be a standard Gaussian rv. Transforming X by the

x
Borel-measurable (indeed continuous) function
R ∞ gx(x) = e , we get the
X
new rv Y = e . The expectation E(Y ) is −∞ e %(x)dx, where
2
%(x) = √1 e−x /2 is the density function of PX .
2π
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Exercise
Let X , Y be rv’s with finite expectation and let g : R → R be a
Borel-measurable function such that g (X ), g (Y ) have finite expectation
a.s.
too. Prove in an alternative way that if X = Y then
E(g (X )) = E(g (Y )).

Hint
Use the Change of Variable Formula.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Exercise
Let X be a rv overPa probability space (Ω, F, P). Define the nonnegative
n
simple rv g (X ) = i=1 ai I{X ∈Bi } , where ai > 0 and ∪ni=1 Bi = R is a
finite partition of R. The function g : R → R is clearly Borel-measurable.
R
Assume E(g (X )) < ∞ and show that the expectation equals R g (x)dPX .
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Exercise
Assume X = c for every ω ∈ Ω. Use the Change of Variable Formula to
compute E(g (X )) as:
Z
(11) g (X )dP = g (c).
Ω

Hint
Use the Dirac-delta measure PX (B) = δc (B), for Borel sets B ⊂ R and
c ∈ R.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Summary

To compute the expectation of a rv X we use the formula

Z ∞
E(X ) = x dFX .
−∞

Thus we have two practical cases.

FX is (absolutely) continuous and % is continuous ⇒
dFX (x) =R∞%(x)dx (by the Fundamental Theorem of Calculus) and
E(X ) = −∞ x%(x)dx, a Riemann integral.
P∞ ⇒ dFX (x) = FX (xi ) − FX (xi−1 ) and
FX is ‘discrete’
E(X ) = i=1 xi P(X = xi ).
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Not Only a Matter of Notation

Lebesgue measure and integration provides us with a powerful tool for manipulating
(symbolically or numerically) expectations. From now on we will write dx in the
Lebesgue integral, if no confusion arises on what type of integral is involved.

dP = P(dω) ⇐⇒ dPX = P(dx) ⇐⇒ dFX = FX (dx).

R
When a density arises, PX (B) = B %(x)dx, the probability distribution
R of X acts on
‘infinitesimal’ Borel set dx through PX (dx) = %(x)dx, i.e. PX (B) = B dPX .

From E(IA ) = P(A), let A = {X ∈ B}. Thus:

Z Z
I{X ∈B} dP = PX (B) = dFX .
Ω B

The probability of a Borel subset B over (R, B) can be computed by integrating dFX
here.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Lebesgue-Stieltjes Integral
R∞
The integral −∞
xdFX defining E(X ) is of a special kind. It originates
as follows.
1 Define a Lebesgue-Stieltjes measure on (R, B), i.e. a countably
additive set function m generated by a càdlàg function F as
m((a, b]) = F (b) − F (a) for finite reals a < b, and such that m(I ) is
finite for bounded intervals I ⊂ R.
2 Define a Borel-measurable function g : R → R.
RBuild the Lebesgue-type integral of g step by step to get
3
∞
−∞
g (x)dF .
4 This is called the Lebesgue-Stieltjes integral.
5 To get the expectation set g (x) = x and consider only càdlàg
functions F such that F (−∞) = 0 and F (∞) = 1, i.e. df’s.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Riemann-Stieltjes Integral

R∞
g (x)dF using the Riemann approach. If ab g (x)dx exists,
R
Now let us construct −∞
Pn
then it is approximately i=1 g (ci )(xi − xi−1 ) where ci ∈ (xi−1 , xi ] and [a, b] is
partitioned by points {a = x0 < x1 < · · · < xn = n}. Here the ‘representative value’
g (ci ) of g over each (xi−1 , xi ] is weighted by `((xi−1 , xi ]). Recall that ` is generated
by the càdlàg function F (x) = x.

Instead,
Pn we can use a different ‘weight function’ F (x) and form the sum
i=1 g (ci )(F (xi ) − F (xi−1 )). If for n → ∞ the sum converges to a limit, we call it
the Riemann-Stieltjes integral of g with respect to F . It is crucial that g (x) and F (x)
have not discontinuities at the same points x.
Monotone Convergence Theorem
Expected Value: The Starting Point
Change of Variable Formula
Two Important Theorems
Integrals of the Stieltjes Type

Just to Remind You

R∞
The improper Riemann integral a g (x)dx exists iff
Rb
K (b) := a g (x)dx has finite limit limb→∞ K (b). Similarly, we can
Rb
define other improper Riemann integrals, such as −∞ g (x)dx and
R∞ R∞
−∞
g (x)dx. Thus, the Riemann-Stieltjes integral a g (x)dF (x)
can be defined over the whole R.
In writing Lebesgue-Stieltjes integrals
R over bounded intervals, the
end-points matter. For example, (a,b] g (x)dF differs from
R
[a,b]
g (x)dF , and similarly for [a, b) and (a, b). This is because the
underlying construction is based on càdlàg functions F that might
be step functions with F (x) − F (x−) 6= 0.
Sufficient conditions for the existence of the Riemann-Stieltjes
integral: g is continuous and F is increasing, or
F is expressed as the difference of two increasing functions.

Durrett Probability Theory and Examples Solutions PDF
73% (15)
Durrett Probability Theory and Examples Solutions PDF
122 pages
(Courant Lecture Notes) S. R. S. Varadhan - Probability Theory-Amer Mathematical Society (2001) PDF
No ratings yet
(Courant Lecture Notes) S. R. S. Varadhan - Probability Theory-Amer Mathematical Society (2001) PDF
178 pages
Applied Stochastic Processes: M. Ottobre
No ratings yet
Applied Stochastic Processes: M. Ottobre
164 pages
Xerox®AltaLink®B8170F SM May2020
100% (1)
Xerox®AltaLink®B8170F SM May2020
1,284 pages
Probability Theory and Random Processes (MA 225) : Class Notes
No ratings yet
Probability Theory and Random Processes (MA 225) : Class Notes
35 pages
Notes Mainimp
No ratings yet
Notes Mainimp
164 pages
Lesson 1
No ratings yet
Lesson 1
31 pages
mcnotes51
No ratings yet
mcnotes51
8 pages
MA225 L3 Notes
No ratings yet
MA225 L3 Notes
40 pages
Probability Theory (MATHIAS LOWE)
No ratings yet
Probability Theory (MATHIAS LOWE)
69 pages
Cheatsheet Probability and Statistics
100% (1)
Cheatsheet Probability and Statistics
10 pages
Mathematical Methods Notes
No ratings yet
Mathematical Methods Notes
230 pages
Selective Review - Probability
No ratings yet
Selective Review - Probability
30 pages
Probability and Statistics: Cheat Sheet
100% (1)
Probability and Statistics: Cheat Sheet
10 pages
4SP_LectureNotes_v3 (1)
No ratings yet
4SP_LectureNotes_v3 (1)
45 pages
Econometric S If All 2020
No ratings yet
Econometric S If All 2020
119 pages
Chap2 PDF
No ratings yet
Chap2 PDF
20 pages
Maa 203 Cheat Sheet Lucien Walewski
No ratings yet
Maa 203 Cheat Sheet Lucien Walewski
2 pages
Introduction To Probability Theory
No ratings yet
Introduction To Probability Theory
8 pages
An Introduction To Probability Theory
100% (1)
An Introduction To Probability Theory
91 pages
Orf526 f24 Lec3
No ratings yet
Orf526 f24 Lec3
5 pages
Ito Shit
No ratings yet
Ito Shit
35 pages
Introduction To Probability Theory
No ratings yet
Introduction To Probability Theory
9 pages
Financial Mathematics (Lectures)
No ratings yet
Financial Mathematics (Lectures)
64 pages
Prob Notes
No ratings yet
Prob Notes
70 pages
Hitchhiker S Guide To Probability
No ratings yet
Hitchhiker S Guide To Probability
6 pages
Week8 Notes
No ratings yet
Week8 Notes
18 pages
Cs229 Probability Review
No ratings yet
Cs229 Probability Review
36 pages
Basic Statistics and Probability For Econometrics Econ 270a
No ratings yet
Basic Statistics and Probability For Econometrics Econ 270a
18 pages
study-of-the-mellin-integral-transform-with-applications-in-statistics-and-probability
No ratings yet
study-of-the-mellin-integral-transform-with-applications-in-statistics-and-probability
17 pages
Final Print (1) PRINT
No ratings yet
Final Print (1) PRINT
25 pages
MIT6 262S11 Lec02
No ratings yet
MIT6 262S11 Lec02
11 pages
Inequalites Mso205
No ratings yet
Inequalites Mso205
5 pages
MScFE 620 DTSP - Video - Transcripts - M1
No ratings yet
MScFE 620 DTSP - Video - Transcripts - M1
14 pages
Review Notes - Probability
No ratings yet
Review Notes - Probability
16 pages
Week3 Lecture1 PDF
No ratings yet
Week3 Lecture1 PDF
27 pages
Sia Notes 2013
No ratings yet
Sia Notes 2013
279 pages
Lecturenotes3 4 Probability
No ratings yet
Lecturenotes3 4 Probability
14 pages
LN SP2018
No ratings yet
LN SP2018
139 pages
Lecture Notes Fall Term 2013
No ratings yet
Lecture Notes Fall Term 2013
40 pages
brownian_motion_and_ito_calculus
No ratings yet
brownian_motion_and_ito_calculus
19 pages
Probability Theory I: CAM 384K Concepts
No ratings yet
Probability Theory I: CAM 384K Concepts
14 pages
Week3 Notes
No ratings yet
Week3 Notes
7 pages
Lecture 3
No ratings yet
Lecture 3
4 pages
StochasticModels 2011 Part 2 v1
No ratings yet
StochasticModels 2011 Part 2 v1
22 pages
MIT6 436JF18 Lec04
No ratings yet
MIT6 436JF18 Lec04
15 pages
Excercise and Solution Manual
0% (1)
Excercise and Solution Manual
20 pages
Lecture 09
No ratings yet
Lecture 09
15 pages
Foss Lecture1
No ratings yet
Foss Lecture1
32 pages
Notes
No ratings yet
Notes
47 pages
probability_and_statistics_22s_soln
No ratings yet
probability_and_statistics_22s_soln
4 pages
Skript 2022
No ratings yet
Skript 2022
112 pages
Continuous Time
No ratings yet
Continuous Time
62 pages
Discrete Time
No ratings yet
Discrete Time
106 pages
Set Theory Essentials
From Everand
Set Theory Essentials
Emil Milewski
No ratings yet
Theory of Approximation
From Everand
Theory of Approximation
N. I. Achieser
No ratings yet
Lectures on Measure and Integration
From Everand
Lectures on Measure and Integration
Harold Widom
No ratings yet
Square Summable Power Series
From Everand
Square Summable Power Series
Louis de Branges
5/5 (1)
Complex numbers
From Everand
Complex numbers
Alessio Mangoni
No ratings yet
Probability Theory: A Concise Course
From Everand
Probability Theory: A Concise Course
Y. A. Rozanov
4/5 (2)
Group Theory I Essentials
From Everand
Group Theory I Essentials
Emil Milewski
No ratings yet
Riemann–Hilbert correspondence for irregular holonomic D-modules? Masaki Kashiwara
No ratings yet
Riemann–Hilbert correspondence for irregular holonomic D-modules? Masaki Kashiwara
33 pages
978-3-662-02661-8_4
No ratings yet
978-3-662-02661-8_4
56 pages
Riemann–Hilbert correspondence for irregular holonomic D-modules? Masaki Kashiwara
No ratings yet
Riemann–Hilbert correspondence for irregular holonomic D-modules? Masaki Kashiwara
37 pages
Math Competition Group Class: Final, June 3rd 2017
No ratings yet
Math Competition Group Class: Final, June 3rd 2017
3 pages
Localizations for quiver Hecke algebras III MathematischeAnnalen Masaki Kashiwara1,2 · Myungho Kim3 · Se-jin Oh4 · Euiyong Park5
No ratings yet
Localizations for quiver Hecke algebras III MathematischeAnnalen Masaki Kashiwara1,2 · Myungho Kim3 · Se-jin Oh4 · Euiyong Park5
34 pages
Belhard Study Guide
No ratings yet
Belhard Study Guide
100 pages
Discrete PDF
No ratings yet
Discrete PDF
1 page
Place To Stand Part 1
No ratings yet
Place To Stand Part 1
40 pages
Math4120 Lecture-1-01 H
No ratings yet
Math4120 Lecture-1-01 H
12 pages
AMC-10 Group Class: Midterm, November 4th 2017
No ratings yet
AMC-10 Group Class: Midterm, November 4th 2017
4 pages
Problem Set 1 PDF
No ratings yet
Problem Set 1 PDF
10 pages
Probability PDF
No ratings yet
Probability PDF
1 page
Number Theory Intro: 1 Preparatory Meterials
No ratings yet
Number Theory Intro: 1 Preparatory Meterials
7 pages
Multiplication Notes Problems and Keys
No ratings yet
Multiplication Notes Problems and Keys
5 pages
M7 Answer and Explanations
No ratings yet
M7 Answer and Explanations
22 pages
M7 No Answer Key
No ratings yet
M7 No Answer Key
13 pages
Analytic Geometry
100% (3)
Analytic Geometry
219 pages
Practice Test 8 Math Level 2: Answer Sheet
No ratings yet
Practice Test 8 Math Level 2: Answer Sheet
22 pages
Practice Test 8 Math Level 2: Answer Sheet
No ratings yet
Practice Test 8 Math Level 2: Answer Sheet
13 pages
Excaliburv20 n1
No ratings yet
Excaliburv20 n1
4 pages
ARML 2014 Contest Final Version
No ratings yet
ARML 2014 Contest Final Version
36 pages
Everything Maths Grade 12
100% (1)
Everything Maths Grade 12
199 pages
Practice Test 5 Questions: 310 Part Iii / Eight Practice Tests
No ratings yet
Practice Test 5 Questions: 310 Part Iii / Eight Practice Tests
11 pages
Answer Key: Practice Test 5 321
No ratings yet
Answer Key: Practice Test 5 321
8 pages
Ratio and Proportion Shortcuts
No ratings yet
Ratio and Proportion Shortcuts
5 pages
AMC 8 Handy Facts
No ratings yet
AMC 8 Handy Facts
7 pages
Coordinate Geometry: Points
No ratings yet
Coordinate Geometry: Points
9 pages
Trigonometry Olympiad Problems
No ratings yet
Trigonometry Olympiad Problems
7 pages
Art Subject For Elementary - 2nd Grade - Visual Arts by Slidesgo
No ratings yet
Art Subject For Elementary - 2nd Grade - Visual Arts by Slidesgo
56 pages
Sanyo PLV-Z4 Service Manual
No ratings yet
Sanyo PLV-Z4 Service Manual
138 pages
C5 Air Condition AC
No ratings yet
C5 Air Condition AC
15 pages
2025_IsiZulu_Grade 4_Mathematics_ATP_Term1 & 2
No ratings yet
2025_IsiZulu_Grade 4_Mathematics_ATP_Term1 & 2
4 pages
General Technical-Parker/fittings
No ratings yet
General Technical-Parker/fittings
34 pages
November 2024 Sky Watch
No ratings yet
November 2024 Sky Watch
1 page
Mla In-Text Citation
No ratings yet
Mla In-Text Citation
15 pages
Download ebooks file Values and Principles in European Union Foreign Policy Sonia Lucarelli all chapters
100% (3)
Download ebooks file Values and Principles in European Union Foreign Policy Sonia Lucarelli all chapters
45 pages
FTMS PDF
No ratings yet
FTMS PDF
19 pages
Line Coding & PSD
No ratings yet
Line Coding & PSD
10 pages
TDS ASTM TYPE I II (OCTOBER 2024)
No ratings yet
TDS ASTM TYPE I II (OCTOBER 2024)
1 page
Humidity
No ratings yet
Humidity
18 pages
Grade 4 6 Mathematics 5
No ratings yet
Grade 4 6 Mathematics 5
22 pages
Example of Scientific Paper For IABO 2021
No ratings yet
Example of Scientific Paper For IABO 2021
6 pages
Thesis Work in Germany
100% (3)
Thesis Work in Germany
6 pages
2017NECB-V2 National Energy Code of Canada For Buildings
No ratings yet
2017NECB-V2 National Energy Code of Canada For Buildings
341 pages
MELCS Unpacking2
No ratings yet
MELCS Unpacking2
2 pages
Group Assignment - Lesson Plan - TBLT
No ratings yet
Group Assignment - Lesson Plan - TBLT
13 pages
Migraine Inhaler Sds
No ratings yet
Migraine Inhaler Sds
8 pages
Untitled
No ratings yet
Untitled
7 pages
Calculating Module Voltages
No ratings yet
Calculating Module Voltages
2 pages
Experiment - 1: Aim: To Determine The Least Count of The Vernier Callipers and Measure The Length and Diameter of A Small Cylinder. Apparatus
100% (1)
Experiment - 1: Aim: To Determine The Least Count of The Vernier Callipers and Measure The Length and Diameter of A Small Cylinder. Apparatus
7 pages
List of Graduates Bscrim S.Y 2023-2024
No ratings yet
List of Graduates Bscrim S.Y 2023-2024
6 pages
TALACOGON WEST and EAST SMEA 1ST QUARTER 2022
No ratings yet
TALACOGON WEST and EAST SMEA 1ST QUARTER 2022
71 pages
Jefferson 1988
No ratings yet
Jefferson 1988
24 pages
Lightning Event Counter: Accessories Down-Conductors and Accessories
No ratings yet
Lightning Event Counter: Accessories Down-Conductors and Accessories
1 page
1-BLDG 471 Intro HVAC System Design
No ratings yet
1-BLDG 471 Intro HVAC System Design
15 pages
SY Binary Numbers
No ratings yet
SY Binary Numbers
14 pages
5KG Small Scale Load Cell Weighing Pressure Sensor With A-D HX711AD Adapter (Banggood)
No ratings yet
5KG Small Scale Load Cell Weighing Pressure Sensor With A-D HX711AD Adapter (Banggood)
5 pages

Damiano Rossello: DEB University of Catania

Uploaded by

Damiano Rossello: DEB University of Catania

Uploaded by

Expected Value: The Starting Point

Two Important Theorems

Probability For Finance

1 Expected Value: The Starting Point

2 Two Important Theorems

0.35 900 0.65 1000

900 335 1000

We can extend the calculation of X to more elaborated rv’s. The key

The random function IA is a rv iff A ∈ F (exercise for the student).

Essentially, indicator functions establish a 1-to-1 correspondence between

Note that the rv X in Example 1 may be defined as X = IA , with

Linear Combinations of Indicators

Any constant random function induces the partition {∅, Ω} of Ω by

Recall that (see Lecture 3) a discrete rv may take on a finite number of

(1) X = x1 · IA1 + · · · + xn · IAn ,

where Ai = X −1 ({xi }) = {X = xi }, i = 1, . . . , n. Here the partition is

Intermezzo 1: Preserving Measurability

Theorem (Measurability on a Generator)

Thus by Theorem 1 and the above, to verify that a random function

By Formula (1), combining n indicator functions over a partition

(2) E(X ) := x1 P(A1 ) + · · · + xn P(An ) > 0.

This is well-defined and finite. As a special case we have

(3) E(IA ) := 1 · P(A) + 0 · P(Ac ) = P(A)

for X = IA , so the expectation coincides with the probability of A.

Exercise (The Set S is a Vector Space)

Exercise (Properties of E(X ))

Exercise (Inclusion-Exclusion Formula Revisited)

Let S+ ⊂ S be the subset of nonnegative simple rv’s. The expectation of

The correctness of formula (4) is based on the following result:

Whence, the definition of E(X ) for a nonnegative rv X can be restated as:

Expectation and limits can be exchanged.

Remark (Almost Sure Comparisons)

while FX (c) = P(X 6 c) = 1. This implies P(X = c) = FX (c) − FX (c−) = 1 and X

Expectation: Final Step

X + := max{0, X }, X − := max{0, −X } = − min{0, X }.

Observe that X + , X − > 0. We call X + the positive part of X and X −

(5) E(X ) = E(X + ) − E(X − ) < ∞.

Manipulating Expectation to the Limit

The following theorem is of key importance. Comparing with Lemma 1,

lim (zi − )P(Ain ) = (zi − )P(Ai ),

which implies limn→∞ E(Xn ) > m (zi − )P(Ai ) = m

So far we seen that expectation is linear. But what happen if we try to

Indeed, X is not necessarily > 0 so we can decompose it by

Example (Expectation of Binomial Rv’s)

= np(p + (1 − p))n−1 = np.

Let X ∼ N(µ, σ 2 ) be a Gaussian rv, Y ∼ U(a, b) be a uniform rv and Z ∼ Esp(λ) be

There exists a unique measure ` : B → [0, ∞] called the Lebesgue

Another important concept is that of a Borel-measurable function

We have to build a more general integration process starting from

Intermezzo 2: Uniform Probability Spaces

If B = {q1 , q2 , . . .} is a countable subset of (0, 1] we then have `(B) = 0.

Riemann vs Lebesgue Integration /1 *

must coincide as partitions Pn = {a = x0 < x1 < . . . < xn = b} refine

where g is a step function, i.e. g (x) = ci for x ∈ (xi−1 , xi ) so it is

Riemann Integration suffers Pathologies

does not always hold.

Riemann vs Lebesgue Integration /2 *

Riemann vs Lebesgue Integration /3 *

The limit above is understood as n → ∞ and the mesh

The supremum might be ∞.

Riemann vs Lebesgue Integration /4 *

separately to the positive part f + := max{0, f } and to the negative part

Write f = 1 · IQ∩[a,b] + 0 · I(Q∩[a,b])c (x). Then, compute the integral as

Indeed, Q ∩ [a, b] = {q1 , q2 , . . .} is the countable set of all rational points

Riemann vs Lebesgue Integration /5

Indeed, we have the following result:

Let f : [a, b] → R with −∞ < a < b < ∞.

Some Properties of the Lebesgue Integral

The Probabilistic Connection /1

It is apparent that the further advantage of Lebesgue integration is its applicability to

The Probabilistic Connection /2

Integrating over Subsets

Change of Variable in the Integral

Finally we arrive to:

provided that either side is well-defined.

Before to prove the Theorem above let you do the following:

lim (zi − )P(Ain ) = (zi − )P(Ai ),

which implies limn→∞ E(Xn ) > m (zi − )P(Ai ) = m