ee5110-lecture-limit-theorems
2. Chebyshev Inequality
• “If the variance of X is small, then X is unlikely to take values away
from the mean”
• If X is a random variable with mean µ and variance σ², then
$$P(|X - \mu| \ge a) \le \frac{\sigma^2}{a^2} \quad \text{for all } a > 0$$
• Example: Let X be uniformly distributed in the interval [0, 4]. Find a bound on P(X ≥ 2), P(X ≥ 3) and P(X ≥ 4) using the Chebyshev inequality. (Ans: 1, 1 and 1/3.)
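A quick Monte Carlo check of this example (a sketch assuming NumPy; the sample size and seed are arbitrary):

```python
import numpy as np

# Chebyshev check for X ~ Uniform[0, 4]: mu = 2, sigma^2 = (4 - 0)^2 / 12 = 4/3.
rng = np.random.default_rng(0)
x = rng.uniform(0, 4, size=1_000_000)
mu, var = 2.0, 4 / 3

for t in [2, 3, 4]:
    a = t - mu                                        # P(X >= t) <= P(|X - mu| >= a)
    bound = 1.0 if a <= 0 else min(1.0, var / a**2)   # bound is vacuous when a = 0
    print(f"P(X >= {t}): empirical = {np.mean(x >= t):.3f}, Chebyshev bound = {bound:.3f}")
```

The printed bounds are 1, 1 and 1/3, against exact probabilities 0.5, 0.25 and 0.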
3. Weak Law of Large Numbers
• “Sample mean of i.i.d. r.v.s is likely to be close to the true mean”
• Let X1 , X2 , · · · be i.i.d. r.v.s with mean µ and variance σ². Define the sample mean as
$$\bar{X}_n = \frac{1}{n}\sum_{i=1}^{n} X_i$$
For every a > 0, we have
$$P(|\bar{X}_n - \mu| \ge a) = P\left(\left|\frac{X_1 + \cdots + X_n}{n} - \mu\right| \ge a\right) \to 0 \quad \text{as } n \to \infty$$
Figure 1: Distribution of X̄n for different values of n (n = 10, 20, 30, 40, 60 and 100) when the Xi are Bernoulli with p = 1/2. Observe that the sample mean concentrates around p = 1/2.
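A short simulation in the spirit of Figure 1 (a sketch assuming NumPy; the values of n, the tolerance a, and the number of runs are illustrative choices):

```python
import numpy as np

# WLLN check: for Bernoulli(1/2) samples, P(|X_bar_n - mu| >= a) shrinks with n.
rng = np.random.default_rng(1)
mu, a, runs = 0.5, 0.05, 10_000

for n in [10, 100, 1_000, 10_000]:
    xbar = rng.binomial(n, mu, size=runs) / n    # sample means across many runs
    print(f"n = {n:5d}: P(|X_bar_n - mu| >= {a}) ≈ "
          f"{np.mean(np.abs(xbar - mu) >= a):.4f}")
```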
4. Example: Consider an event A with probability p = P(A), and perform n independent trials of the underlying experiment. Let Mn be the fraction of time that event A occurs; Mn is often called the empirical frequency of A. The weak law applies and shows that when n is large, the empirical frequency is, with high probability, within ϵ of p. Empirical frequencies are faithful estimates of the probability of an event!
5. Example: Let p be the fraction of voters who support a particular candidate for office. We interview n “randomly selected” voters and record Mn , the fraction of them that support the candidate. Show that with a sample size of n = 100, the probability that our estimate of p is off by more than 0.1 (accuracy) is no larger than 0.25 (confidence). Suppose we would like to have high confidence (probability at least 95%) that our estimate is very accurate (within 0.01 of p). How many voters should be sampled? (Hint: 50,000.)
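A worked computation of both parts, using the bound P(|Mn − p| ≥ ϵ) ≤ Var(Mn)/ϵ² ≤ 1/(4nϵ²), since Var(Mn) = p(1 − p)/n ≤ 1/(4n) (plain Python; a sketch, not part of the original notes):

```python
import math

def chebyshev_failure_bound(n, eps):
    """Bound on P(|M_n - p| >= eps), using Var(M_n) = p(1-p)/n <= 1/(4n)."""
    return min(1.0, 1 / (4 * n * eps**2))

# Part 1: n = 100 voters, accuracy 0.1 -> bound 0.25.
print(chebyshev_failure_bound(100, 0.1))

# Part 2: smallest n with failure probability <= 0.05 at accuracy 0.01.
eps, delta = 0.01, 0.05
print(math.ceil(1 / (4 * delta * eps**2)))       # 50000
```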
2 Convergence in Probability
1. Convergence in Probability:
• Let Y1 , Y2 , · · · be a sequence of random variables. We say that Yn
converges to a (a real number) in probability if, for every ϵ > 0,
$$\lim_{n \to \infty} P(|Y_n - a| \ge \epsilon) = 0$$
In this case, we write $Y_n \xrightarrow{\text{i.p.}} a$ or $Y_n \xrightarrow{P} a$.
– If Yn (ω) → a for all ω ∈ Ω, we say Yn converges pointwise to
a. Here, the notion of convergence considered is the standard
definition of convergence of a sequence of real numbers.
2. Example: Let P(Yn = 1) = 1/n = 1 − P(Yn = 0). Then, $Y_n \xrightarrow{P} 0$.
3. Example: Let P(Yn = n) = 1/n = 1 − P(Yn = 0). Then, $Y_n \xrightarrow{P} 0$, even though E[Yn ] = 1 for all n.
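A small simulation of Example 3 above (a sketch assuming NumPy), showing that Yn concentrates at 0 while its mean stays at 1:

```python
import numpy as np

# Example 3: P(Y_n = n) = 1/n, P(Y_n = 0) = 1 - 1/n.
# Y_n -> 0 in probability, yet E[Y_n] = n * (1/n) = 1 for every n.
rng = np.random.default_rng(2)

for n in [10, 100, 1_000]:
    y = np.where(rng.random(100_000) < 1 / n, n, 0)   # draws of Y_n
    print(f"n = {n:4d}: P(Y_n >= 0.5) ≈ {np.mean(y >= 0.5):.4f}, "
          f"E[Y_n] ≈ {np.mean(y):.3f}")
```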
4. Example: (WLLN) The sample mean {X̄n } converges to the true mean µ in probability: $\bar{X}_n \xrightarrow{P} \mu$.
5. Example: Let X1 , X2 , · · · be i.i.d. uniform random variables on [0, 1]. Define, for all n, Yn = min(X1 , · · · , Xn ). Then, $Y_n \xrightarrow{\text{i.p.}} 0$.
6. Example: Let X1 , X2 , · · · be i.i.d. uniform random variables on [0, 1]. Define, for all n, Yn = max(X1 , · · · , Xn ). Then, $Y_n \xrightarrow{\text{i.p.}} 1$.
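A direct check of Examples 5 and 6 (a sketch assuming NumPy; thresholds and sample sizes are arbitrary):

```python
import numpy as np

# min and max of n i.i.d. Uniform[0,1] variables: the min drifts to 0
# and the max drifts to 1 as n grows.
rng = np.random.default_rng(3)

for n in [10, 100, 1_000]:
    x = rng.random((10_000, n))                # 10,000 runs of n uniforms each
    mins, maxs = x.min(axis=1), x.max(axis=1)
    print(f"n = {n:4d}: P(min >= 0.05) ≈ {np.mean(mins >= 0.05):.4f}, "
          f"P(max <= 0.95) ≈ {np.mean(maxs <= 0.95):.4f}")
```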
7. Properties: Let Xn → a and Yn → b in probability. Then,
• Xn + Yn → a + b in probability
• If g(·) is continuous, then g(Xn ) → g(a) in probability
• E[Xn ] need not converge to a
8. Let Y1 , Y2 , · · · be a sequence of random variables. We say that Yn con-
verges to Y (a random variable) in probability if, for every ϵ > 0,
$$\lim_{n \to \infty} P(|Y_n - Y| \ge \epsilon) = 0$$
In this case, we write $Y_n \xrightarrow{\text{i.p.}} Y$ or $Y_n \xrightarrow{P} Y$.
• Convergence in probability characterizes the behaviour of the se-
quence of random variables {Yn } individually (in terms of their marginal
distributions) and their relationship to a limiting random variable
Y , but it does not imply much about the joint distribution of the
sequence {Yn }.
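As a concrete instance (a hypothetical example, not from the notes): Yn = Y + Wn /n, with Wn i.i.d. standard normal noise, converges in probability to the random variable Y:

```python
import numpy as np

# Y_n = Y + W_n / n converges in probability to Y: the gap |Y_n - Y| = |W_n| / n
# exceeds any fixed eps with vanishing probability as n grows.
rng = np.random.default_rng(4)
y = rng.standard_normal(100_000)               # the limiting random variable Y

for n in [1, 10, 100]:
    yn = y + rng.standard_normal(100_000) / n  # Y_n for this n
    print(f"n = {n:3d}: P(|Y_n - Y| >= 0.1) ≈ {np.mean(np.abs(yn - y) >= 0.1):.4f}")
```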
3 Convergence in Distribution
1. A sequence of random variables Y1 , Y2 , · · · is said to converge to a random variable Y in distribution if
$$\lim_{n \to \infty} F_{Y_n}(y) = F_Y(y)$$
at every point y where F_Y is continuous. In this case, we write $Y_n \xrightarrow{D} Y$.
2. Central Limit Theorem: Let X1 , X2 , · · · be i.i.d. random variables with mean µ and variance σ². Note that
• $\frac{1}{n}\sum_{i=1}^{n}(X_i - \mu)$ has variance $\frac{\sigma^2}{n}$
• $\frac{1}{\sqrt{n}}\sum_{i=1}^{n}(X_i - \mu)$ has variance $\sigma^2$
Define $Z_n = \frac{1}{\sigma\sqrt{n}}\sum_{i=1}^{n}(X_i - \mu)$, so that Zn has zero mean and unit variance. The CDF of Zn converges to the standard normal CDF, i.e.,
$$Z_n \xrightarrow{D} Z, \quad \text{where } Z \sim N(0, 1).$$
3. CLT is quite general and very useful!
• The (standardized) sum of a large number of i.i.d. random variables is approximately normal, regardless of the distribution of the individual terms.
4. Characteristic function of a random variable X:
$$\Phi_X(t) = E\left[e^{itX}\right] = \begin{cases} \int_{-\infty}^{\infty} e^{itx} f_X(x)\, dx, & X \text{ is continuous} \\ \sum_{x} e^{itx} p_X(x), & X \text{ is discrete} \end{cases}$$
$$\Phi_X^{(k)}(0) = \left. \frac{d^k \Phi_X(t)}{dt^k} \right|_{t=0} = i^k E[X^k]$$
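A numerical sanity check of the moment identity (a sketch assuming NumPy; Uniform[0, 1] is an arbitrary test case, with E[X] = 1/2 and E[X²] = 1/3):

```python
import numpy as np

# Check Phi_X^{(k)}(0) = i^k E[X^k] for X ~ Uniform[0, 1], estimating
# Phi_X(t) = E[exp(itX)] by Monte Carlo and derivatives by finite differences
# (the same samples are reused across t, so the differences cancel noise).
rng = np.random.default_rng(5)
x = rng.random(1_000_000)
phi = lambda t: np.mean(np.exp(1j * t * x))   # empirical characteristic function

h = 1e-3
d1 = (phi(h) - phi(-h)) / (2 * h)             # ≈ Phi'(0)  = i * E[X]
d2 = (phi(h) - 2 * phi(0) + phi(-h)) / h**2   # ≈ Phi''(0) = -E[X^2]
print("E[X]   ≈", (d1 / 1j).real)             # ~0.5
print("E[X^2] ≈", (-d2).real)                 # ~0.333
```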
5. Proof of CLT:
Given $Z_n = \frac{1}{\sqrt{n}}\sum_{i=1}^{n}\frac{X_i - \mu}{\sigma}$.
Consider $Y_1 = \frac{X_1 - \mu}{\sigma}$, which is a zero-mean random variable with unit variance. Using a Taylor series expansion around 0, we have
$$\Phi_{Y_1}(t) = \Phi_{Y_1}(0) + \Phi_{Y_1}^{(1)}(0)\, t + \Phi_{Y_1}^{(2)}(0)\, \frac{t^2}{2!} + o(t^2) = 1 + 0 - \frac{t^2}{2} + o(t^2)$$
Then, $\Phi_{Z_n}(t)$ is
$$\Phi_{Z_n}(t) = \Phi_{Y_1}^{n}\left(\frac{t}{\sqrt{n}}\right) = \left(1 - \frac{t^2}{2n} + o\left(\frac{t^2}{n}\right)\right)^{n} \to e^{-t^2/2} = \Phi_Z(t)$$
where Z ∼ N(0, 1). This (using Lévy's continuity theorem) implies that Zn converges in distribution to Z, a standard normal random variable.
6. Illustration of Convolution and CLT:
Figure 3: Illustration of the CLT. The figure plots the density of the (scaled) sum of i.i.d. random variables for different values of n, for three different starting densities.
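A compact simulation in the spirit of Figure 3 and the proof above (a sketch assuming NumPy; Exponential(1), with µ = σ = 1, is an arbitrary starting density):

```python
import numpy as np

# Z_n = (1/sqrt(n)) * sum of (X_i - mu)/sigma for X_i i.i.d. Exponential(1):
# its statistics approach those of N(0, 1) as n grows.
rng = np.random.default_rng(6)

for n in [1, 5, 50, 500]:
    x = rng.exponential(1.0, size=(100_000, n))
    zn = (x - 1.0).sum(axis=1) / np.sqrt(n)      # standardized sum
    print(f"n = {n:3d}: mean ≈ {zn.mean():+.3f}, var ≈ {zn.var():.3f}, "
          f"P(Z_n <= 1) ≈ {np.mean(zn <= 1):.3f} (normal: 0.841)")
```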
5 Convergence with Probability One
1. Let Y1 , Y2 , · · · be a sequence of random variables. We say that Yn con-
verges to another random variable Y with probability one (or almost
surely) if
P ({ω : Yn (ω) → Y (ω)}) = 1
If Y = c, then we say $Y_n \xrightarrow{\text{w.p.1}} c$ or $Y_n \xrightarrow{\text{a.s.}} c$.
Proof (that X̄n → µ with probability one): Suppose that the fourth moment of Xn exists, i.e., E[Xn⁴] = K < ∞. Without loss of generality, assume that µ = 0. Define $S_n = \sum_{i=1}^{n} X_i$. Then,
$$E[S_n^4] = n\, E[X_1^4] + 6\binom{n}{2} E^2[X_1^2] \le nK + 3n(n-1)K,$$
since $E^2[X_1^2] \le E[X_1^4] = K$. Dividing by n⁴, we have
$$E\left[\frac{S_n^4}{n^4}\right] \le \frac{K}{n^3} + \frac{3K}{n^2}$$
and,
$$\sum_{n=1}^{\infty} E\left[\frac{S_n^4}{n^4}\right] \le \sum_{n=1}^{\infty}\left(\frac{K}{n^3} + \frac{3K}{n^2}\right) < \infty$$
so that $\sum_n S_n^4/n^4$ is finite w.p.1. Consequently, $S_n^4/n^4 \to 0$ w.p.1, and $S_n/n \to 0$ w.p.1.
Hence, X̄n → µ with probability one.
7. Some properties
• Convergence with probability one implies convergence in probabil-
ity, but convergence in probability does not imply convergence with
probability one.
• Let Xn → a and Yn → b w.p.1.
– Xn + Yn → a + b w.p.1.
– If g(·) is continuous, then g(Xn ) → g(a) w.p.1.
– E[Xn ] need not converge to a
8. For the experiments given below, compare the time average of the outcome
with the expected outcome (ensemble average) at any time n.
(a) X1 is Bernoulli with mean 0.5, and X2 = X1 , X3 = X1 , X4 = X1 , · · ·
(b) X1 is Bernoulli with mean 0.5, and Xi = X1 if i is odd, Xi = 1 − X1 if i is even.
(c) {Xn } are i.i.d. Bernoulli with mean 0.5.
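A simulation sketch of the three experiments (assuming NumPy; it prints the time average after n steps against the ensemble average of 0.5):

```python
import numpy as np

# Time average (1/n) * sum_{i<=n} X_i for each experiment. The ensemble
# average E[X_n] = 0.5 in all three cases, but only in (b) and (c) does the
# time average settle at 0.5 -- in (a) it sticks at X_1 (0 or 1).
rng = np.random.default_rng(7)
n = 10_000
x1 = rng.integers(0, 2)

a = np.full(n, x1)                                        # (a) X_i = X_1 forever
b = np.where(np.arange(1, n + 1) % 2 == 1, x1, 1 - x1)    # (b) alternating X_1, 1 - X_1
c = rng.integers(0, 2, size=n)                            # (c) i.i.d. Bernoulli(1/2)

for name, seq in [("a", a), ("b", b), ("c", c)]:
    print(f"({name}) time average after n = {n}: {seq.mean():.3f} (ensemble average: 0.5)")
```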