ORF309 Limit Theorems
Mark Cerenzia∗
1 Variance
We introduced the expectation of a random variable as a quantity representing the “typical size”
or “best guess” of a random variable X. At one extreme, if X is deterministic, it is in fact equal to
its expectation, i.e., X ≡ EX. At the other extreme, when X is "very random," it will often be far from EX. Thus, to capture "how random" a random variable is, the following quantity measures the discrepancy from its mean.
Definition 1.1. The variance of a random variable X is the mean squared distance from its mean:

Var(X) = σ_X^2 := E(X − EX)^2.

The quantity σ_X := √Var(X) is the standard deviation. More generally, the covariance of random variables X, Y is defined by

Cov(X, Y) = σ_{X,Y} := E[(X − EX)(Y − EY)].
We say that X, Y are uncorrelated if Cov(X, Y) = 0. This is implied by (but not equivalent to) the independence of X, Y, since E[g(X)h(Y)] = E[g(X)] · E[h(Y)] whenever X, Y are independent.
Of course, one could imagine many other measures of the dispersion of the values of X around its mean, but the variance is special in its many nice properties.
Lemma 1.1. The variance satisfies the following:

• If a, b ∈ R, then Var(aX + b) = a^2 Var(X).

• Var(X) ≥ 0.

• Var(X) = 0 if and only if X is deterministic, and thus X ≡ EX.

• Var(X) = E(X^2) − (EX)^2.

• Var(X + Y) = Var(X) + Var(Y) + 2 Cov(X, Y). Hence, if X, Y are uncorrelated or independent, then Var(X + Y) = Var(X) + Var(Y).
Much of the previous lemma may be proven relying on the relationship Var(X) = Cov(X, X), the fact that Cov(X, c) = 0 for a constant c, and finally the fact that (X, Y) ↦ Cov(X, Y) is bilinear, i.e.,

Cov(Σ_{i=1}^n a_i X_i, Σ_{j=1}^m b_j Y_j) = Σ_{i=1}^n Σ_{j=1}^m a_i b_j Cov(X_i, Y_j).
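As a quick numerical sanity check of these properties, here is a small Python/NumPy sketch added for illustration (the distributions and sample size are arbitrary choices, not part of the notes):

    import numpy as np

    rng = np.random.default_rng(0)
    N = 10**6                                        # number of simulated samples
    X = rng.exponential(scale=2.0, size=N)           # an arbitrary random variable X
    Y = rng.normal(loc=1.0, scale=3.0, size=N)       # independent of X

    a, b = 2.5, -1.0
    print(np.var(a * X + b), a**2 * np.var(X))       # Var(aX + b) vs a^2 Var(X)
    print(np.var(X), np.mean(X**2) - np.mean(X)**2)  # Var(X) vs E[X^2] - (EX)^2
    print(np.var(X + Y), np.var(X) + np.var(Y))      # independent => Var(X + Y) = Var(X) + Var(Y)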
Example 1.1. If τ ∼ Exp(λ), we can readily calculate E[τ] = 1/λ and Var(τ) = 1/λ^2. Further, if γ ∼ Γ(k, λ), then we saw γ can be realized as a sum of k-many independent Exp(λ) distributed random variables, and thus we immediately get Var(γ) = k/λ^2.
Similarly, if X1 , X2 , . . . form a Bernoulli process, then Var(Xi ) = p(1 − p). Hence, we have
Var(Sn ) = n · p(1 − p), where Sn := X1 + · · · + Xn .
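These formulas are easy to confirm numerically; the following sketch (added for illustration, with λ, k, n, p chosen arbitrarily) compares sample variances to 1/λ^2, k/λ^2, and np(1 − p):

    import numpy as np

    rng = np.random.default_rng(1)
    lam, k, n, p, N = 2.0, 5, 100, 0.3, 10**6

    tau = rng.exponential(scale=1/lam, size=N)        # tau ~ Exp(lam)
    gamma = rng.gamma(shape=k, scale=1/lam, size=N)   # gamma ~ Gamma(k, lam)
    S_n = rng.binomial(n, p, size=N)                  # S_n = X_1 + ... + X_n with X_i ~ Bernoulli(p)

    print(np.var(tau), 1/lam**2)                      # Var(tau)   = 1/lam^2
    print(np.var(gamma), k/lam**2)                    # Var(gamma) = k/lam^2
    print(np.var(S_n), n * p * (1 - p))               # Var(S_n)   = n p (1 - p)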
In particular, if each X_i indicates whether an event A occurs in the ith repetition of an experiment (so p = P(A)), then S_n/n is the fraction of times A occurs in n repetitions. Notice also that the expectation of S_n/n is given by E[S_n/n] = P(A). For small n, S_n/n can be quite far from this expected value P(A); however, as the number of experiments n grows, we will see the mass of the distribution of S_n/n "concentrates" around its mean P(A).
The Law of Large Numbers guarantees that S_n/n "converges" to P(A) as n → ∞. More generally, this theorem says that if X_1, X_2, . . . are i.i.d. random variables with the same distribution as some given random variable X, then the empirical average (1/n) Σ_{k=1}^n X_k over n experiments "converges" to E[X] with probability 1.
Notice that the result above suggests the random variables S_n/n concentrate around the deterministic constant E[X], so establishing it will require making precise the idea that S_n/n becomes less and less random; the previous section introduced the variance as exactly such a measure of randomness. However, we were quite vague about both the technical assumptions and what we mean by "converge." We will be more precise on these points as we develop two versions of the law of large numbers.
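Before making these notions precise, here is a small simulation sketch (added for illustration; Exp(1) samples, for which E[X] = 1, are an arbitrary choice) showing the running average S_n/n settling down near E[X]:

    import numpy as np

    rng = np.random.default_rng(2)
    X = rng.exponential(scale=1.0, size=10**5)              # i.i.d. Exp(1), so EX = 1
    running_avg = np.cumsum(X) / np.arange(1, X.size + 1)   # S_n / n for n = 1, ..., 10^5

    for n in (10, 100, 1000, 10**4, 10**5):
        print(n, running_avg[n - 1])                        # drifts toward EX = 1 as n grows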
2 The Law of Large Numbers
In this subsection, we sketch the proof of a version of the Weak Law of Large Numbers that takes "convergence" to be in the sense of mean square (for statisticians) or L^2-convergence (for mathematicians), i.e.,

E(S_n/n − EX)^2 → 0
as n → ∞. But since E[Sn /n] = EX, we immediately see that mean square convergence corresponds
to the variance vanishing:
E(S_n/n − EX)^2 = Var(S_n/n) = (1/n^2) Σ_{k=1}^n Var(X_k) = Var(X)/n → 0 as n → ∞,

using the variance properties from Lemma 1.1.
This establishes the Weak Law in the mean square sense. To upgrade to convergence with probability 1 (the Strong Law of Large Numbers), a natural route is the following criterion:

if Σ_{n=1}^∞ E(S_n/n − EX)^2 < +∞, then Σ_{n=1}^∞ (S_n/n − EX)^2 < +∞ w.p.1,

and thus we must have (S_n/n − EX)^2 → 0, and thus too |S_n/n − EX| → 0, w.p.1, as required.
Unfortunately, the first condition of the last display can never hold in our framework. Indeed, the astute reader will remember that E(S_n/n − EX)^2 = Var(S_n/n) = Var(X)/n, so the series appearing in the last display is proportional to the harmonic series Σ_{k=1}^∞ 1/k = +∞, which is famous for its divergence!
However, the series Σ_{k=1}^∞ 1/k^2 converges, so a flash of inspiration suggests that we square the terms of our series, i.e., we should sum the terms E(S_n/n − EX)^4 instead. This approach will require more assumptions, namely, EX^4 < +∞, and a little more work! First note that we can write

S_n/n − EX = (1/n) Σ_{k=1}^n (X_k − EX) = (1/n) Σ_{k=1}^n Z_k,

where we set Z_k := X_k − EX.
Expanding the fourth power and taking expectations gives

E(S_n/n − EX)^4 = (1/n^4) Σ_{k,ℓ,r,s=1}^n E[Z_k Z_ℓ Z_r Z_s] = (n · E[Z_1^4] + 3n(n − 1) · Var(X)^2) / n^4,

where the last equality needs to be checked. Since EZ_k = 0 and recalling that E[U · V] = EU · EV for independent random variables U, V, the only terms that survive are those for which k = ℓ = r = s (only n such terms) or for which the indices form two distinct pairs (only (4 choose 2) · n(n − 1)/2 = 3n(n − 1) such terms). Note the latter terms are of the form E[Z_k^2 Z_ℓ^2] = (E(X − EX)^2)^2 = Var(X)^2. This explains the last equality, and since the righthand side is of order 1/n^2, we can repeat the argument technique above by summing E(S_n/n − EX)^4 instead of E(S_n/n − EX)^2, which completes the proof!
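The key point, that E(S_n/n − EX)^4 is of order 1/n^2, can be checked by simulation; in the following sketch (added for illustration) the X_k are Uniform(0, 1), so EX = 1/2:

    import numpy as np

    rng = np.random.default_rng(3)
    trials = 10000
    for n in (10, 40, 160, 640):
        X = rng.uniform(0, 1, size=(trials, n))        # i.i.d. Uniform(0,1) samples, EX = 1/2
        fourth = np.mean((X.mean(axis=1) - 0.5) ** 4)  # Monte Carlo estimate of E(S_n/n - EX)^4
        print(n, fourth, fourth * n**2)                # n^2 * E(S_n/n - EX)^4 stays roughly constant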
Example 2.1. A real number is called a normal number if the limiting relative frequency of every digit d ∈ {0, . . . , 9} in its decimal expansion is 1/10. Intuitively, no digit occurs more frequently than any other. We will use the strong law of large numbers to immediately prove Borel's normal number theorem: if X is uniform in (0, 1), then X is a normal number with probability 1.
The amazing part of this theorem is that only a few specific numbers have been shown to be normal (concatenating the natural numbers or the prime numbers is known to produce a normal number, the latter case being a theorem of Copeland-Erdős), and according to Wikipedia, the classic numbers √2, π, e are believed to be normal, but there is still no proof of this fact!
To prove the normal number theorem, write the decimal expansion of X as X = 0.X_1 X_2 X_3 . . . = Σ_{k=1}^∞ X_k/10^k. Fix d ∈ {0, . . . , 9}. Then the sequence Y_n := 1_{X_n = d}, n ≥ 1, are i.i.d. with mean μ = EY_n = P(X_n = d) = 1/10. Hence, the strong law of large numbers applies to let us conclude that the empirical average (1/n) Σ_{k=1}^n Y_k, which is also the relative frequency of d among the first n digits of the decimal expansion, converges almost surely and in mean square to 1/10, as required.
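The digits of a Uniform(0, 1) random number behave like i.i.d. uniform draws from {0, . . . , 9}, so the conclusion is easy to see in simulation (a sketch added for illustration; the digit d = 7 is arbitrary):

    import numpy as np

    rng = np.random.default_rng(4)
    d, n = 7, 10**6
    digits = rng.integers(0, 10, size=n)   # i.i.d. uniform digits X_1, ..., X_n
    Y = (digits == d)                      # Y_k = 1_{X_k = d}
    print(Y.mean())                        # relative frequency of d; close to 1/10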
3 The Central Limit Theorem

We will see from the Central Limit Theorem that the hump or "bell" in the probability mass described above can always be described by the function exp(−x^2/2).
Definition 3.1. A continuous random variable X is said to be normally or Gaussian distributed with mean μ ∈ R and variance σ^2, written X ∼ N(μ, σ^2), if its probability density function is given by

f_X(x) = (1/√(2πσ^2)) · exp(−(x − μ)^2/(2σ^2)), x ∈ R.
We refer to X as standard normal if it has mean zero µ = 0 and unit variance σ 2 = 1.
But how do we zero in on this shape? On the one hand, Var(S_n) = n · Var(X), which grows linearly with n (this is "zooming in"); on the other hand, by the law of large numbers, we know that the distribution of S_n/n = (1/n) Σ_{k=1}^n X_k concentrates around the constant EX, reflected by the fact that Var(S_n/n) = Var(X)/n vanishes (this is "zooming out"). The appropriate scaling to zoom in or out just the right amount to retain the hump or "bell" shape exp(−x^2/2) is found by noticing that Var(S_n/√n) = Var(X) is constant. Hence, the scaling (1/√n) Σ_{k=1}^n X_k plays a prominent role in the next statement.
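These three scalings can be compared numerically; the sketch below (added for illustration) uses Exp(1) summands, so Var(X) = 1, and exploits the fact from Example 1.1 that S_n ∼ Γ(n, 1):

    import numpy as np

    rng = np.random.default_rng(5)
    trials = 10**5
    for n in (10, 100, 1000):
        S = rng.gamma(shape=n, scale=1.0, size=trials)   # S_n for Exp(1) summands (S_n ~ Gamma(n, 1))
        print(n, np.var(S), np.var(S / n), np.var(S / np.sqrt(n)))
        # roughly:  n,        1/n,          1 (constant)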
Theorem 3.1 (Central Limit Theorem). Let X_1, X_2, . . . be i.i.d. random variables with mean μ ∈ R and variance 0 < σ^2 < ∞. Write the running empirical average as X̄_n := S_n/n = (1/n) Σ_{k=1}^n X_k. Writing μ_n := μ_{S_n} = ES_n = n · μ and σ_n := σ_{S_n} = √Var(S_n) = √n · σ, consider the z-score (intuitively, we are zooming in by a factor √n to see how the empirical average "fluctuates" about its mean)

Z_n := (S_n − μ_n)/σ_n = √n (X̄_n − μ)/σ.
Then we have
Probability distribution of Z_n → Z ∼ N(0, 1) as n → ∞.
Put another way, no longer scaling out σ^2, if Y ∼ N(0, σ^2), then for any real numbers a < b,

P(a < √n (X̄_n − μ) ≤ b) → P(a < Y ≤ b) = (1/√(2πσ^2)) ∫_a^b exp(−x^2/(2σ^2)) dx as n → ∞.
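The statement can be tested directly by simulation; in the sketch below (added for illustration) the X_k are Exp(1), so μ = σ = 1, and the endpoints a, b are arbitrary:

    import math
    import numpy as np

    rng = np.random.default_rng(6)
    n, trials = 400, 10**5
    mu, sigma = 1.0, 1.0                                    # Exp(1) has mean 1 and variance 1
    a, b = -1.0, 0.5

    Xbar = rng.gamma(shape=n, scale=1.0, size=trials) / n   # empirical averages of n Exp(1) samples
    Zish = np.sqrt(n) * (Xbar - mu)
    lhs = np.mean((a < Zish) & (Zish <= b))                 # estimate of P(a < sqrt(n)(Xbar_n - mu) <= b)

    def Phi(z):
        # standard normal cdf via erf
        return 0.5 * (1 + math.erf(z / math.sqrt(2)))

    rhs = Phi(b / sigma) - Phi(a / sigma)                   # P(a < Y <= b) for Y ~ N(0, sigma^2)
    print(lhs, rhs)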
Definition 3.2. The standard deviation of a random variable X is defined as σ = √Var(X).
Remark. In 1733, de Moivre introduced the 68-95-99.7 rule, a really useful fact to keep in mind when working with normal distributions. The rule states that 68% of the mass falls within one standard deviation of the mean, 95% within two standard deviations of the mean, and 99.7% within three standard deviations of the mean. More precisely, if X ∼ N(μ, σ^2), we have

P(μ − kσ < X < μ + kσ) ≈ 68.27% for k = 1, 95.45% for k = 2, 99.73% for k = 3.
One can additionally use the symmetry of the normal distribution about its mean to further reason about how mass is distributed near the mean.
Of course, we also know that the calculation ∫_{−∞}^{∞} exp(−(x − μ)^2/(2σ^2)) dx = √(2πσ^2) holds, since f_X must be a pdf. However, general integrals involving exp(−x^2/2), like the one appearing in the statement of the central limit theorem, cannot be evaluated explicitly. One must settle either for estimates using the 68-95-99.7 rule or for numerical tables/computer functions (see "erf") that provide (very good) approximations of such integrals.
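For instance, Python's standard math.erf gives the normal cdf directly and reproduces the 68-95-99.7 figures (a short sketch added for illustration):

    import math

    def normal_cdf(x, mu=0.0, sigma=1.0):
        # P(X <= x) for X ~ N(mu, sigma^2), expressed via erf
        return 0.5 * (1 + math.erf((x - mu) / (sigma * math.sqrt(2))))

    for k in (1, 2, 3):
        # P(mu - k*sigma < X < mu + k*sigma); the answer does not depend on mu, sigma
        print(k, normal_cdf(k) - normal_cdf(-k))   # ~0.6827, 0.9545, 0.9973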
The central limit theorem is perhaps the most famous example of the concept of “universality”
from physics: a wide variety of random phenomena exhibit the normal distribution. Put another
way, the normal distribution is “attracting” in some sense, arising in just about any random model
you can think of as long as you know where to look. For a small list: the weight or height of a man or woman, measurement errors, molecules, growth distributions of plants/animals/organs, etc. For example, the first of these is the result of many environmental and genetic factors each contributing a small amount, so we expect that polling the weights or heights of a random sample of a given gender (mixing genders here can cause issues with this approximation) will obey a normal distribution.
We turn to some examples.
Example 3.1. The rates of return for a stock are i.i.d. R_1, R_2, . . ., each equally likely to assume the two values .30 or −.25. An investor notes that the expected rate of return on the ith trading day is positive, ER_i = .025, so she invests c dollars in it. After the first trading day, the value of her shares becomes c · (1 + R_1), which she reasons she expects will be c · 1.025. Continuing to reinvest, the value of her shares becomes

c · (1 + R_1) · · · (1 + R_n)

on the nth trading day. By independence, she expects her investment growth will be c · (1.025)^n, which is exponential in n, so her returns should be to the moon.
Unfortunately, there is a very high probability (essential certitude) that the value can become arbitrarily small. Define X_i := ln(1 + R_i) and write μ = EX_i = −0.127 < 0, σ^2 = Var(X_i) = 0.0597 > 0. Letting δ ∈ (0, 1) be small, we have

P(c · (1 + R_1) · · · (1 + R_n) ≤ δ · c) = P(X_1 + · · · + X_n ≤ ln δ) = P((S_n − nμ)/(σ√n) ≤ (ln δ − nμ)/(σ√n)) ≈ (1/√(2π)) ∫_{−∞}^{(ln δ − nμ)/(σ√n)} exp(−x^2/2) dx,
where the last approximation is accurate, by the central limit theorem, for n large enough. Since μ is negative, the upper limit of integration gets arbitrarily large, and thus this probability can get as close to one as we like regardless of how small δ > 0 is! Indeed, if we take δ = .10 and compute with the relevant values μ, σ^2, one can see from numerical tables that the right-hand side will be over .99 after n = 50 trading days. Put another way, with near certainty, the value of the shares held by the investor will have fallen to 10% of her original investment after 50 days despite the positive EV (expected value). This suggests a more depressing conclusion: the initial investment will eventually disappear!
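Plugging in δ = .10, n = 50, and the values of μ and σ^2 quoted above, the normal approximation can be evaluated with erf instead of tables (a sketch added for illustration; the numerical values are taken from the example as stated):

    import math

    mu, sigma2 = -0.127, 0.0597            # values as quoted in the example
    delta, n = 0.10, 50
    sigma = math.sqrt(sigma2)

    z = (math.log(delta) - n * mu) / (sigma * math.sqrt(n))   # upper limit of integration
    prob = 0.5 * (1 + math.erf(z / math.sqrt(2)))             # P(share value <= delta * c)
    print(z, prob)                                            # prob comes out above .99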
Example 3.2 (Stirling's Formula). Let X_1, X_2, . . . be i.i.d. Poisson random variables with rate λ = 1. Recall that μ := EX_i = λ = 1 and σ^2 := Var(X_i) = λ = 1. As usual, write S_n = Σ_{k=1}^n X_k. We saw using the theory of Poisson processes that S_n also has a Poisson distribution, with rate λ = n. Thus, we have

P(S_n = n) = e^{−n} · n^n / n!.
However, we can rewrite this probability as follows: since S_n is integer-valued with mean n and standard deviation √n,

P(S_n = n) = P(n − 1 < S_n ≤ n) = P(−1/√n < (S_n − n)/√n ≤ 0) ≈ Φ(0) − Φ(−1/√n) ≈ (1/√(2π)) · (1/√n),

where Φ denotes the standard normal cdf. Combining this with the exact formula above yields e^{−n} · n^n / n! ≈ 1/√(2πn), i.e., Stirling's formula n! ≈ √(2πn) (n/e)^n.
We remark that this example is a bit circular, since the first proof of the central limit theorem (the de Moivre-Laplace theorem, which treats the special case where the X_i form a Bernoulli process) relied on Stirling's formula. Nevertheless, most modern proofs of the CLT do not explicitly use this formula, and further, this argument provides a probabilistic interpretation of the formula, thus also serving as a mnemonic device.
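A quick numerical check of the resulting approximation n! ≈ √(2πn) (n/e)^n (a sketch added for illustration):

    import math

    for n in (1, 5, 10, 50, 100):
        stirling = math.sqrt(2 * math.pi * n) * (n / math.e) ** n
        print(n, math.factorial(n) / stirling)   # ratio tends to 1 (roughly 1 + 1/(12n))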