0% found this document useful (0 votes)

7 views32 pages

8 Stat Rec

statistica

Uploaded by

Alice Rossi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views32 pages

8 Stat Rec

statistica

Uploaded by

Alice Rossi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 32

How to use CLT

An insurance company has 10,000 automobile policyholders. If the

expected yearly claim per policyholder is 260 with a standard
deviation of 800, approximate the probability that the total yearly
claim exceeds 2.8 million.
10000

claims Ci
S sum fall
is
8002
E Ci 260 Vki
i th customer
Ci claim for
the distribution
CLT to approximate
use
We can
apox
of
eeEz
E.si 196 / 218
2 28
P P 5 89
5
90
2800000 1

I 7 28 1 112 2.5

1 2.5
2.5
1 0.9938
0.0062
Example
Darth Vader wants to measure the distance between the Death
Star and Tatooine. However, due to atmospheric disturbances,
measurements will not yield the exact distance d. As a result,
Vader has decided to make a series of 36 measurements and then
use their average value as an estimate of the actual distance.
Assume that the values of the successive measurements are
independent random variables with a mean of d light years and a
standard deviation of 2 light years.
I Approximate the probability that the estimated value of the
distance will be within 0.5 light-years from d.

197 / 218
X X measurements E Xi d
36
V Xi 4
x̅ É estimation of distance
rgeenug.hr
gzm36is
meaning that CIT can be applied

P ix also 5 Ñ
f

P P 121 1.5
15 15
P 1.5 251.5 1.51 81 1.5 2811.5 1
811.57
1 2 0.9332 1
0.8664
Example
Darth Vader wants to measure the distance between the Death
Star and Tatooine. However, due to atmospheric disturbances,
measurements will not yield the exact distance d. As a result,
Vader has decided to make a series of 36 measurements and then
use their average value as an estimate of the actual distance.
Assume that the values of the successive measurements are
independent random variables with a mean of d light years and a
standard deviation of 2 light years.
I Approximate the probability that the estimated value of the
distance will be within 0.5 light-years from d.
I How many measurements Vader needs in order to be at least
95% certain that his estimate is accurate to within 0.5 light
years?

198 / 218
We consider now n measurements with n
generic

x̅ NON

0.95 P IX d 40.5

P P 12K
0.95
15 14 1
Pl Fs Z E 06
20141 1
0.95428141 1

1.955281 0.975581
0.025

1.96 0.025
70.02s

1.96 Vn 37.84 ns 7.84

How large is “large n”?
How large n should be to have a good approximation depends on
the shape of the population distribution. According to the textbook

A general rule of thumb is that you can be confident of the normal

approximation whenever the sample size n is at least 30.
In most cases the normal approximation is valid for much smaller
sample sizes.
Indeed, usually a sample size of 5 will suffice for the approximation
to be valid.

(I would be a little more cautious about this last statement)

199 / 218
How large is “large n”?
How large n should be to have a good approximation depends on
the shape of the population distribution. According to the textbook

A general rule of thumb is that you can be confident of the normal

(I would be a little more cautious about this last statement)

NOTE: If the population is Normal, then X̄ is normal for all n.

(this is not an approximation from CLT,

it is due to the additive property of normal random variables)
200 / 218
Normal approximation to the Binomial distribution
One of the first important applications of the CLT was related to
Binomial random variables.
We know that a Binomial random variable X with parameters
(n, p) can be expressed as a sum of n independent random variables
X = E 1 + E2 + · · · + E n
with E [Ei ] = p and V [Ei ] = p(1 p).

201 / 218
Normal approximation to the Binomial distribution
One of the first important applications of the CLT was related to
Binomial random variables.
We know that a Binomial random variable X with parameters
(n, p) can be expressed as a sum of n independent random variables
X = E 1 + E2 + · · · + E n
with E [Ei ] = p and V [Ei ] = p(1 p).

As a consequence of the CLT, when n is large X is approx normal

with expectation np and variance np(1 p).
Equivalently,
X
X np p
p and qn
np(1 p) p(1 p)
n
are approximately standard normal.
203 / 218
Problem
Suppose that 60 percent of the residents of a city are in favor of
teaching evolution in high school.
1. Determine expected value and standard deviation of the
proportion of a random sample of size n that is in favor when
n = 10 n = 100 n = 1000 n = 10000

successes in the sample

number of

sample proportion
1 p
E XJ np V XJ np
1 P P
P
E ftp.np PVCEJ f.M
204 / 218
Expectation of sample proportion is
0.6
p for any possible n

variance of sample proportion is

becomes smaller and

PII 021 smaller as n increases

of course standard deviation is Ivorience

Problem
Suppose that 60 percent of the residents of a city are in favor of
teaching evolution in high school.
1. Determine expected value and standard deviation of the
proportion of a random sample of size n that is in favor when
n = 10 n = 100 n = 1000 n = 10000

2. Find the probability that over 55 percent of the members of

the sample are in favor of the proposal if the sample size is
n = 10 n = 100 n = 1000 n = 10000

205 / 218
n 10

P 0 55 P X 3 5 5

P x 6 P X 7 P X 10
10 R

6.61 0.4
K G

or d binom 6 size 10
prob 0.6

8
For n too

P 0.55 P X 355

100 k

R
Ei E
55
0.6 0.4

or we can use normal approximation

9 tox
No
P P
I
055

P 2 1
It I
Continuity correction for Normal approx to Binomial
When using Normal approximation to Binomial, note that:
since the normal is a continuous random variable,
P(X = i) would always be approximated as 0
even if it’s strictly positive (because Bernoulli is discrete).

206 / 218
Continuity correction for Normal approx to Binomial
When using Normal approximation to Binomial, note that:
since the normal is a continuous random variable,
P(X = i) would always be approximated as 0
even if it’s strictly positive (because Bernoulli is discrete).

To overcome this problem, it is best to compute

P(X = i) = P(i 0.5 < X  i + 0.5)
This is called the continuity correction.

207 / 218
Example
Suppose for a Binomial (n = 100, p = 0.40) you need to
approximate P(35  X  40):
P(35  X  40) = P(34.5  X  40.5)
!
34.5 40 X np 40.5 40
= P p p  p
24 np(1 p) 24
' P ( 1.12  Z  0.10)
= = (0.10) ( 1.12)
= (0.10 (1 (1.12))
= 0.5398 (1 0.8686) = 0.4084

36 P X 60
Pl X 35 P
39.52 560.5
34.5 2 335.5 P 35.52 536.5
208 / 218
Summary of sample mean properties
No matter what the population distribution is, denote
µ = the population expectation
2
= the population variance
X1 + · · · + Xn
then the sample mean X̄ = will have
n
I E [X̄ ] = µ
I V [X̄ ] = n
2

I CLT: for large n, the distribution is approximately normal

209 / 218
Expectation of the sample variance S 2
1 Pn
Remember: S2 = n 1 i=1 (Xi X̄ )2

It is possible to prove that E [S 2 ] = 2

(no time to do it in class, if interested see textbook for details)

this explains the denominator n 1

210 / 218
Sampling from a normal population
When the population is normally distributed,
I We have seen that the sample mean X̄ is normal for all n:
X̄ µ
p is standard normal for all n
/ n

211 / 218
Sampling from a normal population
When the population is normally distributed,
I We have seen that the sample mean X̄ is normal for all n:
X̄ µ
p is standard normal for all n
/ n
I Now we discuss a result that permits to obtain probabilities
Pn 2
2 i=1 (Xi X̄ )
for the sample variance S = n 1 :
(n 1)S 2 2
2
has n 1 distribution

xi
t.fi
f

NOTE EI 22 YESTERDAY
212 / 218
Sampling from a normal population
When the population is normally distributed,
I We have seen that the sample mean X̄ is normal for all n:
X̄ µ
p is standard normal for all n
/ n
I Now we discuss a result that permits to obtain probabilities
Pn 2
2 i=1 (Xi X̄ )
for the sample variance S = n 1 :
(n 1)S 2 2
2
has n 1 distribution
I Rather counterintuitive, but important: X̄ and S 2 are
independent

213 / 218
Problem
1. The following data sets come from normal populations whose
standard deviation is specified. In each case, determine the
value of a statistic whose distribution is chi-squared, and tell
how many degrees of freedom this distribution has.
(a) 104, 110, 100, 98, 106; = 4
(b) 1.2, 1.6, 2.0, 1.5, 1.3, 1.8; = 0.5
(c) 12.4, 14.0, 16.0; = 2.4
2. Explain why a chi-squared random variable having n degrees
of freedom will approximately have the distribution of a
normal random variable when n is large.
Hint: Use the central limit theorem.

214 / 218
do
In

EM
the observed value that statistic is
of
2 103.6
1
104 6
03.61

106

1031
Z Z Zi
i I I
Y Yet Yu

Yi
haeg.gg

ssmE afpox
we can
apply C
nt no
X
Hence when n is large enough a X

is
approximately normal with some

parameters µ and

SPOILER N J 2n
M
The t distribution
If we standardize the sample mean using sample variance (instead
of population variance)
X̄ µ
p is no longer normal.
S/ n
This is said to be a t distribution with n 1 degrees of freedom

(Tn 1 ).
The density function of a t looks similar to a standard normal
density, although it is somewhat more spread out, resulting in its
having “larger tails”.

As the degree of freedom parameter increases, the density becomes

more and more similar to the standard normal density (see picture
next slide).

216 / 218
Plot of standard normal and t densities

217 / 218
Quantiles of the t distribution
If Td is a t random variable with d degrees of freedom, its
100(1 ↵) percentile is
td,↵ such that P(Td > td,↵ ) = ↵
(same concept as z↵ for the standard normal)

Table in the appendix of the textbook.

218 / 218

STA1006S Notes 2 DVW
No ratings yet
STA1006S Notes 2 DVW
21 pages
Chapter 5 - Sample Statistics
No ratings yet
Chapter 5 - Sample Statistics
90 pages
Stab22 Lecture8
No ratings yet
Stab22 Lecture8
21 pages
Limit Theoram
No ratings yet
Limit Theoram
20 pages
Lec 5
No ratings yet
Lec 5
64 pages
Econ-2042 - Unit 5-HO
No ratings yet
Econ-2042 - Unit 5-HO
22 pages
Chapter 5b. Continuous Variable
No ratings yet
Chapter 5b. Continuous Variable
55 pages
Topic 4
No ratings yet
Topic 4
2 pages
ISO Module 4 BCS301
No ratings yet
ISO Module 4 BCS301
25 pages
Chapter 7
No ratings yet
Chapter 7
10 pages
Central Limit Theorem
No ratings yet
Central Limit Theorem
16 pages
Edexcel Statistics Mechanics (Year 2) Binomial Distribution
No ratings yet
Edexcel Statistics Mechanics (Year 2) Binomial Distribution
4 pages
BUS 5 Prob Dist
No ratings yet
BUS 5 Prob Dist
35 pages
Topic 7 AM025 (Normal Distribution) STUDENT
No ratings yet
Topic 7 AM025 (Normal Distribution) STUDENT
82 pages
Sampling Distributions
No ratings yet
Sampling Distributions
32 pages
Central Limit Theorem
No ratings yet
Central Limit Theorem
74 pages
M6 2020 Normal Distribution Lecture Notes
No ratings yet
M6 2020 Normal Distribution Lecture Notes
31 pages
Random Variables
No ratings yet
Random Variables
8 pages
Chapter 6: Normal Probability Distributions
No ratings yet
Chapter 6: Normal Probability Distributions
15 pages
Common Probability
No ratings yet
Common Probability
47 pages
Lecture7 8
No ratings yet
Lecture7 8
30 pages
Section 1: Practice Questions (Students Are To Attempt All These Questions) Concept of Random and Non-Random Samples 1 (2007/NJC/P2/Q6i)
No ratings yet
Section 1: Practice Questions (Students Are To Attempt All These Questions) Concept of Random and Non-Random Samples 1 (2007/NJC/P2/Q6i)
15 pages
Sampling Distributions
No ratings yet
Sampling Distributions
25 pages
Lect9 Math231
No ratings yet
Lect9 Math231
42 pages
Tutorial 06 Soln
No ratings yet
Tutorial 06 Soln
4 pages
SLHT in Stat Prob 7 8
No ratings yet
SLHT in Stat Prob 7 8
9 pages
MIT18 05S14 Class6slides PDF
No ratings yet
MIT18 05S14 Class6slides PDF
24 pages
PS1 Tutorials Wk9 Solutions (2025)
No ratings yet
PS1 Tutorials Wk9 Solutions (2025)
16 pages
Chapter 5
No ratings yet
Chapter 5
20 pages
Psychoperiscope (Original Manuscript)
No ratings yet
Psychoperiscope (Original Manuscript)
20 pages
Sampling Distributions
No ratings yet
Sampling Distributions
25 pages
PS V
No ratings yet
PS V
3 pages
B.A. P Basic Statistics For Econ 3gp3l47
No ratings yet
B.A. P Basic Statistics For Econ 3gp3l47
16 pages
Notes ch3 Sampling Distributions
No ratings yet
Notes ch3 Sampling Distributions
20 pages
Stat 2 MCQ
No ratings yet
Stat 2 MCQ
115 pages
Statistics Chapter 7
No ratings yet
Statistics Chapter 7
25 pages
Statisticshomeworkhelpstatisticstutoringstatisticstutor Byonlinetutorsite 101015122333 Phpapp02
No ratings yet
Statisticshomeworkhelpstatisticstutoringstatisticstutor Byonlinetutorsite 101015122333 Phpapp02
25 pages
Himamaylan National High School
0% (1)
Himamaylan National High School
34 pages
Lesson 21: Normal Distributions: What If A Normal Isn't Standard?
No ratings yet
Lesson 21: Normal Distributions: What If A Normal Isn't Standard?
25 pages
Orientation - Basic Mathematics and Statistics - ND
No ratings yet
Orientation - Basic Mathematics and Statistics - ND
33 pages
Probability Distributions
No ratings yet
Probability Distributions
18 pages
Chapter 7 - 7.1 - 7.2 - Nalanda
No ratings yet
Chapter 7 - 7.1 - 7.2 - Nalanda
74 pages
Prob Dist Updated-Wps
No ratings yet
Prob Dist Updated-Wps
65 pages
Sampletest2 Fall2003
100% (1)
Sampletest2 Fall2003
8 pages
Central Limit Theorem
No ratings yet
Central Limit Theorem
4 pages
Sample Size
No ratings yet
Sample Size
51 pages
Stat - Prob 11 - Q3 - SLM - WK6-8
No ratings yet
Stat - Prob 11 - Q3 - SLM - WK6-8
34 pages
Effectiveness of Personal Budgeting To Daily Expenses Among Grade 12 Students of Malinta National High School
100% (1)
Effectiveness of Personal Budgeting To Daily Expenses Among Grade 12 Students of Malinta National High School
60 pages
Chapter 5: Sampling & Estimation Example 1:: Solution
No ratings yet
Chapter 5: Sampling & Estimation Example 1:: Solution
3 pages
BUS 172 Practice Maths
0% (1)
BUS 172 Practice Maths
18 pages
Pertemuan 8
No ratings yet
Pertemuan 8
29 pages
PR2 Sample-Size
No ratings yet
PR2 Sample-Size
12 pages
Bonsa R10
No ratings yet
Bonsa R10
48 pages
Term 1: Business Statistics: Session 4: Continuous Probability Distributions
No ratings yet
Term 1: Business Statistics: Session 4: Continuous Probability Distributions
26 pages
Unit 3 Study Guide Answers
No ratings yet
Unit 3 Study Guide Answers
7 pages
Project Proposal
No ratings yet
Project Proposal
18 pages
Pr (X ≤x) =F (x) = 1 2πσ 1 2πσ: Φ (z) = e dt
No ratings yet
Pr (X ≤x) =F (x) = 1 2πσ 1 2πσ: Φ (z) = e dt
8 pages
Vodafone
No ratings yet
Vodafone
66 pages
Research Methodology Chapter 4: Analysis: Assoc. Prof Sallehuddin Muhamad UTM Razak School
No ratings yet
Research Methodology Chapter 4: Analysis: Assoc. Prof Sallehuddin Muhamad UTM Razak School
73 pages
Final CA
No ratings yet
Final CA
27 pages
Onobrakpeya, & Ugwuonah, 2023
No ratings yet
Onobrakpeya, & Ugwuonah, 2023
20 pages
GENG5507 Stat TutSheet 5 Solutions
No ratings yet
GENG5507 Stat TutSheet 5 Solutions
5 pages
Abebe Sisay Proposal - New
No ratings yet
Abebe Sisay Proposal - New
34 pages
Chapter 8
No ratings yet
Chapter 8
19 pages
Estimation of Parameters 2
No ratings yet
Estimation of Parameters 2
37 pages
6.5 - The Central Limit Theorem: Objectives
No ratings yet
6.5 - The Central Limit Theorem: Objectives
6 pages
2011 CAPE Applied Math P1
100% (3)
2011 CAPE Applied Math P1
9 pages
Statistics Project: Khizar Bin Nasir Salman Ali Ghause Ahmad
No ratings yet
Statistics Project: Khizar Bin Nasir Salman Ali Ghause Ahmad
11 pages
Statistics Study Guide: Matthew Chesnes The London School of Economics September 22, 2001
No ratings yet
Statistics Study Guide: Matthew Chesnes The London School of Economics September 22, 2001
22 pages
Prevalence Factors Early Onset Menarche Adolescent School Girls
No ratings yet
Prevalence Factors Early Onset Menarche Adolescent School Girls
7 pages
lx25lm
No ratings yet
lx25lm
9 pages
Chap 5
No ratings yet
Chap 5
3 pages
Teo Et Al. 2013 - 2
No ratings yet
Teo Et Al. 2013 - 2
19 pages
Corrected SLK3 StatisticsandProbability Week7 8
No ratings yet
Corrected SLK3 StatisticsandProbability Week7 8
16 pages
Guidelines For The Development of A Communication Strategy: Matthew Cook Caitlin Lally Matthew Mccarthy
100% (1)
Guidelines For The Development of A Communication Strategy: Matthew Cook Caitlin Lally Matthew Mccarthy
15 pages
Problem Set 4 - Engineering Statistics PDF
No ratings yet
Problem Set 4 - Engineering Statistics PDF
4 pages
Statistics Homework Help, Statistics Tutoring, Statistics Tutor - by Online Tutor Site
No ratings yet
Statistics Homework Help, Statistics Tutoring, Statistics Tutor - by Online Tutor Site
30 pages
Business Statistics
No ratings yet
Business Statistics
6 pages
ADMS 2320 Test 1 Sheet
No ratings yet
ADMS 2320 Test 1 Sheet
1 page
Decision Sciences 1: Sample Questions (Set 2)
No ratings yet
Decision Sciences 1: Sample Questions (Set 2)
8 pages
Answer Key (Second Project)
No ratings yet
Answer Key (Second Project)
13 pages
Assignment 5 - Engineering Statistics - Spring 2018
No ratings yet
Assignment 5 - Engineering Statistics - Spring 2018
6 pages
MIT14 30s09 Lec17
No ratings yet
MIT14 30s09 Lec17
9 pages
Untitled
No ratings yet
Untitled
6 pages
Business Research Methods: Hypothesis Testing
No ratings yet
Business Research Methods: Hypothesis Testing
29 pages
Data Analysis Report (PR and GRPO)
No ratings yet
Data Analysis Report (PR and GRPO)
4 pages
Sampling and Sampling Distributions
No ratings yet
Sampling and Sampling Distributions
30 pages
Multiple Integrals, A Collection of Solved Problems
From Everand
Multiple Integrals, A Collection of Solved Problems
Steven Tan
No ratings yet
Mathematical Analysis 1: theory and solved exercises
From Everand
Mathematical Analysis 1: theory and solved exercises
Alessio Mangoni
5/5 (1)
Topology and Geometry for Physicists
From Everand
Topology and Geometry for Physicists
Charles Nash
3.5/5 (1)

8 Stat Rec

Uploaded by

8 Stat Rec

Uploaded by

How to use CLT

An insurance company has 10,000 automobile policyholders. If the

1.96 Vn 37.84 ns 7.84

A general rule of thumb is that you can be confident of the normal

(I would be a little more cautious about this last statement)

A general rule of thumb is that you can be confident of the normal

(I would be a little more cautious about this last statement)

NOTE: If the population is Normal, then X̄ is normal for all n.

(this is not an approximation from CLT,

As a consequence of the CLT, when n is large X is approx normal

successes in the sample

variance of sample proportion is

becomes smaller and

of course standard deviation is Ivorience

2. Find the probability that over 55 percent of the members of

or we can use normal approximation

To overcome this problem, it is best to compute

I CLT: for large n, the distribution is approximately normal

It is possible to prove that E [S 2 ] = 2

(no time to do it in class, if interested see textbook for details)

this explains the denominator n 1

As the degree of freedom parameter increases, the density becomes

Table in the appendix of the textbook.

You might also like