0% found this document useful (0 votes)

59 views9 pages

Balaji Statistics With R-Package Central Limit Theorem (CLT) : Solved Example

The central limit theorem states that the sampling distribution of sample means approaches a normal distribution as the sample size increases, regardless of the shape of the population distribution. According to the CLT, the mean of sample means equals the population mean, and the standard deviation of sample means equals the population standard deviation divided by the square root of the sample size. The CLT is applicable for sample sizes of 30 or more. The CLT allows statisticians to use normal distributions to analyze sample means even if the underlying population is not normally distributed.

Uploaded by

Ashutosh Yadav

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

59 views9 pages

Balaji Statistics With R-Package Central Limit Theorem (CLT) : Solved Example

Uploaded by

Ashutosh Yadav

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 9

Balaji

Statistics with R-package

Module 5

Central limit theorem[CLT]

The Central Limit Theorem is the sampling distribution of the sampling means approaches a
normal distribution as the sample size gets larger, no matter what the shape of the data
distribution. An essential component of the Central Limit Theorem is the average of sample
means will be the population mean.
Similarly, if you find the average of all of the standard deviations in your sample, you will find the
actual standard deviation for your population.

 Mean of sample is same as the mean of the population. μx

 The standard deviation of the sample is equal to the standard deviation σx of the
population divided by the square root of the sample size.n
Central limit theorem is applicable for a sufficiently large sample sizes (n ≥ 30). The formula for
central limit theorem can be stated as follows:

μȲ ¯¯¯=μx
and
σȲ ¯¯¯=σx/√n
Where,
μx = Population mean
σx = Population standard deviation
μȲ ¯¯ = Sample mean
σȲ ¯¯¯ = Sample standard deviation
n = Sample size

Solved Example
Question: The record of weights of the male population follows the normal distribution. Its mean
and standard deviations are 70 kg and 15 kg respectively. If a researcher considers the records
of 50 males, then what would be the mean and standard deviation of the chosen sample?

Solution:
Mean of the population μx = 70 kg
Standard deviation of the populationσ x = 15 kg
sample size n = 50
Mean of the sample is given by:
Ȳ ¯ = 70 kg
Standard deviation of the sample is given by:
σȲ ¯¯¯ = σx√n
σȲ ¯¯¯ = 15√50 =SE
σȲ ¯¯¯ = 2.122 = 2.1 kg (approx)=SE

The demonstration of CLT

Section -A
#SAMPLING DITRIBUTION: SAMPLE MEAN
#Consider a normal population of 1000 students
#who appear for a written test of 100 marks.
#X1,X2,....,X1000. Examiner says mean is 62
#with sd=12.generate a random samples
#of 20 students each.
pdata<-rnorm(1000,62,12)
hist(pdata)

abline(v=62,col="red")
iter<-500
n<-40
am<-rep(NA,iter)
for(i in 1:iter){
d<-sample(pdata,n)
d
am[i]<-mean(d)
}
hist(am,col="red")
n1<-length(am)
n1
#AM OF ALL 500 SAMPLE MEANS
sam<-mean(am)
sam
hist(am)
abline(v=sam,col="green")
#SD OF 500 SAMPLE MEANS
ssd<-sd(am)
ssd
#IT IS KNOW AS SE OF SAMPLE MEAN
#SE=sigma/sqrt(n)
se<-12/sqrt(40)
se
# probability distribution of sample mean is
#ND[MU,SE] is the conclusion.
Section-B

#sampling distribution of non normal distribution

data(trees)
names(trees)
x<-trees$"Girth"
x
mu<-mean(x)
mu
sigma<-sd(x)
sigma
hist(x)
# x data is not ND
n<-20
iter<-300

am<-rep(NA,iter)
for(i in 1:iter){
d<-sample(x,n)
d
am[i]<-mean(d)
}
hist(am,col="red")
n1<-length(am)
n1
#AM OF ALL 500 SAMPLE MEANS
sam<-mean(am)
sam
hist(am,col="green")
abline(v=sam,col="red")
#SD OF 500 SAMPLE MEANS
ssd<-sd(am)
ssd
#IT IS KNOW AS SE OF SAMPLE MEAN
#SE=sigma/sqrt(n)
se<-sigma/sqrt(n)
se
# probability distribution of sample mean is
#ND[MU,SE] is the conclusion.
Testing of hypothesis on Population mean[s].
General idea:- [i] Null hypothesis Ho
[ii] Alternate hypothesis H1
[iii] level of significance α.=P[reject Ho under
the assumption it is true.]
[iv] Critical /Rejection region for given α
[v] Critical value and then Decision rule.

[i] #No sample data available:[CV Approach]

One sample µ=µ0

Ex:-1 An inventor has developed a new,

energy-efficient lawn mower engine. He claims
that the engine will run continuously for 5 hours
(300 minutes) on a single gallon of regular
gasoline. From his stock of 2000 engines, the
inventor selects a simple random sample of 50
engines for testing. The engines run for an
average of 295 minutes, with a standard
deviation of 20 minutes. Test the null hypothesis
that the mean run time is 300 minutes against
the alternative hypothesis that the mean run
time is not 300 minutes. Use a 0.05 level of
significance. (Assume that run times for the
population of engines are normally distributed.)
[a] X=Run time per engine in minutes
2000 values in population, unknown

H0: µx=300 H1: µx≠300 ,

µx=300 , µx>300 µx<300 possibility.
Only one is true. Finding the truth with the
help of a random sample is called Testing of
hypothesis.
[b] GIVEN l.o.s= α= 0.05 = P[reject Ho under
the assumption it is true.]
Two sided test
[c] Z=(Ȳ- µ0)/se=sd/sqrt(n)
SND for critical values.
Conclusion will be Accept H0,if Z in acceptance
region-[1-α] reject H0 if Z lie in [α]

EX:-2 Bon Air Elementary School has

1000 students. The principal of the school
thinks that the average IQ of students at
Bon Air is at least 110. To prove her point,
she administers an IQ test to 20 randomly
selected students. Among the sampled
students, the average IQ is 108 with a
standard deviation of 10. Based on these
results, should the principal accept or reject
her original hypothesis? Assume a
significance level of 0.05. (Assume that test
scores in the population of engines are
normally distributed.)
H0: µx≥110 H0: µx
˂110
l.o.s= 0.05 Left sided SMALL SAMLE T TEST
t=(Ȳ- µ0)/sd/sqrt(n) t- for cv because n<30

[ii] Direct sample data available.[P-value Approach]

data(sleep)
names(sleep)
x<-sleep$"extra"
y<-sleep$"group"
z<-sleep$"ID"

one sample t-test

t.test(x,mu=1.5)

[iii] #Two samples test µ1=µ2

Directly from samples data
with (sleep,t.test(x[y==1],x[y==2]))
plot(x~y)
Ex:-3 Within a school district, students were randomly
assigned to one of two Math teachers - Mrs. Smith and Mrs. Jones.
After the assignment, Mrs. Smith had 30 students, and Mrs. Jones
had 35 students.

At the end of the year, each class took the same standardized test.
Mrs. Smith's students had an average test score of 78, with a
standard deviation of 10; and Mrs. Jones' students had an average
test score of 85, with a standard deviation of 15.

Test the hypothesis that Mrs. Smith and Mrs. Jones are equally
effective teachers. Use a 0.10 level of significance. (Assume that
student performance is approximately normal.)

Mrs. Smith Mrs. Jones HO: µ1=µ2 H1: µ1≠µ2

N 30 35

AM 78 85

SD 10 15

Ex:-4 The Acme Company has developed a new battery. The

engineer in charge claims that the new battery will operate
continuously for at least 7 minutes longer than the old battery.

To test the claim, the company selects a simple random sample of

100 new batteries and 100 old batteries. The old batteries run
continuously for 190 minutes with a standard deviation of 20
minutes; the new batteries, 200 minutes with a standard deviation
of 40 minutes.

Test the engineer's claim that the new batteries run at least 7
minutes longer than the old. Use a 0.05 level of significance.
(Assume that there are no outliers in either sample.)

OLD[ µ1] NEW [µ2] HO: µ1-µ2≤-7, H1: µ1-µ2˃-7

N 100 100

AM 190 200

SD 20 40

****************************************

TEST OF HYPOTHESIS ON PARAMETER,MEAN

Q1. What marks you expect for this paper out of 100?
Ans is 80.

Q2. How much confidence are you?

Ans is 90%.

Q3 .What is the meaning of other 10%?

Ans is Possibility that the true value is either <80 or
>80. This means probability of not getting the target is
0.10.
Q4. To know the true mark, what is to be done.?
Ans is only after the experiment ,examination.

The above discussion is indicating that one of the

three possibility, < 80 >80 or=80 is true with
probabilities .05,.05 and .90.
Q5. Researcher wish to reach the objective with a
high probability. But while doing the research
possibly he may get a different output. This
probability will be small. Based on available
information one has to verify
Whether Yes or No for the result on objective. This
process of deciding the truth is Statistically called Test
of hypothesis.

Notations:
1. Null hypothesis
H0: Claim on certain parameter. In the discussion
Mark=80, Need not be true.
2. Alternate hypothesis.
H1: It is negation of H0. It may be one sided or
two sided.
3. X1,X2,X3,….Xn is available information .Sample
data.
4. In test of hypothesis The probability of
acceptance is known as Acceptance region. Other
probability is Rejection region in the graph of
analysis. It is denoted by 1-α and α. In two sided
study the three probabilities are 1-α ,α/2, α/2.
This α in general is called LEVEL OF SIGNIFICANCE
IN testing of hypothesis.

Z=[Variable-AM]/sd for data values.

Z=[Sample mean-AM]/SE. for sample mean data.
SE=sd/sqrt[n]

Kuvempu. Universe
No ratings yet
Kuvempu. Universe
73 pages
Mahabharata12 Shanti
No ratings yet
Mahabharata12 Shanti
960 pages
On Job Annual Training Plan 2023
No ratings yet
On Job Annual Training Plan 2023
3 pages
Tax Invoice: 1046.17 Total Invoice Amount Rs
No ratings yet
Tax Invoice: 1046.17 Total Invoice Amount Rs
2 pages
Of Plymouth Plantation PDF
100% (2)
Of Plymouth Plantation PDF
4 pages
Balaji-Module - 1-Module - 1
No ratings yet
Balaji-Module - 1-Module - 1
3 pages
LDB MP2020 FRMWRK
No ratings yet
LDB MP2020 FRMWRK
77 pages
YLSTD30-40K01小功率直流充电桩用户手册User Manua V1 - (EN&CN) ) 已校对
No ratings yet
YLSTD30-40K01小功率直流充电桩用户手册User Manua V1 - (EN&CN) ) 已校对
17 pages
Chapter 3 Data Modeling Using The Entity Relationship ER Model
No ratings yet
Chapter 3 Data Modeling Using The Entity Relationship ER Model
55 pages
DCK-datacenter Strategies PDF
No ratings yet
DCK-datacenter Strategies PDF
26 pages
Intro To Psych L6
No ratings yet
Intro To Psych L6
10 pages
Briefing Dealer - Update Socialization Jan 2020
No ratings yet
Briefing Dealer - Update Socialization Jan 2020
17 pages
Framo Pumps
No ratings yet
Framo Pumps
5 pages
Vitotres343 TechGuide PDF
No ratings yet
Vitotres343 TechGuide PDF
32 pages
Case 1.1 Dell
No ratings yet
Case 1.1 Dell
21 pages
KP Technical Seminal Final Report FINAL
No ratings yet
KP Technical Seminal Final Report FINAL
30 pages
Group 2 - Aspects of Connected Speech
No ratings yet
Group 2 - Aspects of Connected Speech
31 pages
Case Study BKS 1
No ratings yet
Case Study BKS 1
2 pages
Happy Birthday
No ratings yet
Happy Birthday
2 pages
The Star Weaver
No ratings yet
The Star Weaver
2 pages
Effectiveness of PPE Welding Presentation
No ratings yet
Effectiveness of PPE Welding Presentation
11 pages
Project Proposal Seminar Workshop
No ratings yet
Project Proposal Seminar Workshop
6 pages
LCD TV: Service Manual
No ratings yet
LCD TV: Service Manual
51 pages
Analysis of Consumer Satisfaction and Lo 300543b7
No ratings yet
Analysis of Consumer Satisfaction and Lo 300543b7
18 pages
Product Management Case
No ratings yet
Product Management Case
4 pages
Case Study OBB 02
No ratings yet
Case Study OBB 02
1 page
Kagawaran NG Edukasyon: OUA MEMO 00-0821-0062
No ratings yet
Kagawaran NG Edukasyon: OUA MEMO 00-0821-0062
112 pages
BALAJI - Module - 6 7 AOV, CHI
No ratings yet
BALAJI - Module - 6 7 AOV, CHI
8 pages
Tata Value Management: (Tata Steel CVM Implementation)
No ratings yet
Tata Value Management: (Tata Steel CVM Implementation)
8 pages
1 Datasheet Solis-3P10K-4G
No ratings yet
1 Datasheet Solis-3P10K-4G
2 pages
Impact of HL On QOL
No ratings yet
Impact of HL On QOL
8 pages
Chapter 3
No ratings yet
Chapter 3
35 pages
Lexicology Summary 1
No ratings yet
Lexicology Summary 1
1 page
Battery Room Gas Monitoring Application Note WSA Datasheet
No ratings yet
Battery Room Gas Monitoring Application Note WSA Datasheet
3 pages
Timber Formwork Design
No ratings yet
Timber Formwork Design
12 pages
HDBS Parameters
No ratings yet
HDBS Parameters
1 page
Gmail - 1st International Conference On Advances in Computing, Communication and Networking (ICAC2N2024) - Submission (295) Has Been Created
No ratings yet
Gmail - 1st International Conference On Advances in Computing, Communication and Networking (ICAC2N2024) - Submission (295) Has Been Created
2 pages
MAT 361 Lecture 24 25
No ratings yet
MAT 361 Lecture 24 25
45 pages
6 Inferential Statistics
100% (1)
6 Inferential Statistics
55 pages
10 Statistical Inference
No ratings yet
10 Statistical Inference
22 pages
P&S Unit - Iv PDF
No ratings yet
P&S Unit - Iv PDF
62 pages
T - Test
100% (2)
T - Test
32 pages
Do Hhjmbfujhfddgbkod
No ratings yet
Do Hhjmbfujhfddgbkod
1 page
BITM Srs Schedule Date 09th November 2022
No ratings yet
BITM Srs Schedule Date 09th November 2022
1 page
BITM Srs Schedule Date 15th November 2022
No ratings yet
BITM Srs Schedule Date 15th November 2022
1 page
BITM Srs Schedule Date 08th November 2022
No ratings yet
BITM Srs Schedule Date 08th November 2022
1 page
Customer Delight
No ratings yet
Customer Delight
1 page
All
No ratings yet
All
1 page
BITM B Weekly Syllabus Coverage Status
No ratings yet
BITM B Weekly Syllabus Coverage Status
1 page
06 - Testing of Hypothesis
No ratings yet
06 - Testing of Hypothesis
24 pages
5 - Stat Lecture..
No ratings yet
5 - Stat Lecture..
44 pages
Inference Using Normal and T Distribution
No ratings yet
Inference Using Normal and T Distribution
9 pages
Lecture 4
No ratings yet
Lecture 4
28 pages
Sampling
No ratings yet
Sampling
34 pages
Inferential Stat - One Sample
No ratings yet
Inferential Stat - One Sample
41 pages
Eda Research
No ratings yet
Eda Research
11 pages
Chapter 3 Hypothesis Testing
No ratings yet
Chapter 3 Hypothesis Testing
80 pages
Quiz 2 Cheatsheet v3
No ratings yet
Quiz 2 Cheatsheet v3
2 pages
F2 Statistical Inference
No ratings yet
F2 Statistical Inference
43 pages
Stat Unit 3 - T Test
No ratings yet
Stat Unit 3 - T Test
25 pages
5 - Test of Hypothesis (Part - 1)
No ratings yet
5 - Test of Hypothesis (Part - 1)
46 pages
Toh Solved
No ratings yet
Toh Solved
37 pages
Unit 5. Test of Significance
No ratings yet
Unit 5. Test of Significance
56 pages
Cosm Unit - IV
No ratings yet
Cosm Unit - IV
18 pages
Sampling PDF
No ratings yet
Sampling PDF
117 pages
Chapter 4test of Hypotheses
No ratings yet
Chapter 4test of Hypotheses
42 pages
SB K49 Lecture8
No ratings yet
SB K49 Lecture8
51 pages
Z Test
No ratings yet
Z Test
25 pages
Stat 255 Supplement 2011 Fall
100% (1)
Stat 255 Supplement 2011 Fall
78 pages
Solutions One Sample Hypothesis Testing 7
No ratings yet
Solutions One Sample Hypothesis Testing 7
12 pages
Stat Prob Q4 W5
No ratings yet
Stat Prob Q4 W5
7 pages
Theory of Decision
No ratings yet
Theory of Decision
9 pages
CH 8
No ratings yet
CH 8
20 pages
Module 3A0 Tests For A Population Mean
No ratings yet
Module 3A0 Tests For A Population Mean
52 pages
Lecture Note 5
No ratings yet
Lecture Note 5
8 pages
Q4W5 Module 8 - Solving Problems Involvinheheg Test of Hypothesis On The Population Mean
No ratings yet
Q4W5 Module 8 - Solving Problems Involvinheheg Test of Hypothesis On The Population Mean
12 pages
Bản sao 07 - One Population Hypothesis Testing-1
No ratings yet
Bản sao 07 - One Population Hypothesis Testing-1
7 pages
Chapter IX Hypothesis Testing
No ratings yet
Chapter IX Hypothesis Testing
31 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
29 pages
SNM 1
No ratings yet
SNM 1
7 pages
Inbound 7189032283768096828
No ratings yet
Inbound 7189032283768096828
9 pages
Week 4
No ratings yet
Week 4
7 pages
Unit III - Formulae
No ratings yet
Unit III - Formulae
37 pages
Probability and Statistics - 3
No ratings yet
Probability and Statistics - 3
59 pages
GEC 410 DR Agarana M.C.: Hypothesis Testing
No ratings yet
GEC 410 DR Agarana M.C.: Hypothesis Testing
75 pages
Testing of Hypothesis: BY Arvind
No ratings yet
Testing of Hypothesis: BY Arvind
20 pages
The Central Limit Theorem and Hypothesis Testing Final
100% (1)
The Central Limit Theorem and Hypothesis Testing Final
29 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
12 pages
EFREN S. TELLERMO-Course Professor
No ratings yet
EFREN S. TELLERMO-Course Professor
11 pages
Hypothesis Test
No ratings yet
Hypothesis Test
23 pages
Flipped Notes 9 Applications of Testing Hypothesis
No ratings yet
Flipped Notes 9 Applications of Testing Hypothesis
27 pages
L15 Testing of Hypothesis
No ratings yet
L15 Testing of Hypothesis
42 pages
2022 Scheme Module 3 BCS302
No ratings yet
2022 Scheme Module 3 BCS302
17 pages
Descriptive Statistics Vs Inferential Statistics
No ratings yet
Descriptive Statistics Vs Inferential Statistics
8 pages
2 Hypothesis-Testing
No ratings yet
2 Hypothesis-Testing
43 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
5 pages
Introduction To Hypothesis Testing Purpose: Goodson/ 3360hyp 1
No ratings yet
Introduction To Hypothesis Testing Purpose: Goodson/ 3360hyp 1
5 pages
H T S M: Ypothesis Ests FOR A Ingle EAN
No ratings yet
H T S M: Ypothesis Ests FOR A Ingle EAN
1 page
Large Sample Test
No ratings yet
Large Sample Test
6 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
6 pages
05 Assignment 5 Solutions
0% (1)
05 Assignment 5 Solutions
7 pages

Balaji Statistics With R-Package Central Limit Theorem (CLT) : Solved Example

Uploaded by

Balaji Statistics With R-Package Central Limit Theorem (CLT) : Solved Example

Uploaded by

Balaji

Statistics with R-package

Central limit theorem[CLT]

 Mean of sample is same as the mean of the population. μx

The demonstration of CLT

#sampling distribution of non normal distribution

[i] #No sample data available:[CV Approach]

Ex:-1 An inventor has developed a new,

H0: µx=300 H1: µx≠300 ,

EX:-2 Bon Air Elementary School has

[ii] Direct sample data available.[P-value Approach]

one sample t-test

[iii] #Two samples test µ1=µ2

Mrs. Smith Mrs. Jones HO: µ1=µ2 H1: µ1≠µ2

Ex:-4 The Acme Company has developed a new battery. The

To test the claim, the company selects a simple random sample of

OLD[ µ1] NEW [µ2] HO: µ1-µ2≤-7, H1: µ1-µ2˃-7

TEST OF HYPOTHESIS ON PARAMETER,MEAN

Q2. How much confidence are you?

Q3 .What is the meaning of other 10%?

The above discussion is indicating that one of the

Z=[Variable-AM]/sd for data values.

You might also like