0% found this document useful (0 votes)
471 views33 pages

15chap 3.1 Sampling Distribution

The document discusses sampling and sampling distributions. It defines key terms like population, parameter, sample, statistic. A parameter describes a characteristic of the entire population, while a statistic describes a sample. The mean of all students is a parameter, while the mean of a sample of students is a statistic. The distribution of sample statistics is called a sampling distribution. As sample size increases, the sampling distribution of the mean approximates the normal distribution. Examples are provided to illustrate calculating the mean and variance of sampling distributions.

Uploaded by

Nyah Margarett
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
471 views33 pages

15chap 3.1 Sampling Distribution

The document discusses sampling and sampling distributions. It defines key terms like population, parameter, sample, statistic. A parameter describes a characteristic of the entire population, while a statistic describes a sample. The mean of all students is a parameter, while the mean of a sample of students is a statistic. The distribution of sample statistics is called a sampling distribution. As sample size increases, the sampling distribution of the mean approximates the normal distribution. Examples are provided to illustrate calculating the mean and variance of sampling distributions.

Uploaded by

Nyah Margarett
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 33

Chapter 3.

Sampling Distribution
Population Parameter and Sample Statistic
Sampling is the process to obtain useful information on a large
target group. The target group is called the population, the
portion of the population which is selected and observed is
called the sample, and the number of members in the sample
is called the sample size. The members in the sample are
called sampling units. The listing of all possible sampling units
is called sampling frame.
The numerical measures which describe certain characteristics of a
population are called population parameters and the numerical
measures calculated from the sample are called sample statistic.
Example:
1. The mean height of all Grade 11 students in Metro Manila is a
population parameter.
2. The mean height of the 1000 Grade 11 students in the sample is a
sample statistic.

It is important to notice that in every given population, the parameter


is a fixed quantity. On the other hand, the value of a statistic might
vary depending on the selected sample. They can also be measures of
proportion or dispersion and not limited to means or averages.
Determine whether each of the following is a population parameter
or a sample statistic.
1. The mode weight of all citizens in Metro Cebu.
parameter
2. The inter-quartile range of the residence of 50 copper wires taken
from a batch of 10,000 wires.
statistic

3. The maximum time to run 100 m for 3 basketball players taken


from the members of a basketball team.
statistic
4. The standard deviation of the test marks of all the students in
Grade 10. parameter
Types of Inferences
1. Estimation:
In this type of inference, we estimate the value of a population
parameter.

2. Testing:
In this type of inference, we formulate a decision about the value
of a population parameter.

3. Regression:
In this type of inference, we make predictions or forecasts about
the value of a population parameter.
Random Sample

Sampling can be done in two ways:


1. With replacement
Any datum chosen from the population to form the sample is returned to the
population so that it has the chance of being chosen again.

2. Without replacement
This means that once a datum is chosen, it has no further chance of being
chosen. In this case, the size of the population for sampling is reduced by
one as each sampling unit is chosen.

A sample chosen without replacement is called a random sample.


Sampling Distribution

sample statistics (plural)


It is a random variable that can be mean, proportion, or standard deviation
calculated from different samples. The sample statistic vary and will also take
on a random value.

sample mean
It is the mean computed from the randomly selected samples.

sampling distribution
A sample statistic together with its corresponding probability distribution
forms a sampling distribution.
Sampling Distribution
sampling error
It is the difference between the results obtained from a sample and the results
obtained from the population from which the sample was selected.
sampling error = 𝑥ҧ − 𝜇

Properties of the Distribution of Sample Means


1. The mean of the sample means 𝐸(𝑋) ത = 𝜇 ҧ is the same as the population
𝑥
mean 𝜇.
2. The variance of the sample means is 𝑉𝑎𝑟 𝑋ത = 𝜎𝑥ҧ 2 is smaller than the
variance of the population 𝜎 2 . The variance of the sample means is equal
𝜎2
to the variance of the population divided by the sample size, 𝑉𝑎𝑟 𝑋ത = .
𝑛
Remarks:

1. The standard deviation of the sample mean is also called the


𝜎
standard error of the means. It is denoted by 𝜎𝑥ҧ = .
𝑛

2. The variance of the sample mean can be reduced y increasing


the sample size. This means that the larger the sample size the
lose the sample mean is to the population mean.
3. For sampling without replacement, the variance of the
𝝈𝟐 𝑵−𝒏
ഥ =
sample mean is 𝑽𝒂𝒓 𝑿 𝒙 where N is the population
𝒏 𝑵−𝟏
size and n is the sample size.
Example 1:
Consider a population consisting of three numbers 6, 8, 10. A random
sample of size 2 is drawn from the population with replacement. Let X be
the random variable representing the number drawn each time.

Table for Sample Means:


Samples 6,6 6,8 6,10 8,6 8,8 8,10 10,6 10,8 10,10

𝑿 6 7 8 7 8 9 8 9 10

Sampling Distribution:
𝑼𝒔𝒆 𝒕𝒉𝒆 𝒄𝒂𝒍𝒄𝒖𝒍𝒂𝒕𝒐𝒓 𝒕𝒐 𝒇𝒊𝒏𝒅

𝒙 6 7 8 9 10
𝑷(ഥ
𝒙) 1/9 2/9 3/9 2/9 1/9 ഥ =𝟖
𝑬 𝑿
ഥ = 𝟏. 𝟑𝟑
𝑽𝒂𝒓 𝑿
Example 2:
Suppose a population consists of four data, namely 5, 8, 12, 16 and each
has an equal probability to be chosen. Suppose a sample of size 2 is
chosen from the population with replacement.
a. Find the mean 𝜇 and variance 𝜎 2 of the population.

b. Find the probability distribution of the sample mean 𝑋.
c. Find 𝐸 𝑋ത and 𝑉𝑎𝑟 𝑋ത

Samples 5,5 5,8 5,12 5,16 8,8 8,12 8,16 12,12 12,16 16,16

𝑿 5 6.5 8.5 10.5 8 10 12 12 14 16
Example 2:
a. Find the mean 𝜇 and variance 𝜎 2 of the population.

5+8+12+16
𝜇= = 10.25 𝜎 2 = 17.19
4

b. Find the probability distribution of the sample mean.



𝒙 5 6.5 8 8.5 10 10.5 12 14 16
𝑷(ഥ
𝒙) 0.1 0.1 0.1 0.1 0.1 0.1 0.2 0.1 0.1

c. Find 𝐸 𝑋ത and 𝑉𝑎𝑟 𝑋ത .

𝐸 𝑋ത = 10.25 Note: the sample variance is less


𝑉𝑎𝑟 𝑋ത = 115.37 − 10.25 2 = 10.31 than the population variance, that
is 10.31 < 17.19.
Example 3:
A population consists of the 5 numbers {0, 2, 4, 6, 8}.
Samples of size 3 are drawn from this population
without replacement.
a. List all the possible samples that could be
obtained and find the mean of each sample.
b. Find the mean of all the sample means in (a) and
compare it with the population mean.
c. Find the probability that the sample mean is
equal to 4.
Example 3:
Solutions:
Find the mean 𝜇 𝑎𝑛𝑑 𝜎 2 of the population. 𝜇 = 4 𝑎𝑛𝑑 𝜎 2 = 8
a. Samples 0,2,4 0,2,6 0,2,8 0,4,6 0,4,8 0,6,8 2,4,6 2,4,8 2,6,8 4,6,8

𝑿 2 2.7 3.3 3.3 4 4.7 4 4.7 5.3 6

b. ഥ
𝒙 2 2.7 3.3 4 4.7 5.3 6
𝑷(ഥ
𝒙) 0.1 0.1 0.2 0.2 0.2 0.1 0.1

𝐸 𝑋ത = 4 𝑎𝑛𝑑 𝑡ℎ𝑒 𝑉𝑎𝑟 𝑋ത = 17.33 − (4)2 = 1.33


𝝈𝟐 𝑵−𝒏
ഥ =
For sampling without replacement, the variance of the sample mean is 𝑽𝒂𝒓 𝑿 𝒙 ,
𝒏 𝑵−𝟏
ഥ = 𝟖 𝒙 𝟓−𝟑 = 𝟖 𝒙 𝟐 = 𝟒 = 𝟏. 𝟑𝟑
𝑽𝒂𝒓 𝑿
𝟑 𝟓−𝟏 𝟑𝒙𝟒 𝟑

c. 𝑃 𝑋ത = 4 = 0.2
Example 4:
Suppose a random sample of size 200 is taken from a population with a
mean of 510 kg and a standard deviation of 15 kg;
a. Find the mean and the variance of the sample mean.
b. If it is required to reduce the standard error of the mean to less than
0.5 kg, what is the minimum sample size?
Solution:
𝝈𝟐 𝟏𝟓𝟐
a. 𝑆𝑖𝑛𝑐𝑒 𝜇 = 510, 𝑡ℎ𝑒𝑛 𝐸 𝑋ത = 𝜇𝑥ҧ = 𝟓𝟏𝟎 𝒌𝒈 and 𝑽𝒂𝒓 𝑿
ഥ = = = 𝟏. 𝟏𝟑 𝒌𝒈𝟐
𝒏 𝟐𝟎𝟎
𝜎 15
b. < 0.5, 𝑛> , 𝑛 > 30, n > 900 , Answer: 901
𝑛 0.5
Sampling Distribution of the sample
mean from a Normal Distribution

Recall the notation:


Normal Distribution 𝑿~𝑵(𝝁, 𝝈𝟐 )
Standard Normal Distribution 𝑿~𝑵(𝟎, 𝟏)
𝝈𝟐
For a Random sample mean 𝑿~𝑵(𝝁, )
𝒏
Example 5:
Suppose a random sample of size 16 is taken from a
normal population X with a mean 51 and a standard
deviation 4;
a. State the sampling distribution of the sample mean.
b. Find the probability that the sample mean is greater
than 52.
c. Find the probability that the sample mean lies
between 49.5 and 51.5.
Example 5:
Given: 𝜇 = 51, 𝜎 = 4, 𝑛 = 16 Note: n < 30

a. State the sampling distribution of the sample mean.


42
𝐸 𝑋ത = 51, 𝑉𝑎𝑟 𝑋ത = =1 ഥ ~𝑵 𝟓𝟏, 𝟏 .
Hence, the statement is 𝑿
16

a. Find the probability that the sample mean is greater than 52.
52 − 51
𝑃 𝑋ത > 52 = 𝑃 = 𝑃 𝑍 > 1 = 𝟎. 𝟏𝟓𝟖𝟕
1
a. Find the probability that the sample mean lies between 49.5 and
51.5.
49.5 − 51 51.5 − 51
𝑃 49.5 < 𝑋ത < 51.5 = 𝑃 <𝑍< = 𝑃 −1.5 < 𝑍 < 0.5 = 𝟎. 𝟔𝟐𝟒𝟕
1 1
Sampling Distribution of the sample mean from
other Distribution
If the population from which the sample is taken does not
follow a normal distribution, then the sampling distribution of
the sample mean is also not normal.

Central Limit Theorem is used when the sample size is


sufficiently large to allow the use of normal distribution to
approximate the sampling distribution.
Sampling Distribution of the sample mean from
other Distribution

Central Limit Theorem states that when sampling a large


population at any distribution, the sample means have a normal
distribution whenever the sample size is 30 or more.

ҧ
𝑥−𝜇
Formula: 𝑍= 𝜎
𝑛
Example 6:
A sample size of 400 is taken from a population X that
follows a Poisson distribution with 𝜇 = 4. Find the
probability that the sample mean is less than 3.85.
Solution:
𝑿~𝑷𝒐 𝝁 𝒘𝒉𝒆𝒓𝒆 𝝁 = 𝝈𝟐 = 𝟒
3.85−4
𝑃 𝑋ത < 3.85 = 𝑃 𝑍 < 2 = 𝑃 𝑍 < −1.5 = 𝟎. 𝟎𝟔𝟔𝟖
400
Example 7:
The mean and standard deviation of the number of
phone calls that a company receives in an hour are found
to be 25 and 15 respectively. A sample of n different
hours is randomly selected. If the probability that the
sample mean number of phone calls received is greater
than 24 per hour and is 97.5%, find the least possible
value of n assuming that n is sufficiently large.
Example 7:
Solution:
Let X be the random variable representing the number of phone calls received in an
hour.
Central Limit Theorem can be used to approximate the distribution.
−29.4 − 𝑛
𝑃 𝑋ത > 24 > 0.975 >
−1 −1
24−25
𝑍> 15 > 0.975 (29.4)2 < ( 𝑛)2
𝑛
−𝟏
-1.96 ≻ 𝟏𝟓 𝒏 > 𝟖𝟔𝟒. 𝟑𝟔
𝒏
− 𝑛
−1.96 > Answer: The least possible sample size is 865.
15
Assessment 1:
The following table shows the probability distribution of a
discrete random variable X. (recall: its not sample mean)
𝒙 0 2 4 6 8
𝑷(𝒙) 0.1 0.3 0.2 0.25 0.15

Suppose a sample of size 100 is taken with replacement.


a. Find the mean and the variance of the random variable X.
b. Find the probabilities that the sample mean is greater than
4.5.
Assessment 1:
𝒙 0 2 4 6 8
𝑷(𝒙) 0.1 0.3 0.2 0.25 0.15

a. Use the probability distribution to solve for E(X) and Var(X).


E(X) = 4.10 and Var(X) = 6.19; hence SD(X) = 2.49

b. Use Central Limit theorem to solve for the probability of 𝑋ത > 4.5.
𝑥ҧ − 𝜇 4.5 − 4.10 0.4
𝑧= 𝜎 = = = 1.61
2.49 0.249
𝑛 100
𝑃 𝑍 > 1.61 = 0.0537
Assessment 2:
A random sample of size 20 is taken from a normal population
with mean 10 and variance 6. Find the probabilities that the
sample mean 𝑋ത is:
a. greater than 10.5,
b. less than 9.7; and
c. between 9.8 and 10.2
Assessment 2:

Use the Central Limit theorem to solve for the probabilities of the sample mean 𝑋.
𝜇 = 10 𝜎2 = 6 hence 𝜎 = 2.45
a. greater than 10.5
ҧ
𝑥−𝜇 10.5−10 0.5
𝑧= 𝜎 = 2.45 = = 0.91 𝑃 𝑍 > 0.91 = 0.1814
0.5478
𝑛 20

b. less than 9.7


ҧ
𝑥−𝜇 9.7−10 −0.3
𝑧= 𝜎 = 2.45 = = −0.55 𝑃 𝑍 < −0.55 = 0.2912
0.5478
𝑛 20

c. between 9.8 and 10.2


ҧ
𝑥−𝜇 9.8−10 −0.2 ҧ
𝑥−𝜇 10.2−10 0.2
𝑧= 𝜎 = 2.45 = = −0.37 𝑧= 𝜎 = 2.45 = = 0.37
0.5478 0.5478
𝑛 20 𝑛 20

𝑃 −0.37 < 𝑍 < 0.37 = 0.0537


Group Work:
There are 8 questions which will be answered by 8 members.
The 2 leaders shall give one question per student. The member
will encode the problem and encode the answer in less than
one half of a short coupon bond. The member will include
his/her name below the final answer. The 2 leaders will collect
the eight cuts and paste them in short coupon bonds
consecutively. Submit the output on January 7, 2018. Work on
this during December 10-14, 2018.
Criteria: Accuracy – 70%
Neatness and Design – 30%
Group Work:
Format:
Front Page:
URDANETA CITY NATIONAL HIGH SCHOOL
Senior High School
Science, Technology, Engineering, and Mathematics (STEM) Strand
SY 2018 - 2019
Statistics and Probability
ACTIVITY ON SAMPLING DISTRIBUTION and ESTIMATION OF PARAMETERS
Group No.
Leader’s Names (2)
Grade and Section
Question 1:
Suppose a population consists of 4 data, namely 1, 2, 4, and 8, and each
has an equal probability to be chosen.
a. Find the mean 𝜇 and variance 𝜎 2 of the population.
b. Suppose a sample of size 2 is chosen from the population with
replacement (𝑋):ത

b.1. Find the probability distribution of the sample mean 𝑋,
ത and Var (𝑋)
b.2. Find 𝐸(𝑋) ത .
Question 2:
Suppose a population consists of 4 data, namely 1, 2, 4, and 8, and each
has an equal probability to be chosen.
a. Find the mean 𝜇 and variance 𝜎 2 of the population.
b. Suppose a sample of size 2 is chosen from the population without
replacement (𝑌) ത :

b.1. Find the probability distribution of the sample mean 𝑌,
ത and Var(𝑌)
b.2. Find 𝐸(𝑌) ത .
Question 3:
Suppose we take a sample of size n from a population with standard
deviation of 16. If the standard error of the mean is less than 0.5, find the
minimum value of n.

Question 4:
A random sample of size 50 is taken from a normal population with mean
120 cm and standard deviation of 10 cm. Find the probabilities that :
a. the sample mean is less than or equal to 117 cm.
b. the sample mean lies between 121 cm and 124 cm.
Next are…

Point Estimation
Confidence Interval for Population Mean
Confidence Interval for Population Proportion

You might also like