0% found this document useful (0 votes)
5 views44 pages

Sampling Sta414

The document discusses information collection and sampling techniques, emphasizing the importance of representative samples when conducting surveys. It outlines methods such as census and sampling, the significance of probability sampling, and the concept of sampling distribution. Additionally, it covers standard error, sampling error, and the central limit theorem, providing examples and formulas related to these statistical concepts.

Uploaded by

screamwala
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views44 pages

Sampling Sta414

The document discusses information collection and sampling techniques, emphasizing the importance of representative samples when conducting surveys. It outlines methods such as census and sampling, the significance of probability sampling, and the concept of sampling distribution. Additionally, it covers standard error, sampling error, and the central limit theorem, providing examples and formulas related to these statistical concepts.

Uploaded by

screamwala
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 44

STA 411: QADM

Osama Bin Ajaz


Information Collection &
Sampling techniques
• Information can be obtained from the following two methods.
• Census: • obtaining information for the entire population of interest.
• conducting a census may be time consuming, costly, impractical,
or even impossible.
• Sampling: • Process of selecting a sample.
• Method for obtaining a sample from the population.
• Because the sample will be used to draw conclusions about the
entire population, it should be a representative sample.
What we will learn?
• How to conduct surveys.
• How to make sure they are representative,
• and what can go wrong.
Why sampling?
When a census is not possible?
• Suppose you are being recommended for a Lipid profile test.

• Would you prefer to measure the entire population of your blood, or


would you prefer to give a blood sample?

• For a textile company, What percentages of raw fabric are defective?


Speed
• It takes several years to successfully plan and execute a census of the
entire population.

• Getting figures such as monthly unemployment rates would be


impossible with a census.

• It is much faster to collect a sample than a census if the population is


large
Accuracy
• If the sample is representative, then the inferences made from the
sample will be almost accurate.
Remember!
• Data must represents the larger group on the
topic of interest.

• What can happened if the samples are not


representative? See the case study in the next
slide.
what can happen when a sample is not representative?
An important method of ensuring that
sample data represents a larger population is
to use a Probability Sampling.
Probability Sampling
• Most modern sampling procedures involve the use of probability sampling.

• In probability sampling, a random device—such as tossing a coin, consulting a


table of random numbers, or employing a random-number generator—is
used to decide which members of the population will constitute the sample
instead of leaving such decisions to human judgment.

• Probability sampling eliminates unintentional selection bias and permits the


researcher to control the chance of obtaining a non-representative sample.
Simple Random Sampling

• There are two types of simple random sampling.


• Sampling with replacement
• Sampling without replacement
Example # 01: Sampling without
replacement
• The top five state officials of Oklahoma are as shown: (i) Governor (G),
(ii) Lieutenant Governor (L), (iii) Secretary of State (S), (iv) Attorney
General (A), and (v) Treasurer (T). Consider these five officials a
population of interest.

Example # 02: Sampling with
replacement
• Select random samples of size 2 from a population of 5 students:
Ahmer, Bilal, Chand, Daniyal and Ejaz.
AA, BA, CA, DA, EA
AB, BB, CB, DB, EB
AC, BC, CC, DC, EC
AD, BD, CD, DD, ED
AE, BE, CE, DE, EE
• (a) P(Ahmer)
Example # 03:
• Assume students as a population and the marks obtained by them in
a certain statistics class are 20, 15, 12, 16, and 18. Draw all possible
random samples of two students when sampling is done
• (i) with replacement
• ii) without replacement
• Calculate the mean marks for each samples.
20, 15, 12, 16, and 18.
SAMPLING DISTRIBUTION
• The probability distribution of a statistic computed from all possible
samples of the same size is called a sampling distribution.
• Since sampling distribution is a probability distribution, therefore the
sum of all probabilities is always one.
• Sampling distribution of a statistics provide all the information one
needs in making decisions about the values of the population
parameters.
• The sampling distribution of a statistic depends on the distribution of
the population, the size of the samples, and the method of choosing
the samples.
Standard Error (S.E) & Sampling
Error
• The standard deviation of a sampling distribution of a sample statistic
is called the standard error of the statistic.
• It measures the dispersion of the value of a statistic, that might be computed
from all possible samples.
• Sampling error is the error resulting from using a sample to estimate
a population characteristic.
Sampling Distribution of Sample Mean

• Suppose a researcher selects a sample of 30 adult males and finds the


mean of the measure of the triglyceride levels for the sample subjects
to be 187 milligrams/deciliter.
• Then suppose a second sample is selected, and the mean of that
sample is found to be 192 milligrams/deciliter.
• Continue the process for 100 samples.
• 187, 192, 184, . . . , 196
• What happens?
• The mean becomes a random variable.
Sampling Distribution of Sample
Mean
Example # 03:
• Suppose a professor gave an 8-point quiz to a small class of four
students. The results of the quiz were 2, 6, 4, and 8. For the sake of
discussion, assume that the four students constitute the population.
• (a) Calculate population mean and standard deviation.
• (b) Draw the graph of original distribution.
• (c) Draw all possible samples of size 2 with replacement & Calculate
mean of each sample.
• (d) Construct Frequency distribution of Sample Mean and repeat Part
(a) & (b)
2, 6, 4, and 8
sample sample mean 5 pop mean
2 2 2 2.236068 pop SD
2 6 4
2 4 3 Sampling dist of a sample mean
2 8 5 xbar f
6 4 5 2 1
6 8 7 3 2
4 8 6 4 3
6 6 6 5 4
4 4 4 6 3
8 8 8 7 2
6 2 4 8 1
4 2 3 toal 16
8 2 5
4 6 5 1.581139
8 6 7
Central Limit Theorem (Contd.)
• If the sample size is sufficiently large, the central limit theorem can be
used to answer questions about sample means in the same manner
that a normal distribution can be used to answer questions about
individual values. The only difference is that a new formula must be
used for the z values. It is
Example # 04
• Hours That Children Watch Television: A. C. Neilsen reported that children
between the ages of 2 and 5 watch an average of 25 hours of television per
week. Assume the variable is normally distributed and the standard
deviation is 3 hours. If 20 children between the ages of 2 and 5 are
randomly selected, find the probability that the mean of the number of
hours they watch television will be greater than 26.3 hours. Z = 1.94
(0.0262)
• Age of Vehicle: The average age of a vehicle registered in the United States
is 8 years, or 96 months. Assume the standard deviation is 16 months. If a
random sample of 36 vehicles is selected, find the probability that the mean
of their age is between 90 and 100 months.
0.02619
Summary of Formulas
Example # 05:
• Suppose that the population of interest consists of the five starting players
on a men’s basketball team, who we will call A, B, C, D, and E. Further
suppose that the variable of interest is height, in inches.

• (a.) Obtain the sampling distribution of the sample mean for samples of size
2.
• (b.) Make some observations about sampling error when the mean height of
a random sample of two players is used to estimate the population mean
height.
Example # 06:
Sample
Size and
Sampling
Error
Finite Population Correction
Factor
• The formula for the standard error of the mean is accurate when the samples are
drawn with replacement Or are drawn without replacement from a very large or
infinite population.
• Since sampling with replacement is for the most part unrealistic, a correction
factor is necessary for computing the standard error of the mean for samples drawn
without replacement from a finite population.

where N is the population size and n is the sample size.


• This correction factor is necessary if relatively large samples are taken from a
small population, because the sample mean will then more accurately
estimate the population mean and there will be less error in the estimation.
Example # 07
• Consider the following population of size 5: 0, 3, 6, 9 & 12.
• (a) Draw all possible samples of size 3 w/o replacement and the find
mean of each sample.
• Find sampling distribution of sample mean and verify that

• Find standard error of


Sampling distribution
of
Difference between means
Example # 08: Sampling Difference
b/w two means
• Consider the following finite populations:
Population – I: 3, 4, 5 & Population – II: 0, 3
(a) Draw all possible random samples of size 2 from population – I with
replacement.
(b) Draw all possible random samples of size 3 from population – II with
replacement.
Population – I: 3, 4, 5 &
Population – II: 0, 3

0 1 1 1 2 2 3

3 3 2 2 2 1 1 0
3.5 3.5 2.5 2.5 2.5 1.5 1.5 0.5
4
4.5
4
5
4.5
4
Other 3 methods of sampling

• Samples randomly selected from each strata.


• Within strata population is same relative to the characteristics under study.

• Some clusters are randomly selected.


• Every unit of selected clusters are included in the sample.
Thank you

You might also like