Stat 2 Unit 1
Stat 2 Unit 1
COURSE OBJECTIVE
8
SAMPLING THEORY
• Examining (studying) the entire group (population,
which may be difficult or even impossible to
examine), we may examine (study) only a small
part (portion) of the population (entire group of
objects or people).
• The process of obtaining samples is called
sampling.
• In general, Sampling is a technique of
collecting data only on a part of the
population to reveal the characteristics of the
entire population.
• Draw valid inferences about certain facts for
the population from results found in the
sample; a process known as statistical
inferences. 9
Basic concepts
Population :- is the total group of people
which you want to draw conclusions.
Sampling element :- are the individual cases
in the population (usually, persons).
Sample frame :- is the actual list of sampling
units from which the sample or some stage of
sample is selected.
Sample size :- it is a representative size of
the population. There are calculations to help
you determine this.
Bias:- Systematic errors produced by your
sampling procedure.
10
REASONS / NEED FOR SAMPLING
Time: as it is difficult to contact each and every
individual of the whole population.
Cost: The cost or expenses of studying all the
items (objects or individual) in a population may be
prohibitive.
Physically Impossible: Some population are
infinite, so it will be physically impossible to check
the all items in the population, such as populations of
fish, birds, snakes, mosquitoes.
Destructive Nature of items: to test the quality of
some products or items it may be necessary to
consume or destroy it. Under such circumstances a
census would consume everything or destroying it.
Reliability: Using a scientific sampling technique
the sampling error and the non-sampling error can 11 be
Sampling Methods
12
16
It is done when the population is known to be have heterogeneity with
regard to some factors and those factors are used for stratification
Using stratified sampling, the population is divided into homogeneous,
mutually exclusive groups called strata.
Among strata there is heterogeneity and within each strata units are
homogeneous
A population can be stratified by any variable that is available for all units
prior to sampling (e.g., age, sex, province of residence, income, etc.).
4. Cluster sampling
17
19
B. Non-probability
sampling
20
21
1. Convenience or haphazard
sampling
22
Definitions
A sampling distribution is a probability distribution for the
possible values of a sample statistic, such as a sample mean,
sample proportion, the difference between two sample mean
and the difference between two sample proportions.
32
7 3 0.1429
7.5 9 0.4285
8 6 0.2857
8.5 3 0.1429
Total 21 1.0000
35
2. If the population from which samples are taken is not normal, the
distribution of sample means will be approximately normal if the sample
size (n) is sufficiently large (n ≥ 30). The larger the sample size is used, the
closer the sampling distribution is to the normal curve.
The significance of the Central Limit Theorem is that it permits us to use
sample statistics to make inference about population parameters without
knowing anything about the shape of the frequency distribution of that
population other than XX X
Z X whatwe can get from the sample.
X X
36
Example 1:
The distribution of annual earnings of all bank tellers
with five years of experience is skewed negatively.
This distribution has a mean of Birr 15,000 and a
standard deviation of Birr 2000. If we draw a random
sample of 30 tellers, what is the probability that their
earnings will average more than Birr 15,750 annually?
37
Example 2:
Suppose that during any hour in a large department
store, the average number of shoppers is 448, with a
standard deviation of 21 shoppers. What is the
probability of randomly selecting 49 different shopping
hours, counting the shoppers, and having the sample
mean fall between 441 and 446 shoppers, inclusive?
Sampling Distribution of Proportions ( P)
38
Pq
2. The standard error of the proportion is equal to: P , where P= population
n
proportion
q=1–P
n = sample size.
40
Example :
0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09
1.8 0.4641 0.4649 0.4656 0.4664 0.4671 0.4678 0.4686 0.4693 0.4699 0.4706
1.9 0.4713 0.4719 0.4726 0.4732 0.4738 0.4744 0.4750 0.4756 0.4761 0.4767
2.0 0.4772 0.4778 0.4783 0.4788 0.4793 0.4798 0.4803 0.4808 0.4812 0.4817
2.1 0.4821 0.4826 0.4830 0.4834 0.4838 0.4842 0.4846 0.4850 0.4854 0.4857
2.2 0.4861 0.4864 0.4868 0.4871 0.4875 0.4878 0.4881 0.4884 0.4887 0.4890
2.3 0.4893 0.4896 0.4898 0.4901 0.4904 0.4906 0.4909 0.4911 0.4913 0.4916
2.4 0.4918 0.4920 0.4922 0.4925 0.4927 0.4929 0.4931 0.4932 0.4934 0.4936
2.5 0.4938 0.4940 0.4941 0.4943 0.4945 0.4946 0.4948 0.4949 0.4951 0.4952
2.6 0.4953 0.4955 0.4956 0.4957 0.4959 0.4960 0.4961 0.4962 0.4963 0.4964
2.7 0.4965 0.4966 0.4967 0.4968 0.4969 0.4970 0.4971 0.4972 0.4973 0.4974
2.8 0.4974 0.4975 0.4976 0.4977 0.4977 0.4978 0.4979 0.4979 0.4980 0.4981
2.9 0.4981 0.4982 0.4982 0.4983 0.4984 0.4984 0.4985 0.4985 0.4986 0.4986
3.0 0.4987 0.4987 0.4987 0.4988 0.4988 0.4989 0.4989 0.4989 0.4990 0.4990