J. K. Shah Classes Sampling Theory and Theory of Estimation
J. K. Shah Classes Sampling Theory and Theory of Estimation
Standard deviation σ s
Proportion P p
Size N n
: 501 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
Sampling Theory : The study of relationship existing between a population and the
samples drawn from population is called Sampling theory which is based on sampling.
SAMPLING WITH REPLACEMENT (SRSWR) : While selecting the units for a sample,
when a unit of sample selected is replaced before the next unit is selected then it is
called sampling with replacement.
In this case the total number of samples that can be drawn = (N)n For E.g.: Let
Population = {a, b, c}
N = 3, let n = 2
No. of samples = {(a, b) (a, c) (b, c) (b, a) (c, a) (c, b) (a, a) (b, b) (c, c)}
N = 3, let n = 2
TYPES OF SAMPLING
A sample can be selected from a population in various ways. Different situations call for
different methods of sampling. There are three methods of Sampling:
3. Mixed Sampling
: 502 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
: 503 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
: 504 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
• Purposive Sampling;
• Cluster Sampling;
• Quota Sampling;
• Convenience Sampling;
• Sequential Sampling.
Purposive Sampling: Purposive sampling is the method of sampling by which a sample
is drawn from a population based entirely on the personal judgement of the investigator. It
is also known as Judgement Sampling or Deliberate Sampling. A randomness finds no
place in it and so the sample drawn under this method cannot be subjected to
mathematical concepts used in computing sampling error.
Cluster Sampling: Cluster Sampling involves arranging elementary items in a population
into hetrogeneous subgroups that are representative of the overall population. One such
group constitutes a sample for study.
Quota Sampling: In quota sampling method, quotas are fixed according to the basic
parameters of the population determined earlier and each field investigator is assigned
with quotas of number of elementary units to be interviewed.
Convenience Sampling: In convenience sampling, a sample is obtained by selecting
convenient population elements from the population.
Sequential Sampling: In sequential sampling a number of sample lots are drawn one
after another from the population depending on the results of the earlier samples draw
from the same population. Sequential sampling is very useful in Statistical Quality
Control. If the first sample is acceptable, then no further sample is drawn. On the other
hand if the initial lot is completely unacceptable, it is rejected straightway. But if the initial
lot is of doubtful and marginal character falling in the area lying between the acceptance
and rejection limits, a second sample is drawn and if need be a third sample of bigger
size may be drawn in order to arrive at a decision on the final acceptance or rejection of
the lot. Such sampling can be based on any of the random or non-random method of
selection.
3. Mixed Sampling : It is partly probabilistic and partly Non- probabilistic in
nature.
Systematic sampling comes under the category of Mixed Sampling
Advantages of Random (OR Probability) Sampling
: 505 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
From a population of size N, number of samples of size n can be drawn. These samples
will give different values of a statistic. E.g. if different samples of size n are drawn from a
population, different values of sample mean are obtained. The various values of a
statistic thus obtained, can be arranged in the form of a frequency distribution known as
Sampling Distribution. Thus we can have sampling distribution of sample mean x ,
sampling distribution of sample proportion p etc.
5. Errors in Sampling
Any statistical measure say, mean of the sample, may not be equal to the
corresponding statistical measure (mean) of the population from which the sample has
been drawn. Thus there can be discrepancies in the statistical measure of population,
i.e., parameter and the statistical measures of sample drawn from the same population
i.e., statistic. These discrepancies are known as Errors in Sampling.
6. Standard Error of a Statistic : Standard error is used to measure the variability of the
values of a statistic computed from the samples of the same size drawn from the
population, whereas standard deviation is used to measure the variability of the
observations of the population itself.
: 506 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
The standard deviation of the sample statistics is called standard error of that statistic.
E.g. if different samples of the same size n are drawn from a population, we get different
values of sample mean x . The S.D. of x . is called standard error of x . . It is obvious
that the standard error of x . will depend upon the size of the sample and the variability
of the population.
σ s
i) Standard error of sample mean SE ( x ) = or
n n
σ=Population S.D
and s=Sample S.D
P (1 − P ) p(1 − p)
ii) Standard error of proportion SE (p) = or
n n
Where P=Population proportion
P=Sample proportion
n
If i) Population size is Finite and the Sampling Fraction ≥.05
N
And ii) Samples are drawn Without Replacement(SRSWOR)
Then , each of the above formula for Standard Error will be multiplied by the factor
N −n
( Finite Population correction or Finite Population Multiplier)FPC
N −1
• Formula for standard Error when i) n<30( small sample)
: 507 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
σ
Small (n < 30) SD is known SE x =
n
s
Small (n < 30) SD is unknown SE x =
n-1
Most commonly used confidence level is 95% when no reference given use Z = 3
: 508 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
x - z SE ( x ) , x + z SE ( x )
p - Z SE (p) , p + Z SE (p)
The need for determination of the proper size of the sample is very great for practical
use in business where either the standard error is known on the basis of past
experience or where a given absolute level of accuracy is desired. If the sample size is
too large, more money and time have to be spent but the result obtained from the large
sample may not be more accurate than that from a smaller sample. On the other hand,
a valid conclusion may not be reached if the sample size too small. The method of
determining a proper size is given for the following two cases ;
Z 2σ 2
∴ n=
E2
Z 2 P (1 − P )
∴n=
E2
Here E = |P - p| i.e. the difference between the sample proportion p and the
population proportion P.
: 509 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
Simple random sampling with/ without replacement and estimation of sample proportion:
1. A population comprises 3 members 1, 5, 3. Draw all possible samples of size two (i) with
replacement (ii) without replacement
Find the sampling distribution of sample mean in both cases.
2. In simple random sampling with replacement, the total number of possible sample with
distinct permutation of member is:
(N = Size of Population, n = Sample size)
a) Nxn
b) Nn
c) N
d) n
3. In simple random sampling without replacement, the total number of possible sample with
distinct permutation of member is:
(N = Size of Population, n = Sample size)
a) Nn
b) P(N, n)
c) C(N,n)
d) None of the above
4. If from a population with 20 members, a random sample without replacement of 2
members is taken, the number of all such samples is :
a) 400
b) 190
c) 210
d) 200
5. If from a population with 25 members, a random sample with replacement of 2 members
is taken, the number of all such samples is:
a) 50
b) 300
: 510 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
c) 625
d) 125
6. If from a population with 25 members, a random sample without replacement of 3
members is taken, the number of all such sample is:
a) 3400
b) 1250
c) 3400
d) 2300
7. A random sample of 200 articles taken from a large batch of articles contains 25 defective
articles. What is the estimate of the proportion of defective articles in the entire batch?
a) 0.125
b) 0.075
c) 0.250
d) 0.025
8. A random sample of 200 articles taken from a large batch contains 15 defective articles.
What is the estimate of the sample proportions of defective articles?
a) 0.075
b) 0.02
c) 0.03
d) 0.06
: 511 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
d) 1.589
10. A simple random sample of size 64 is drawn from a finite population consisting of 122
units. If the population standard deviation is 16.8. Find the standard error of sample mean
when the sample is drawn with replacement.
a) 2.1
b) 2.9
c) 4.2
d) None of the above
11. A simple random sample of size 64 drawn from a finite population consisting of 122 units.
If the population standard deviation is 16.8, find the standard error of sample mean when
the sample is drawn without replacement.
a) 2.1
b) 2.9
c) 2.07
d) 1.45
12. A random sample of 400 oranges was taken from a large consignment and 52 were
found to be defective. The standard error of the population of defective ones in a sample
of this size is nearly .
a) 0.17
b) 0.0017
c) 0.017
d) 1.700
13. A simple random sample of size 9 is drawn without replacement from a finite population
consisting of 25 units. If the number of defective units in the sample be 5, then the
standard error of the sample proportion of defectives is:
a) 0.67
b) 0.53
c) 0.11
d) 0.135
: 512 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
15. A random sample of size 5 is taken from a population containing 100 units. If the sample
observations are 18, 7, 13, 10, 12 find an estimate of the standard error of sample mean,
if sampling is done without replacement.
a) 1.85
b) 1.00
c) 1.28
d) 1.78
17. A random sample of 50 items drawn by a particular population has a mean 30 with a S.D.
2.8, construct a 98% confidence interval estimate of the population mean.
18. The quality control manager of a tyre company has sample of 100 tyres and has found
the mean life time to be 30,214 km. The population S.D. is 860. Construct a 95%
confidence interval for the mean life time for this particular brand of tyres.
19. A pharmaceutical company wants to estimate the mean life of a particular drug under
typical weather conditions. Following results were obtained from a sample random
sample of 100 bottles of the drug.
Sample mean = 18 months
: 513 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
24. It is known that the population standard deviation in waiting time for L.P.G. gas cylinder in
Delhi is 15 days. How large a sample should be chosen to be 95% confident, the waiting
time is within 7 days of true average.
25. A manufacturing concern wants to estimate the average amount of purchase of its
product in a month by the customers whose standard deviation is Rs.10. Find the sample
size if the maximum error is not to exceed Rs.3 with a probability of 0.99.
26. Mr. X wants to determine on the basis of sample study, the mean time required to
complete a certain job so that he may be 95% confident that the mean may remain within
+ 2 days of the true mean. As per the available records the population variance is 64
days. How large should be sample be for his study?
: 514 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
27. In measuring reaction time, a psychologist estimated that the standard deviation is 1.08
seconds. What should be the size of the sample in order to be 99% confident that the
error of her estimates of mean would not exceed 0.18 seconds?
28. The incidence of a particular disease in an area in such that 20 per cent people of that
area suffers from it. What size of sample should be taken so as to ensure that the error of
estimation of the proportion should not be more than 5 per cent with 95 per cent
confidence?
1 Sampling is a process whereby we judge the characteristics or draw inference about the
totality or Universe (known as population) on the basis of judging the characteristics of a
selected portion taken from that totality (known as sample).
: 515 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
n
6. Sample Fraction : is called Sampling Fraction where n = Sample Size and N =
N
Population Size.
7. Complete enumeration or census : In case of enumeration, information is collected for
each and every unit the aggregate of all the units under consideration is called the
‘population’ or the ‘universe’. The results are more accurate and reliable but it involves lot
of time, money and man power.
: 517 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
: 518 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
11. A complete list of all the units in a finite population, properly numbered for identification, is
called a:
a) Universe
b) Sampling Data
c) Sampling Units
d) Sampling Frame
12. Statistical data may be collected by complete enumeration called
a) Sample Enquiry
b) Census Enquiry
c) Both a) and b) above
d) Neither a) nor b) above
13. A border patrol checkpoint which stops every passenger van is utilizing :
a) simple random sampling.
b) systematic sampling
c) systematic sampling.
d) complete enumeration
14. A population consisting of all the items which are physically present is called :
a) hypothetical
b) normal population
c) existent population
d) none of the above
: 519 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
: 520 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
22. Increase in reliability and accuracy of results from a sampling study with the increase in
sample size is known as the principle of:
a) statistical regularity
b) optimization.
c) law of increasing returns.
d) inertia of large numbers.
23. Sampling error increases with an increase in the size of the sample.
a) The above statement is true.
b) The above statement is not true.
c) Sampling error do not depends upon the sample size
d) None of the above
: 521 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
33. “Non - sampling errors are present both in census as well as a sample survey.’’ - State
whether the given statement is correct or not.
a) Correct
b) Incorrect
c) Nothing cannot be said
d) None of the above
34. There are more chances of non-sampling errors than sampling errors in case of :
a) studies of large samples.
b) complete enumeration.
c) inefficient investigators.
d) all of the above
Related MCQ’s
36. Values of a particular statistic with their relative frequencies will constitute
a) Probability Distribution
b) Sampling Distribution
c) Theoretical Distribution
d) None of these
: 523 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
40. The population standard deviation describes the variation among elements of the
universe, whereas, the standard error measures the:
a) variability in a statistic due to universe
b) variabillity in a statistic due to sampling
c) variablity in a parameter due to universe
d) variablity in a statistic due to parameter
41. As the units selected in two or more samples drawn from a population are not the same,
the value of a_______ varies from sample to sample, but the _________always remains
constant.
a) mean, standard deviation
b) statistic, standard deviation
c) statistic, parameter
d) parameter, statistic
: 524 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
: 525 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
For Example, we estimate Population Mean µ from Sample Mean x , Population S.D σ
from sample S.D s, Population proportion P from Sample proportion p
.2 ી’ and statistic by ‘t’
For the purpose of Estimation, parameter denoted by ‘ી
1. UNBIASEDNESS
a) If E(t) - θ = 0 or If E(t)= θ
: 526 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
4. Sufficiency :
It is said to be a sufficient estimator of a parameter θ, if it contains all information about
the parameter θ.
NOTE 1:
1. Minimum Variance Unbiased Estimator (MVUE)
(Minimum variance along with 0 bias)
A statistic, which is unbiased and has also minimum variance, that is most efficient, is
said to be MVUE.
NOTE 2:
Best Asymptotically Normal Estimator or Best Linear Unbiased Estimator (BLUE)
If the estimator is unbiased along with minimum variance and at the same time it is a linear
function of θ, that is,
t = f(θ), then it is called Best Linear unbiased estimator (BLUE)
• In interval estimation the confidence limits are the upper and lower limits of the
interval estimate.
Concept of level of significance ∝ : Level of significance is defined as the
significance of error in evaluating the confidence limits. Thus, Level of
significance, the Compliment of Level of Confidence.
Example 1: 5% level of significance =95% level of confidence
Example 2: 1% level of significance =99% level of confidence
Symbolically we expressed the fact that a parameter ી lies between two values C1
and C2 is 1- ∝
1- ∝
That is , P(C1 < θ < C2 )=1-∝
∝” is called the “ level of significance” and 1- ∝ is called the “level of
where “∝
confidence”.
• Sample size is determined from C.L. (Confidence Limits) of mean and proportions
Related MCQ’s:
48. The limits within which the parameter values are expected to lie can be determined by
using which of the following concepts:
a) Sampling Distribution
b) Probability Distribution
c) Standard Deviation
d) Standard Error.
: 527 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
σ N −n
c)
n N −1
d) None of the above
51. A single number that is used to estimate an unknown population parameter is known as:
a) Interval Estimate
b) Point Estimate
c) Estimate
d) Statistics
: 528 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
54. Which of the following(s) are the criterion for an ideal estimator:
I. Unbiased ness and Minimum Variance
II. Sufficiency
III. Consistency
IV. Efficiency
a) I, II and III above
b) I, II, and IV above
c) All of the above
d) I and II only.
59. If statistics T contains all the information about θ, then T is known to be a estimator of θ.
a) Efficient
b) Sufficient
c) Consistent
d) Systematic
63. If x1, x2 ...., xn is a simple random sample of size n from a finite population of N units with
a) E ( x ) = µ2
b) Var ( x ) = σ
c) E( x )=µ
d) All of the above are incorrect
: 530 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
65. The permissible sampling error required to determine sample size for
a) estimating a mean
b) estimating a proportion
c) both a) and b) above
d) none of the above
72. The difference between sample S.D. and the estimate of population S.D. is negligible if
the sample size is :
a) small
b) moderate
c) sufficiently large
d) none of these
73. The difference between the estimate from the sample and the parameter to be estimated
is:
a) sampling error
b) permissible sampling error.
c) confidence level
d) none of these.
: 532 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
78. The values of a characteristics x of a population containing six units are given by 2, 6, 5,
1, 7, 3. Take all possible samples of size two and find the mean of the sample means.
a) 1
b) 4
c) 2
d) 3
79. Let the five numbers 3, 4, 5, 6, 7 constitute a universe. Select all samples of size three
and compute the mean of the sample means. The value thus obtained is:
a) 3
b) 4
c) 5
d) 6
80. Which of the following(s) are the important sampling distribution often used in statistical
analysis?
a) Chi-square distribution
b) F-distribution
c) t-distribution
d) All of the above
Determination of confidence Limits for small samples (n<30 and population S.D σ is
unknown)
• In such a case the distribution follows T-distribution with (n-1) degrees of freedom
an accordingly
s
i) SE( x )= and
n −1
ii) Instead of using “Z” values we shall be using t values from the table for (n-
1) d.f. and at the desired level of confidence or significance
whre n = sample size
s
• ૄ)= x ±
C.L(ૄ .࢚ ࢻ
n − 1 ቀ ൗ,ିቁ
: 533 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
81. Which of the following distribution is used to test the significance if size of the sample is
less than 30 and variance of the population is unknown?
a) Standard Normal Distribution
b) Normal Distribution
c) “t’’ distribution
d) Chi-Square distribution
82. Which Sampling provides separate estimates for populations means for different
purposes and also an over all estimate?
a) Multistage sampling b) Simple random Sampling
c) Systematic sampling d) Stratified sampling
83. When we have an idea that the error might be involved, we use:
a) Point Estimate b) Interval Estimate
c) Both (a)and (b) d) None of these
85. When every member in population has an equal chance of being selection ,then that
sampling is called_______________
a) Restrictive b) Purposive
c) Subjective d) Non- restrictive
86. If every 9th unit is selected from universal set then this type of sampling is known as:
a) Quota sampling b) Systematic sampling
c) Stratified sampling d) None of these
87. The sampling is said to be large sampling if the size of the sample is:
a) Greater than or equal to 30 b) Less than 30
c) Less than or equal to 35 d) Less than 25
88. The method of sampling in which each unit of the population has an equal chance of
being selected in the sample is
a) Random Sampling b) Stratified sampling
c) Systematic sampling d) None of the above
90. In order to test the quality of chalks, the best suitable method wil be__________
a) Complete enumeration b) Simple random sampling
c) Systematic sampling d) Stratified sampling
91. In factory there are 48 employees with employee code from 1 to 48 the employer
desires to take the sample of every sixth employee under the systematic sampling
technique the sample size will be:
a) 6 b) 8 c) 10 d) 7
93. Method used to rest the human blood is called in Statistical terminology________
a) Census Investigation b) Blood Investigation
c) Sample Investigation d) None of these
101. Which would you prefer for ___________”Where testing destroys the quality of the
product”
(a) Full enumeration (b) sampling (c) both (d) none.
103. In control of book keeping and clerical errors Statistical sampling methods are used.
(a) true (b) false (c) both (d) none
106. Which would you prefer ____________when previous experience reveals a low rate of
error.
(a) Larger sample (b) Small sample (c) both (d) none
107. Cluster sampling is ideal in case the data are widely scattered.
(a) True (b) false (c) both (d) none.
108. Stratified random sampling is appropriate when the universe isnot homogenous.
(a) True (b) false (c) both (d) none.
109. In Stratified sampling, the sampling is subdivided into several parts, called
(a) Strata (b) Strati (c) Start (d) none
: 536 :
J. K. SHAH CLASSES Sampling Theory and Theory of Estimation
Theory Answers
1 B 2 A 3 A 4 C 5 C 6 D 7 A 8 A
9 B 10 A 11 A 12 B 13 D 14 C 15 A 16 B
17 A 18 C 19 C 20 B 21 A 22 D 23 B 24 B
25 B 26 D 27 C 28 D 29 C 30 C 31 C 32 C
33 A 34 D 35 C 36 B 37 D 38 C 39 D 40 B
41 A 42 B 43 C 44 A 45 C 46 C 47 A 48 D
49 A 50 C 51 B 52 A 53 D 54 C 55 C 56 A
57 B 58 C 59 B 60 B 61 C 62 D 63 C 64 C
65 C 66 A 67 C 68 A 69 C 70 B 71 A 72 C
73 B 74 B 75 C 76 A 77 B 78 B 79 C 80 D
81 C 82 D 83 B 84 D 85 B 86 B 87 A 88 A
89 D 90 B 91 B 92 B 93 B 94 C 95 B 96 B
97 A 98 C 99 C 100 B 101 B 102 A 103 A 104 B
105 A 106 B 107 A 108 A 109 A 110 C 111 C 112 A
: 537 :