RMUnit3C2
RMUnit3C2
Chapter 2:
Sampling Design
Census and Sample Survey
Two costs are involved in a sampling analysis viz., the cost of collecting
the data and the cost of an incorrect inference resulting from the data.
Researcher must keep in view the two causes of incorrect inferences
viz., systematic bias and sampling error.
A systematic bias results from errors in the sampling procedures, and it
cannot be reduced or eliminated by increasing the sample size.
At best the causes responsible for these errors can be detected and
corrected.
Criteria of Selecting a Sampling Procedure (Contd.)
Usually, a systematic bias is the result of one or more of the following factors:
Non-respondents
If we are unable to sample all the individuals initially included in the sample, there may
arise a systematic bias. The reason is that in such a situation the likelihood of establishing
contact or receiving a response from an individual is often correlated with the measure of
what is to be estimated.
• wealthier individuals may be less likely to participate in income surveys, skewing results.
Indeterminancy principle
Sometimes we find that individuals act differently when kept under observation than what
they do when kept in non-observed situations. Indeterminancy principle may also be a
cause of a systematic bias.
• Workers may deliberately slow down in a time-study exercise if they suspect it will lead
to increased work quotas.
Criteria of Selecting a Sampling Procedure (Contd.)
Natural bias in the reporting of data
Natural bias of respondents in the reporting of data is often the cause of a systematic bias
in many inquiries.
There is usually a downward bias in the income data collected by government
taxation department, whereas we find an upward bias in the income data collected
by some social organisation. People in general understate their incomes if asked
about it for tax purposes, but they overstate the same if asked for social status.
Generally in psychological surveys, people tend to give what they think is the
‘correct’ answer rather than revealing their true feelings.
Sampling errors are the random variations in the sample estimates around the true
population parameters. Sampling error decreases with the increase in the size of
the sample.
Sampling error can be measured for a given sample design and size. The
measurement of sampling error is usually called the ‘precision of the sampling plan’.
If we increase the sample size, the precision can be improved.
While selecting a sampling procedure, researcher must ensure that the procedure causes a
relatively small sampling error and helps to control the systematic bias in a better way.
Characteristics of a Good Sample Design
• When each sample element is drawn individually from the population at large, then the
sample so drawn is known as ‘Unrestricted sample’, whereas all other forms of sampling are
covered under the term ‘restricted sampling’.
Different Types of Sample Designs
• Non-probability Sampling
Applications:
1. Small-Scale Studies: Ideal for pilot studies, exploratory research, or situations with
limited resources.
2. Preliminary Research: Useful for generating hypotheses and understanding trends
before larger studies.
3. Convenience Sampling: Situations where representative sampling is not feasible due to
constraints of time, budget, or access.
Different Types of Sample Designs
• Probability Sampling
– Under this, every item of the universe has an equal chance of inclusion in the
sample.
– Random sampling ensures the law of Statistical Regularity which states that if
on an average the sample chosen is a random one, the sample will have the
same composition and characteristics as the universe.
– This is the reason why random sampling is considered as the best technique of
selecting a representative sample.
Types of Probability Sampling:
For example: Random selection of 20 students from class of 50 student. Each student has equal chance of getting
selected. Here probability of selection is 1/50
4.Systematic Sampling:
Selecting every kth item from a list, where k is a fixed interval.
5.Cluster Sampling:
Dividing the population into clusters and randomly selecting entire clusters for study.
6.Multi-Stage Sampling:
A complex form of cluster sampling conducted in multiple stages.
How to Select a Random Sample?
• Random sampling involves ensuring that each element in the
population has an equal and independent chance of being selected.
• Below are methods and procedures for selecting a random sample in
practice.
• Lottery Method
• Successive Drawing without Replacement
• Using Random Number Tables
• a. Lottery Method
• Process:
• Write each element of the population on a slip of paper.
• Place all slips into a container.
• Mix thoroughly and draw slips one by one without looking until the
required sample size is reached.
• Drawbacks:
• Impractical for large or complex populations.
• Limited utility in modern, large-scale sampling scenarios.
How to Select a Random Sample? (Contd..)
b. Successive Drawing without Replacement
How to Select a Random Sample? (Contd..)
c. Using Random Number Tables
The 20 dice rolls constitute a random sample from the infinite population of all possible
dice rolls.
Complex Random Sampling Designs
1. Systematicsampling
2. Stratified sampling
3. Clustersampling
4. Areasampling
5. Multi-stage sampling
7. Sequential sampling
Complex Random Sampling Designs(Contd..)
• Systematic sampling
– Fixed Intervals: After selecting the first item, every k-th item is included
in the sample. For instance, if a 4% sample is needed, k will be 25 (as 4%
of 100 is 1/25). The first item is selected randomly from the first 25, and
then every 25th item is added to the sample.
Complex Random Sampling Designs (Contd..)
Stratified sampling
Example:
A study on education levels may divide the population by age group,
gender, or location.
Complex Random Sampling Designs(Contd..)
Stratified sampling
The following three questions are highly relevant in the context of stratified
sampling:
•Pilot Studies:
• Conduct small-scale surveys to test proposed strata.
• Evaluate variances within and between strata to refine the stratification plan.
• A careful stratification plan improves sampling efficiency by reducing
sampling error.
Complex Random Sampling Designs(Contd..)
How Should Items Be Selected From Each Stratum?
• The selection method within each stratum typically involves:
• Simple Random Sampling: Ensures every item in the stratum has an
equal chance of selection.
• Systematic Sampling: Can be employed when it offers logistical or
analytical advantages over random sampling.
• The choice depends on the research context, the nature of the population,
and available resources.
• Cluster sampling
• Area sampling
• Multi-stage sampling
– In case the cluster sampling units do not have the same number of
elements, it is considered appropriate to use a random selection
process where the probability of each cluster being included in the
sample is proportional to the size of the cluster.
Complex Random Sampling Designs(Contd..)
• Sequential sampling
– This sampling design is some what complex sample design.
– The ultimate size of the sample under this technique is not fixed
in advance, but is determined according to mathematical
decision rules on the basis of information yielded as survey
progresses
The following are the number of departmental stores in 15 cities: 35, 17, 10, 32, 70, 28, 26, 19,
26, 66, 37, 44, 33, 29 and 28. If we want to select a sample of 10 stores, using cities as clusters
and selecting within clusters proportional to size, how many stores from each city should be
chosen? (Use a starting point of 10).
Step-by-Step Solution:
2. Total Sample Size (n): You need to select a sample of 10 stores from these 500 stores.
3. Sampling Interval (k): To select the sample proportional to the size of the cities, we need
to calculate the sampling interval (k), which is the number of stores in the population divided
by the sample size:
Using proportional sampling based on size, the sample of 10 stores is selected as follows:
2 stores from city 5,1 store from each of the following cities: 1, 3, 7, 9, 10, 11, 12, 13, and 14.
This ensures that the number of stores selected from each city is proportional to the number
of stores in the city.