Unit No 2
Unit No 2
1. Population
The entire group of individuals or items you want to study or draw conclusions about.
2. Sample
3. Sampling Frame
4. Sampling Techniques
a. Probability Sampling
Stratified Sampling: Dividing the population into strata (groups) and sampling from each
group proportionately.
Cluster Sampling: Dividing the population into clusters and sampling entire clusters.
b. Non-Probability Sampling
Not every individual has an equal chance of being selected.
5. Sample Size
The number of individuals included in the sample. Larger samples generally yield more
accurate results but require more resources.
6. Sampling Error
The difference between the population parameter and the sample statistic.
Example: The average age of a sample differs slightly from the actual average age of the
population.
7. Representative Sample
Example: If 60% of the population is female, the sample should also have approximately 60%
females.
Diagrammatic Representation:
Population: [ ● ● ● ● ● ● ● ● ● ● ● ● ]
Sample: [● ● ● ● ●]
Sampling Techniques
│ ├── Stratified
│ └── Cluster
├── Convenience
├── Judgmental
├── Quota
└── Snowball
Procedure of Sampling
1. Define the Objective
o Identify the purpose of the study and the population characteristics to focus
on.
o Example: Determine the average household income in a city.
2. Identify the Target Population
o specify the group from which the sample will be drawn.
o Example: All households in the city.
3. Develop a Sampling Frame
o Create a list of all the elements in the population. This serves as the database
for selecting the sample.
o Example: A city’s census data or voter list.
4. Choose a Sampling Method
o Decide between Probability Sampling (e.g., random sampling) or Non-
Probability Sampling (e.g., convenience sampling) based on the study
objectives.
o Example: Randomly selecting 500 households from the voter list.
5. Determine the Sample Size
o Use statistical formulas or guidelines to decide the subjects for reliable
results.
o Example: Calculate that a sample of 500 households provides a 95%
confidence level with a 5% margin of error.
6. Select the Sample
o Apply the chosen sampling method to extract the sample from the sampling
frame.
o Example: Using a random number generator to pick households.
7. Collect Data
o Conduct surveys, interviews, or observations to gather information from the
sample.
o Example: Distribute income questionnaires to selected households.
8. Analyze Data and Generalize Findings
o Use statistical tools to analyse the data and infer conclusions about the entire
population.
o Example: Conclude the average income of city residents.
Implications of Sampling
1. Advantages
o Cost-Effective: Reduces expenses by analysing a smaller group instead of the
whole population.
o Time-Saving: Faster to collect and analyse data from a sample.
o Feasibility: Enables studies on large populations that are otherwise
impractical.
o Detailed Analysis: Resources can be focused on smaller, high-quality data
collection.
2. Limitations
o Sampling Error: The sample may not perfectly represent the population,
leading to inaccuracies.
o Bias: Improper sampling methods can introduce bias, affecting reliability.
o Generalization Issues: Findings may not apply to the population if the sample
isn’t representative.
3. Ethical Implications
o Researchers must ensure the sampling process is fair and unbiased and
respect participants’ privacy and consent.
o Ethical issues can arise if vulnerable populations are excluded or exploited.
4. Practical Applications
o Sampling is widely used in business (market research), healthcare (drug
trials), government (policy surveys), and academia (social studies).
Would you like help with examples, diagrams, or statistical methods used in sampling?
3. Compare census and sample survey. Explain the sampling procedure in detail.
Sampling involves the systematic selection of a subset of the population. The detailed procedure
includes the following steps:
1. Define the Research Objective
Clearly outline the purpose of the study and the information needed.
Example: All customers who visited the retail chain in the last six months.
Create a list or database of all the members of the population. This forms the foundation for
selecting the sample.
Choose the method based on the study objectives, available resources, and the need for
randomness or bias minimisation.
o Stratified Sampling: Population divided into subgroups (strata), and samples taken
from each stratum.
o Cluster Sampling: Divides the population into clusters, and entire clusters are
sampled.
o Population size.
Example: Calculate that 400 participants are needed for a population of 10,000.
Apply the chosen method to extract the sample from the sampling frame.
7. Data Collection
Use surveys, interviews, or experiments to collect information from the selected sample.
8. Analyze Data
Use statistical tools to process and analyse the data. Conclude the entire population based
on the sample.
Assess potential errors (e.g., sampling error, non-sampling error) and take corrective
measures if necessary.
Extend findings from the sample to the population, considering limitations and assumptions.
Example: Conclude customer satisfaction trends for the retail chain based on the sample
data.
Sampling design refers to the method used to select a sample from a population for research or
analysis. There are two primary types of sampling designs: Probability Sampling (where each
element has a known, non-zero chance of selection) and Non-Probability Sampling (where some
elements may not have a known chance of selection). The choice of sampling design depends on the
study's objective, accuracy requirements, resources, and population characteristics.
In probability sampling, every population element has a known chance of being selected. These
methods are often used when generalizability and unbiased estimates are critical.
Procedure: Use a random method like a random number generator or drawing names to
select the sample.
b. Systematic Sampling
Procedure: Pick every nth element after selecting a random starting point.
Example: If you need a sample of 100 from a list of 1,000, pick every 10th individual.
c. Stratified Sampling
The population is divided into subgroups (strata) based on characteristics (e.g., age, gender).
Samples are drawn from each subset.
d. Cluster Sampling
The population is divided into clusters (often geographically). Some clusters are randomly
selected, and all members are surveyed.
Example: Sampling several schools in a city and surveying all students in those schools.
e. Multistage Sampling
Example: First, cluster sampling will be used to choose regions, and then stratified sampling
will be used to select districts within the areas.
In non-probability sampling, not every element has a known chance of being selected. This approach
is less rigorous but more practical for exploratory studies.
a. Convenience Sampling
The researcher selects participants based on their judgment of who would provide the most
helpful information.
c. Quota Sampling
The researcher ensures that specific subgroups are represented in the sample based on
predefined quotas.
Example: Ensuring the sample includes 30% males and 70% females.
d. Snowball Sampling
Participants refer other participants, which is helpful for hard-to-reach or niche populations.
Example: Researching a rare medical condition by asking initial participants to refer others.
The calculation for sample size varies depending on whether the population is finite or infinite. The
formula for estimating sample size is based on desired confidence levels, margin of error, and
population size.
The sample size formula is simpler when the population is infinite or sufficiently large (e.g., more
than 30,000).
Formula:
Where:
n = Sample size
Example:
So, you need a sample size 384 for an infinite or large population.
When the population is finite (i.e., the total number of individuals is known), we adjust the sample
size using the finite population correction (FPC) factor.
Formula:
Where:
Example:
So, you need a sample size 370 for a finite population of 10,000.
Would you like further clarification on any specific sampling design example or help solve a problem
using these formulas?
5. Elaborate on various probabilistic sampling types. Also, explain the advantages &
disadvantages of probability
Probabilistic sampling, also known as random sampling, is a method where each member of the
population has a known, non-zero chance of being selected. This approach ensures that the sample
accurately represents the population, yielding more reliable and generalisable results.
Description: In this method, each individual in the population has an equal chance of being
selected. It is the most straightforward and essential form of probabilistic sampling.
How it Works: A sample is selected randomly using a random number generator or a random
draw (e.g., lottery method).
Advantages:
o Easy to implement and understand.
Disadvantages:
o Requires a complete and accurate list of the population (sampling frame), which
might be hard to obtain.
o This may lead to samples not representative of the entire population, especially if
the population is large and heterogeneous.
2. Systematic Sampling
Description: Systematic sampling involves selecting every nth member from the population
after a random starting point. The interval (n) is determined by dividing the total population
by the desired sample size.
How it Works: First, select a random starting point, then choose every kth element from the
population list.
Example: If you have a list of 1,000 individuals and need a sample of 100, you select every
10th person on the list.
Advantages:
Disadvantages:
o If the population has an inherent pattern, it can lead to bias (e.g., if every 10th
individual shares similar characteristics).
3. Stratified Sampling
Description: Stratified sampling divides the population into distinct subgroups (strata) with
similar characteristics (e.g., age, gender, income level). Then, a random sample is drawn from
each stratum. This method ensures that every subgroup is represented.
How it Works: The population is divided into homogeneous strata, and then a random
sample is taken from each stratum, proportionally or equally.
Advantages:
o Ensures representation of all relevant subgroups, improving the estimates' precision.
Disadvantages:
4. Cluster Sampling
Description: In cluster sampling, the population is divided into clusters (often based on
geographic location or other natural divisions), and then some of these clusters are randomly
selected. All individuals within the chosen clusters are surveyed.
How it Works: The population is divided into clusters; some clusters are selected randomly,
and all members of the selected clusters are included in the sample.
Advantages:
o Easier to implement than other methods, especially when a sampling frame is not
readily available.
Disadvantages:
o Higher sampling error can be introduced if clusters are heterogeneous (i.e., different
within each cluster).
5. Multistage Sampling
Description: This combines several sampling methods (e.g., cluster sampling followed by
simple random sampling within clusters). It's used when the population is large and diverse.
How it Works: First, a random sample of clusters is selected. Then, within the selected
clusters, another form of random sampling (e.g., simple random or stratified) is applied to
choose individual elements.
Example: First, randomly select several districts (clusters) in a city, then choose households
within those districts using simple random sampling.
Advantages:
Advantages:
1. Objectivity:
o Probability sampling minimises selection bias since every member of the population
has a known and non-zero chance of being selected.
2. Generalizability:
3. Precision:
o Probability sampling methods, such as stratified sampling, can improve the accuracy
of estimates by ensuring that specific subgroups of interest are represented.
4. Statistical Analysis:
o With probability sampling, the statistical properties of the sample (e.g., mean,
variance) can be used to estimate the population parameters and calculate
confidence intervals.
5. Lower Bias:
o Since the sample is selected randomly, the likelihood of bias is reduced compared to
non-probability sampling methods.
Disadvantages:
1. Cost:
2. Time-Consuming:
3. Complexity:
o Some probability sampling methods, such as stratified and multistage sampling, are
complex and require careful planning and execution to ensure the sample is
representative.
o For probability sampling to work effectively, a complete and accurate list of the
population is necessary, which may not always be available or feasible to compile.
o If the strata or clusters are not well-defined, some groups may be over- or under-
represented, affecting the study’s validity.
Would you like further elaboration on a specific sampling method or a detailed example of applying
one of these techniques?
Probability sampling is a type of sampling method in which each member of the population has a
known, non-zero chance of being selected. It ensures that the sample accurately represents the
population, making it possible to generalise the findings to the larger group with a known confidence
level. The core principle of probability sampling is random selection, where every individual has an
equal or known chance of being included in the sample.
1. Simple Random Sampling: Every individual has an equal chance of being selected.
2. Systematic Sampling: Selection is based on a fixed interval (e.g., every nth individual from a
list).
3. Stratified Sampling: The population is divided into subgroups (strata), and a sample is taken
from each subgroup.
4. Cluster Sampling: The population is divided into clusters, some selected, and all individuals
within the selected clusters are surveyed.
Advantages:
1. Unbiased Representation: Since each individual has a known chance of being selected, the
sample is more likely to represent the population.
2. Generalizability: The results can be generalised to the entire population, provided the
adequate sample size.
3. Statistical Analysis: Allows advanced statistical methods, such as confidence intervals and
hypothesis testing, to infer results about the population.
Disadvantages:
1. Cost: Probability sampling methods, especially those involving large populations, can be
expensive and require a lot of resources.
4. Requires Complete Population List: For methods like simple random sampling, a complete
list of the population is needed, which may not always be available.
Advantages:
1. Cost-Effective: Generally cheaper and faster than probability sampling because it doesn’t
require a sampling frame or random selection.
2. Flexible: This can be more practical for qualitative research or exploratory studies.
3. Convenient: Quick to implement when the researcher needs to access a specific group or has
limited resources.
Disadvantages:
1. Bias: The sample may not be representative, as it’s chosen based on convenience or
judgment.
3. Lack of Statistical Precision: Non-probability sampling doesn't allow for the same statistical
inferences (e.g., confidence intervals) as probability sampling.
Summary Table
Aspect Probability Sampling Non-Probability Sampling
Random, with a known probability Non-random, based on
Selection
of selection. judgment or convenience.
Bias Low bias. There is a high risk of bias.
More cost-effective and
Cost Expensive and time-consuming.
quicker.
Results can be generalised to the
Generalizability Limited generalizability.
population.
High accuracy and
Accuracy Lower accuracy.
representativeness.
Statistical Analysis Suitable for statistical inference. Limited statistical analysis.
Would you like more detailed examples of when to use each type of sampling? Or perhaps a case
study to illustrate how probability sampling can be applied?