Sampling
Sampling
Question ?
▪ Who appears to constitute the population of
interest?
▪ Which type of sampling procedure best
describes that used by the Researcher?
▪ What are the limitations of this sampling
method, and in what specific ways could the
sampling method have affected the findings?
▪ Why is it so important to get the size of a
sample as close as possible to what is
"correct"?
Key Terms in Sampling
▪ Population
▪ Target population
▪ Sample
▪ Sampling Frame
▪ Sampling
▪ Statistics
▪ Parameter
▪ Sampling error
▪ Generalizability
▪ Biased sample
▪ Sample Design
Population
▪ In statistics, a population is the entire group of individuals,
objects, or events that a study is interested in.
▪ A population is a complete set of persons or objects that
possess some common characteristics that is of interest to
the researcher.
▪ It's also known as the target population or research
population.
▪ The population is defined by the research objectives and
the attributes being investigated.
▪ It refers to the entire group of individuals or objects to
which researchers are interested in generalizing the
conclusions
▪ Example –
▪ JNU Ph.D. students Research Methodology Understanding
▪ Hospital patients
Target Population
▪ Target Population represents specific subgroup within the
population that a researcher is interested in researching and
analysing.
▪ Students at a particular University -JNU
▪ Of a particular programme- Economics, Sociology, History
▪ Patient of disease- TB, Diabetes, Heart
▪ Employees of a particular sector –Formal and Informal Sector
▪ The target population consists of people or things that meet
the designated set of criteria of interest to the researcher.
▪ The group of individuals or items the researcher want to
apply their study results generally is know as the target
population.
▪ Target population depends on Research Objectives
Sample
▪ A sample is a subset of the population.
▪ The concept of sample arises from the inability of
the researchers to test all the individuals in a
given population.
▪ The sample must be representative of the
population from which it was drawn and it must
have good size to undergo statistical analysis.
Sampling frame
▪ A sampling frame is a list, database, or a source of
information that contains all the elements or members of the
target population from which a sample can be drawn.
▪ It serves as the actual source from which you select your
sample.
▪ Ideally, the sampling frame should include every member of
the target population, ensuring that each individual has a
chance of being selected in the sample.
▪ It is a list of all those within a population who can be
sampled, and may include individuals, households or
institutions.
1. Customer Satisfaction Survey
▪ Scenario: A company wants to assess customer
satisfaction for its new product.
▪ Target Population: All customers who purchased
the new product in the last three months.
▪ Sampling Frame: The company's sales database
listing all customers who made a purchase in the
last three months, including their contact details.
▪ Explanation: The sales database serves as the
sampling frame, as it contains the relevant subset
of the customer base that the company wants to
survey.
2.Academic Research on Student Behavior
▪ Scenario: A university researcher wants to study
the online learning behavior of students.
▪ Target Population: All undergraduate students
enrolled in the university.
▪ Sampling Frame: The university's student
registration records, which include all currently
enrolled undergraduate students.
▪ Explanation: The student registration records
provide a comprehensive list of all potential
participants, ensuring that the sample can be
drawn from all undergraduate students.
3.Health Study on Elderly Population
▪ Scenario: A health department wants to study
the prevalence of hypertension among elderly
residents in a city.
▪ Target Population: All residents aged 60 and
above in the city.
▪ Sampling Frame: The city’s healthcare
database or voter registration list that includes
information on residents' ages.
▪ Explanation: The healthcare database or voter
registration list acts as the sampling frame,
allowing the researchers to identify and sample
individuals who are 60 years and older.
4.A health study on diabetes prevalence
among adults in a city
▪ Population: All adults in the city.
▪ Target Population: Adults aged 40-60 in
the city.
▪ Sample: 500 adults aged 40-60 selected
from the target population.
▪ Sampling Frame: Voter registration lists or
healthcare records.
• Sampling frame vs. population
▪ The sampling frame is a list of all people or units from the population that
the sample is selected from. In contrast, the population is the whole
group or collection of humans, objects, or events that are being observed.
Sampling frame vs. target population
The sampling frame, a portion of the target population, is a reference for
researchers as they decide on and choose the proper sample. It describes
the list of people, homes, and other things that researchers have access to
or can utilize to select a sample from. The group of individuals or items the
researchers want to apply their study results generally is known as the
target population
Sampling
• Sampling is simply the process of learning
about the population on the basis of sample
drawn from it ( population). Therefore, in
sampling , only part of the population is
studied and the conclusions are drawn on
that basis for the entire population or
universe .
▪ Sampling unit - the unit of selection in the
sampling process
▪ Study unit (study subjects)- the unit on
which information is collected or on which
observations are made. E.g. Familiar examples
are families, towns, litters, branches of a
company, individual subjects or schools.
Target Population:
The population to be studied/ to which the investigator
wants to generalize his results
Sampling Unit:
smallest unit from which sample can be selected
Sampling frame
List of all the sampling units from which sample is drawn
Sample size
The number of units in a sample is called the
sample size.
Sample Design
A set of rules or procedures that specify how a
sample is to be selected. This can either be
probability or non-probability
What is sampling ?
Determine if a probability
3 or non-probability sampling Determine sample size 5
method will be chosen
Probability Nonprobability
sampling sampling
Sampling Design
Probability Nonprobability
▪ Simple random ▪ Convenience
sampling sampling
▪ Systematic random ▪ Judgment sampling
sampling ▪ Quota sampling
▪ Stratified random ▪ Snowball sampling
sampling
▪ Cluster sampling
▪ Multistage
Sampling
Probability Sampling
Probability sampling is that sampling in which every items
in the universe/population has known chance, of being
chosen for the sample . It implies the selection of sample
items is independent of person making the study . more to
say, the sample is selected on the basis of chance . There
is no bias
Non-Probability Sampling
Non- probability sampling methods are those
which do not provide every items in the
universe/population with a know chance of being
included in the sample
SIMPLE RANDOM SAMPLING
• Applicable when population is small, homogeneous &
readily available
• All subsets of the frame are given an equal probability.
Each element of the frame thus has an equal probability
of selection.
• It provides for greatest number of possible samples.
This is done by assigning a number to each unit in the
sampling frame.
Therefore,
➢ Samples are selected on the basis of chance
➢ Personal bias of the investigators does not influence the
sample selection
• A table of random number or lottery system is used to
determine which units are to be selected. 27
Lottery method
Under this method,
▪ all items of the universe are numbered or
named on separate slips of paper of
identical size and shape.
▪ These slips are then folded and mixed up in
a container.
▪ A blindfold selection is then made of the
number of slips required to constitute the
desired sample size.
▪ The selection items thus depends entirely
on chance.
Random number tables
▪ Random number tables consist of a randomly generated
series of digits (0-9).
▪ To make them easy to read there is typically a space
between every 4th digit and between every 10th row.
▪ When reading from random number tables you can begin
anywhere (choose a number at random) but having once
started you should continue to read across the line or down
a column and NOT jump about.
Here is an extract from a table of random sampling
numbers:
• 3680 2231 8846 5418 0498 5245
7071 2597
• If we were doing market research and wanted to sample two houses
from a street containing houses numbered 1 to 48 we would read off
the digits in pairs
36 80 22 31 88 46 54 18 04 98 52 45 70 71 25
97
and take the first two pairs that were less than 48, which gives house
numbers 36 and 22.
• If we wanted to sample two houses from a much longer road with 140
houses in it we would need to read the digits off in groups of three:
368 022 318 846 541 804 985 245 707 1 25 97
and the numbers underlined would be the ones to visit: 22 and 125
Systematic Random Sampling
Procedure:
Procedure:
• Divide population into clusters.
• Randomly sample clusters.
• Measure all units within sampled
clusters.
Cluster (Area) Random Sampling
Sample size is
( 1st equation)
Cochran’s formula for calculating sample size when the
population is finite
▪ But in this case, the sample size (666) exceeds 5% of the population size
(13191).
▪ So, we need to use the correction formula to calculate the final sample size
▪ Here, N = 13191, 0 n = 666 ( using formua 1)
• But, if the sample size is calculated at 95% confidence level with margin of
error equal to (0.05), the sample size become 384 which does not need
correction formula. So, in this case the representative sample size for our
study is 384
Yamane’s formula for calculating sample size
▪ Yamane suggested another simplified formula for
calculation of sample size from a population which is
an alternative to Cochran’s formula
▪ According to him, for a 95% confidence level and p =
0.5 , size of the sample should be