Chapter1 Sampling Introduction
Chapter1 Sampling Introduction
Introduction
Statistical tools can be used on a data set to draw statistical inferences. These statistical inferences are
in turn used for various purposes. For example, the government uses such data for policy formulation
for the welfare of the people, and marketing companies use the data from consumer surveys to improve
the company and provide better services to the customer. Such data is obtained through sample surveys.
Sample surveys are conducted throughout the world by governmental and non-governmental agencies.
For example, “National Sample Survey Organization (NSSO)” conducts surveys in India, “Statistics
Canada” conducts surveys in Canada, and agencies of the United Nations like “The World Health
Organization (WHO), “Food and Agricultural Organization (FAO)” etc. conduct surveys in different
countries.
Sampling theory provides the tools and techniques for data collection, keeping in mind the objectives to
be fulfilled and the nature of the population.
Sample surveys collect information on a fraction of the total population, whereas census collects
information on the whole population. Some surveys, e.g., economic surveys, agricultural surveys, etc.,
are conducted regularly. Some surveys are need-based and are conducted when some need arises, e.g.,
consumer satisfaction surveys at a newly opened shopping mall to see the satisfaction level with the
amenities provided in the shopping mall.
Sampling unit:
An element or a group of elements on which the observations can be taken is called a sampling unit.
The objective of the survey is to determine the definition of the sampling unit.
Population:
The population is the collection of all the sampling units in a given region at a particular point in time
or a particular period. For example, suppose the medical facilities in a hospital are to be surveyed
through the patients. In that case, the total number of patients registered in the hospital during the
survey period will be the population. Similarly, if wheat production in a district is to be studied, then all
the fields cultivating wheat in that district will constitute the population. The total number of sampling
units in the population is the population size, generally denoted by N. The population size can be finite
or infinite (N is large).
Census:
The complete count of the population is called a census. The observations on all the sampling units in
the population are collected in the census. For example, in India, the census is conducted every tenth
year, and observations of all the persons staying in India are collected.
Sample:
One or more sampling units are selected from the population according to some specified procedure.
A sample consists only of a portion of the population units. Such a collection of units is called the
sample.
In the context of sample surveys, a collection of units like households, people, cities, countries, etc., is
called a finite population.
For example, if a population has 30% males and 70% females, we expect the sample to have nearly
30% males and 70% females.
Another example is if we remove a handful of wheat from 100 Kg. bag of wheat, we expect the same
quality of wheat in hand as inside the bag. Similarly, a drop of blood is expected to give the same
information as all the blood in the body.
Sampling frame:
The sampling frame comprises the list of all the units of the population to be surveyed. All the sampling
units in the sampling frame have identification particulars. For example, all the students in a particular
university listed and their roll numbers constitute the sampling frame. Similarly, the list of households
with the name of the head of the family or house address constitutes the sampling frame. In another
example, the residents of a city area may be listed in more than one frame - as per automobile
registration and the listing in the telephone directory.
For example, the units are selected based on the surveyor's personal judgment. The persons
volunteering to take some medical test or drink a new coffee type also constitute the sample based on
non-random laws.
For example, in order to conduct an experiment for a rare type of disease, the survey is continued till
the required number of patients with the disease is collected.
2. Organizaton of work:
It is easier to manage the organization of a collection of a smaller number of units than all the
units in a census. For example, to draw a representative sample from a state, it is easier to
manage to draw small samples from every city than drawing the sample from the whole state at
a time. This ultimately results in more accuracy in the statistical inferences because the better
organization provides better data, and in turn, improved statistical inferences are obtained.
3. Greater accuracy:
The persons involved in the collection of data are trained personnel. They can collect the data
more accurately if they have to collect a smaller number of units than a large number of units.
Type of surveys:
There are various types of surveys that are conducted based on the objectives to be fulfilled.
1. Demographic surveys:
These surveys are conducted to collect demographic data, e.g., household surveys, family size, number
of males in families, etc. Such surveys are helpful in the policy formulation for any city, state, or
country for the welfare of the people.
2. Educational surveys:
These surveys are conducted to collect educational data, e.g., how many children go to school, how
many persons are graduates, etc. Such surveys are conducted to examine educational programs in
schools and colleges. Generally, schools are selected first, and then the students from each school
constitute the sample.
3. Economic surveys:
These surveys are conducted to collect economic data, e.g., data related to the export and import of
goods, industrial production, consumer expenditure, etc. Such data is helpful in constructing the indices
indicating the growth in a particular sector of the economy or even the overall economic growth of the
country.
4. Employment surveys:
These surveys are conducted to collect employment-related data, e.g., employment rate, labour
conditions, wages, etc., in a city, state, or country. Such data helps in constructing various indices to
know the employment conditions among the people.
6. Agricultural surveys:
These surveys are conducted to collect agriculture-related data to estimate, e.g., the acreage and
production of crops, livestock numbers, use of fertilizers, use of pesticides, and other related topics.
Based on such surveys, the government bases its planning on food issues for the people.
7. Marketing surveys:
These surveys are conducted to collect data related to marketing. They are conducted by major
companies, manufacturers, or those who provide services to consumers, etc. Such data is used to know
consumers' satisfaction and opinions and develop sales, purchase, and promotional activities, etc.
8. Election surveys:
These surveys are conducted to study the outcome of an election or a poll. For example, such polls are
conducted in democratic countries to have people's opinions about any candidate contesting the
election.
2. Population to be sampled:
Based on the survey's objectives, decide the population from which the information can be obtained.
For example, the population of farmers is to be sampled for an agricultural survey. In contrast, the
patient population must be sampled to determine the medical facilities in a hospital.
3. Data to be collected:
It is important to decide which data is relevant for fulfilling the survey's objectives without omitting any
essential data. Sometimes, too many questions are asked, and some of their outcomes are never utilized.
This lowers the quality of the responses and, in turn, results in lower efficiency in the statistical
inferences.
5. Method of measurement:
The choice of measuring instrument and the method to measure the data from the population needs to
be specified clearly. For example, the data has to be collected through interviews, questionnaires,
personal visits, a combination of any of these approaches, etc. The forms in which the data is to be
recorded so that the data can be transferred to mechanical equipment for easily creating the data
summary, etc., are also needed to be prepared accordingly.
6. The frame:
7. Selection of sample:
The sample size needs to be specified for the given sampling plan. This helps determine and compare
different sampling plans' relative cost and time. The method and plan adopted for drawing a
representative sample should also be detailed.
8. The Pre-test:
It is advised to try the questionnaire and field methods on a small scale. This may reveal some troubles
and problems that the surveyor may face in large-scale surveys beforehand.
2. Personal interview:
The surveyor is supplied with a well-prepared questionnaire. The surveyor goes to the respondents and
asks the same questions mentioned in the questionnaire. The data in the questionnaire is then filled out
according to the respondents' responses.
3. Mail enquiry:
The well-prepared questionnaire is sent to the respondents through postal mail, e-mail, etc. The
respondents are requested to fill out the questionnaires and send them back. In the case of postal mail,
the questionnaires are often accompanied by a self-addressed envelope with postage stamps to avoid
any non-response due to the cost of postage.
4. Web-based enquiry:
The survey is conducted online through internet-based web pages. There are various websites that
provide such facilities. The questionnaires are to be in their formats, and the link is sent to the
respondents through e-mail. The respondent is brought to the concerned website by clicking on the link,
and the answers are to be given online. These answers are recorded, and responses, as well as their
statistics, are sent to the surveyor. The respondents should have an internet connection to support the
data collection with this procedure.
5. Registration:
The respondent is required to register the data at some designated place. For example, the number of
births and deaths, along with the details provided by the family members, are recorded at the city
municipal office which the family members provide.
The methods in (1) to (5) provide primary data, which means collecting the data directly from the
source. The method in (6) provides secondary data, which means getting the data from primary sources.