Unit 2 RESEARCH DESIGN, SAMPLING AND DATA COLLECTION
Unit 2 RESEARCH DESIGN, SAMPLING AND DATA COLLECTION
1
2
3
4
5 Object
Object 14
10
11
12
13
7
8
9
Definitions of Research Design - Research design refers to the overall plan and
structure for conducting a research study, outlining the methods, procedures, and
strategies to address the research questions or objectives effectively.
A research design is not only a work plan, it also specifies the type of data required to
address the research problem. It ensures that the data analysis and the conclusions,
lead to answering the initial questions of researcher in a simple way. Therefore, at
research design provides the structure of a research in such a way that it provides
relevant outcomes economically.
1) Objectivity :
Objectivity refers to the ability of the research instruments to give conclusions that
are free from observer's personal biases. A good research design should be able select
those instruments only that provide objective conclusions. Usually, it is believed that
maintaining objectivity is pretty easy, but it proves to be difficult during execution of
research and data analysis.
2) Reliability :
Another essential feature of a good research design is the reliability of responses. The
instruments used in research should be able to provide similar responses to a question
asked from a respondent. If the response varies, the instrument is considered
unreliable. In other words, reliability of research design is measured in terms of
consistency in responses.
3) Validity :
An important characteristic of a good research design is its ability to answer the
questions in the way it was intended to. It should focus on the objective of the
research and make specific arrangements or plan for achieving that objective.
For example, when a research is conducted to measure the effects of advertisements
in viewers, it should be able to answer this, and not the sale of a particular product.
4) Generalizability :
A research design is said to be generalizable if the outcome of the research is
applicable on a bigger population from which the sample is selected. A research
design can be made generalizable by properly defining the population properly,
selecting the sample carefully, analyzing the statistical data appropriately, and by
preparing it methodologically. Therefore, the more the outcomes are generalizable,
more efficient is the research design.
5) Sufficient Information :
Any research is conducted to gain insight of the hidden facts, figures and information.
The research design should be able to provide sufficient information to the researcher
so that he can analyze the research problem in a broad perspective. The research
design should be able to identify the research problem and research objective.
6) Other Features :
Along with the above, there are some other features also that make a research design
good. These are adaptability, flexibility. efficiency, etc. A good research design
should be able to minimize the errors and maximize the accuracy.
It is considered that a good research design should reduce the biasness while should
maximize the reliability of data being collected and analyzed. A good research design
should provide the opportunity as per the various aspects of research problem. It
should minimize the experimental error and should provide maximum information.
Hence, it can be concluded the selection of research design relies upon the research
problem and the nature of research. Following are the major features of a good
research design :
Need for Research Design.
1) Reduces Cost :
Research design is needed to reduce the excessive costs in terms of time, money and
effort by planning the research work in advance.
For example, if a cell phone manufacturer wants to conduct a feature research study
among students in US Universities. An in-depth research study must be conducted if
the researcher is looking for features that the students use, features they would like to
see, and the price they are willing to pay.
This step is imperative to understand the features that need development, the features
that require an upgrade, the device’s pricing, and the go-to-market strategy.
In the year 2016/17 alone, there were 24.7 million students enrolled in universities
across the US. It is impossible to research all these students; the time spent would
make the new device redundant, and the money spent on development would render
the study useless.
Census: A complete study of all the elements present in the population is known as
a census. It is a time consuming and costly process and is, therefore, seldom a
popular with researchers. The general notion that a census generates more accurate
data than sampling is not always true. Limitations include failure in generating a
complete and accurate list of all the members of the population and refusal of the
elements to provide information. The national population census is an example of
census survey.
Bias: Bias is the term refers to how far the average statistic lies from the parameter
it is estimating, that is, the error, which arises when estimating a quantity. Errors from
chance will cancel each other out in the long run, those from bias will not. Bias can
take different forms.
Process of Sampling
An operational sampling process can be divided into seven steps as given below:
Defining the population of interest, for business research, is the first step in sampling
process. In general, target population is defined in terms of element, sampling unit,
extent, and time frame. The definition should be in line with the objectives of the
research study. For ex, if a kitchen appliances firm wants to conduct a survey to
ascertain the demand for its micro ovens, it may define the population as ‘all women
above the age of 20 who cook (assuming that very few men cook)’. However this
definition is too broad and will include every household in the country, in the
population that is to be covered by the survey. Therefore the definition can be further
refined and defined at the sampling unit level, that, all women above the age 20, who
cook and whose monthly household income exceeds Rs.20,000. This reduces the
target population size and makes the research more focused. The population
definition can be refined further by specifying the area from where the researcher has
to draw his sample, that is, households located in Hyderabad.
A well defined population reduces the probability of including the respondents who
do not fit the research objective of the company. For ex, if the population is defined
as all women above the age of 20, the researcher may end up taking the opinions of a
large number of women who cannot afford to buy a micro oven.
Once the definition of the population is clear a researcher should decide on the
sampling frame. A sampling frame is the list of elements from which the sample may
be drawn. Continuing with the micro oven ex, an ideal sampling frame would be a
database that contains all the households that have a monthly income above
Rs.20,000. However, in practice it is difficult to get an exhaustive sampling frame
that exactly fits the requirements of a particular research. In general, researchers use
easily available sampling frames like telephone directories and lists of credit card and
mobile phone users. Various private players provide databases developed along
various demographic and economic variables. Sometimes, maps and aerial pictures
are also used as sampling frames. Whatever may be the case, an ideal sampling frame
is one that entire population and lists the names of its elements only once.
A sampling frame error pops up when the sampling frame does not accurately
represent the total population or when some elements of the population are missing
another drawback in the sampling frame is over —representation. A telephone
directory can be over represented by names/household that have two or more
connections.
3. Specifying the Sampling Unit:
A sampling unit is a basic unit that contains a single element or a group of elements
of the population to be sampled. In this case, a household becomes a sampling unit
and all women above the age of 20 years living in that particular house become the
sampling elements. If it is possible to identify the exact target audience of the
business research, every individual element would be a sampling unit. This would
present a case of primary sampling unit. However, a convenient and better means of
sampling would be to select households as the sampling unit and interview all
females above 20 years, who cook. This would present a case of secondary sampling
unit.
The sampling method outlines the way in which the sample units are to be selected.
The choice of the sampling method is influenced by the objectives of the business
research, availability of financial resources, time constraints, and the nature of the
problem to be investigated. All sampling methods can be grouped under two distinct
heads, that is, probability and non-probability sampling.
The sample size plays a crucial role in the sampling process. There are various ways
of classifying the techniques used in determining the sample size. A couple those hold
primary importance and are worth mentioning are whether the technique deals with
fixed or sequential sampling and whether its logic is based on traditional or Bayesian
methods. In non-probability sampling procedures, the allocation of budget, thumb
rules and number of sub groups to be analyzed, importance of the decision, number
of variables, nature of analysis, incidence rates, and completion rates play a major
role in sample size determination. In the case of probability sampling, however,
formulas are used to calculate the sample size after the levels of acceptable error and
level of confidence are specified. The details of the various techniques used to
determine the sample size will be explained at the end of the chapter.
In this step, the specifications and decisions regarding the implementation of the
research process are outlined. Suppose, blocks in a city are the sampling units and the
households are the sampling elements. This step outlines the modus operandi of the
sampling plan in identifying houses based on specified characteristics. It includes
issues like how is the interviewer going to take a systematic sample of the houses.
What should the interviewer do when a house is vacant? What is the recontact
procedure for respondents who were unavailable? All these and many other questions
need to be answered for the smooth functioning of the research process. These are
guide lines that would help the researcher in every step of the process. As the
interviewers and their co-workers will be on field duty of most of the time, a proper
specification of the sampling plans would make their work easy and they would not
have to revert to their seniors when faced with operational problems.
This is the final step in the sampling process, where the actual selection of the sample
elements is carried out. At this stage, it is necessary that the interviewers stick to the
rules outlined for the smooth implementation of the business research. This step
involves implementing the sampling plan to select the sampling plan to select a
sample required for the survey.
Methods of Sampling –
The sampling method or sampling technique is the process of studying the
population by gathering information and analyzing that data. It is the basis of the data
where the sample space is enormous.
There are several different sampling techniques available, and they can be subdivided
into two groups. All these methods of sampling may involve specifically targeting
hard or approach to reach groups.
Systematic Sampling
In the systematic sampling method, the items are selected from the target population
by selecting the random selection point and selecting the other methods after a fixed
sample interval. It is calculated by dividing the total population size by the desired
population size.
Example:
Suppose the names of 300 students of a school are sorted in the reverse alphabetical
order. To select a sample in a systematic sampling method, we have to choose some
15 students by randomly selecting a starting number, say 5. From number 5 onwards,
will select every 15th person from the sorted list. Finally, we can end up with a
sample of some students.
Stratified Sampling
In a stratified sampling method, the total population is divided into smaller groups to
complete the sampling process. The small group is formed based on a few
characteristics in the population. After separating the population into a smaller group,
the statisticians randomly select the sample.
For example, there are three bags (A, B and C), each with different balls. Bag A has
50 balls, bag B has 100 balls, and bag C has 200 balls. We have to choose a sample of
balls from each bag proportionally. Suppose 5 balls from bag A, 10 balls from bag B
and 20 balls from bag C.
Clustered Sampling
In the clustered sampling method, the cluster or group of people are formed from the
population set. The group has similar significatory characteristics. Also, they have an
equal chance of being a part of the sample. This method uses simple random
sampling for the cluster of population.
Example:
An educational institution has ten branches across the country with almost the
number of students. If we want to collect some data regarding facilities and other
things, we can’t travel to every unit to collect the required data. Hence, we can use
random sampling to select three or four branches as clusters.
All these four methods can be understood in a better manner with the help of the
figure given below. The figure contains various examples of how samples will be
taken from the population using different techniques.
What is Non-Probability Sampling?
The non-probability sampling method is a technique in which the researcher selects
the sample based on subjective judgment rather than the random selection. In this
method, not all the members of the population have a chance to participate in the
study.Non-Probability Sampling Types
Non-probability Sampling methods are further classified into different types, such as
convenience sampling, consecutive sampling, quota sampling, judgmental sampling,
snowball sampling. Here, let us discuss all these types of non-probability sampling in
detail.
Convenience Sampling
In a convenience sampling method, the samples are selected from the population
directly because they are conveniently available for the researcher. The samples are
easy to select, and the researcher did not choose the sample that outlines the entire
population.
Example:
In researching customer support services in a particular region, we ask your few
customers to complete a survey on the products after the purchase. This is a
convenient way to collect data. Still, as we only surveyed customers taking the same
product. At the same time, the sample is not representative of all the customers in that
area.
Consecutive Sampling
Consecutive sampling is similar to convenience sampling with a slight variation. The
researcher picks a single person or a group of people for sampling. Then the
researcher researches for a period of time to analyze the result and move to another
group if needed.
Quota Sampling
In the quota sampling method, the researcher forms a sample that involves the
individuals to represent the population based on specific traits or qualities. The
researcher chooses the sample subsets that bring the useful collection of data that
generalizes the entire population.
Snowball Sampling
Snowball sampling is also known as a chain-referral sampling technique. In this
method, the samples have traits that are difficult to find. So, each identified member
of a population is asked to find the other sampling units. Those sampling units also
belong to the same targeted population.
https://byjus.com/maths/sampling-methods/
the two mean slightly different things.A questionnaire refers specifically to the set of
provides answers. Interviews work best for small groups and help you understand the
other hand, give subjects the freedom to provide their own answers. You can conduct
3. Focus groups
A focus group is a small group of people who have an informal discussion about a
particular topic, product, or idea. The researcher selects participants with similar
interests, gives them topics to discuss, and records what they say.Focus groups can
help you better understand the results of a large-group quantitative study. For
example, a survey of 1,000 respondents may help you spot trends and patterns, but a
focus group of 10 respondents will provide additional context for the results of the
large-group survey.
4. Observation
Observation involves watching participants or their interactions with specific
products or objects. It’s a great way to collect data from a group when they’re
observing people’s behavior without their knowledge. This allows you to see them
acting naturally. On the other hand, you have to conduct overt observation openly,
• When working with secondary data, the researcher doesn’t “collect” data;
suggest, published data has been published and released for public or private
online. Many of these sources are freely accessible and serve as reliable data sources.
But it’s best to search for the latest editions of these publications because dated ones
information such as age distribution. You can usually find information like this in
such as companies’ sales reports and revenue figures, can also be useful for research.
While some of this information is available to the public, you may have to get
4. Newspapers
Newspapers often publish data they’ve collected from their own surveys. Due to the
volume of resources you’ll have to sift through, some surveys may be relevant to
your niche but difficult to find on paper. Luckily, most newspapers are also published
online, so looking through their online archives for specific data may be easier.
5. Unpublished sources
These include diaries, letters, reports, records, and figures belonging to private
individuals; these sources aren’t in the public domain. Since authoritative bodies