Statistics Intro
Statistics Intro
In statistics, sampling is a method of selecting the subset of the population to make statistical
inferences. From the sample, the characteristics of the whole population can be estimated.
Sampling in market research can be classified into two different types, namely probability
sampling and non-probability sampling.
Systematic sampling
Systematic sampling is a probability sampling method in which sample members from a larger
population are selected according to a random starting point but with a fixed, periodic interval.
pg. 1
Example
As another example, if you wanted to select a random group of 1,000 people from a population
of 50,000 using systematic sampling, all the potential participants must be placed on a list and a
starting point would be selected. Once the list is formed, every 50th person on the list (starting
the count at the selected starting point) would be chosen as a participant, since 50,000 ÷ 1,000 =
50.
Multistage sampling.
Example,
A researcher wants to understand pet feeding habits among people living in the Kenya. For this,
he/she requires a sample size of 200 respondents. The researcher selects 10 counties out of 47 at
random. Further, he/she randomly picks out 3 sub-counties per county. From these 30 randomly
selected sub-counties, he/she then chooses 4 pet-owning households to conduct his research.
Types of data
Nominal data
Nominal data are used to label variables without any quantitative value i.e. cannot be measured.
What's your nationality?
American
German
Kenyan
Japanese
Ordinal data
Ordinal data classifies data while introducing an order, or ranking.
What is your poverty status?
Very Rich
Rich
Least poor
Poor
Most Poor
Very Poor
Interval data
Interval data classifies and ranks data but also introduces measured intervals. A great example is
temperature scales, in Celsius or Fahrenheit. However, interval data has no true zero, i.e. a
measurement of ‘zero’ can still represent a quantifiable measure (such as zero Celsius, which is
simply another measure on a scale that includes negative values).
Ratio data
Ratio data is the most complex level of measurement. Like interval data, it classifies and ranks
data, and uses measured intervals. However, unlike interval data, ratio data also has a true zero.
When a variable equals zero, there is none of this variable. A good example of ratio data is the
measure of height—you cannot have a negative measure of height.
pg. 2
Data collection method is selected
There are various methods of collecting primary data. They include;
Observation
Researchers observe and record behaviors, actions, or events in their natural setting. This method
is useful for gathering data on human behavior, interactions, or phenomena without direct
intervention.
Interviews
Interviews involve direct interaction between the researcher and the respondent. They can be
conducted in person, over the phone, or through video conferencing. Interviews can be structured
(with predefined questions), semi-structured (allowing flexibility), or unstructured (more
conversational).
Questionnaires
Researchers design structured questionnaires or surveys to collect data from individuals or
groups. These can be conducted through face-to-face interviews, telephone calls, mail, or online
platforms.
Sampling etc.
pg. 3