Unit 2 Basic Statistical Concepts
Unit 2 Basic Statistical Concepts
Written By:
Aftab Ahmad
Reviewed By:
Dr. Rizwan Akram Rana
Introduction
In this unit you will study some basic concepts like variable, data, population, and
sample. Types of variable, types of data, types of population and types of sample are also
discussed. The purpose of this unit is to give an awareness of these commonly used
concepts.
Objectives
After reading this unit the students will be able to:
1. explain variable and its types.
2. explain data and its types.
3. explain population and its types.
4. explain sample and its types.
16
In research, change variables are referred to as independent variables while the outcome
variables are known as dependent variables. In cause effect relationship, there are some
unmeasured variables affecting the relationship. These are called extraneous variables.
The variables linking cause-effect relationship are called intervening variables. A brief
summary of above mentioned variables is given in the following table.
17
iii) Confounding Variable
A variable that has hidden effect on the experiment.
v) Dependent Variable
Outcome or response of an experiment. An independent variable has direct or
inverse effect upon dependent variable. In graph it is plotted on y-axis.
x) Ratio Variable
Similar to interval variable, but has meaningful zero.
18
ii) Collider Variable
A variable represented by a node on a causal graph that has paths pointing in as
well as out.
v) Dichotomous Variable
Another name for a binary variable.
x) Intervening variable
A variable that is used to explain the relationship between variables.
19
xv) Moderating variable
Changes the strength of an effect between independent and dependent variables.
For example, psychotherapy may reduce stress levels for women more than men,
so sex moderates the effect between psychotherapy and stress levels.
Data
The term “data” refers to the kind of information a researcher obtains to achieve
objectives of his research. All research processes start with collection of data, which
plays a significant role in the statistical analysis. This term is used in different contexts.
In general, it indicates facts or figures from which conclusions can be drawn. Or it is a
raw material from which information is obtained. Data are the actual pieces of
information that you collect through your study. In other words data can be defined as
collection of facts and details like text, figures, observations, symbols, or simply
description of things, event or entity gathered with a view of drawing inferences. It is a
raw fact which should be processed to get information
20
Primary data
As the name suggests, is one which is collected for the first time by the researcher
himself. Primary data is originated by the researcher for the first time for addressing his
research problem. It is also known as first hand raw data. The data can be collected using
various methods like survey, observations, physical testing, mailed questionnaire,
questionnaire filled and sent by enumerators, personal interviews, telephonic interviews,
focus groups discussion, case studies, etc.
Secondary data
Point towards the second hand information already collected and recorded by any other
person with a purpose not relating to current research problem. It is readily available
form of data and saves time and cast of the researcher. But as the data is gathered for the
purpose other than the problem under investigation, so the usefulness of the data may be
limited in a number of ways like relevance and accuracy. Also, the objectives and
methods adopted to collect data may not be suitable to the current situation. Therefore,
the researcher should be careful when using secondary data. Examples of secondary data
are censuses data, publications, internal records of the organizations, reports, books,
journal articles, websites etc.
21
Data
Primary Secondary
ii) The Accessible Population is also known as the study population. It is the
population to which a researcher can apply the conclusions of the study. This
population is a subset of the target population.
22
Sample
A sample is simply a subset or subgroup of population (Frey, Carl, & Gary, 2000).The
concept of sample arises from the inability of the researchers to test all the individuals in
a given population. Sampling is the process of selecting some individuals from the
accessible population, in a way that these individuals represent whole accessible
population. The sample should be representative in a sense that each individual should
represent the characteristics of the whole population (Lohr, 1999). The main function of
the sample is to allow the researchers to conduct the study to individuals from the
population so that the results of their study can be used to derive conclusions that will
apply to the entire population.
Probability sampling
Is a process that utilizes some form of random selection. In probability sampling, each
individual in chosen with a known probability. This type of sampling is also known as
random sampling or representative sampling; and depends on objective judgment.
Various types of probability are as under:
i) Simple Random sampling
In random sampling each member of the population has an equal chance of being
selected as subject. Each member is selected independently of the other member of
population. Many methods are used to proceed with random sampling. In a
commonly used method each member of the population is assigned a unique
number. All assigned numbers are placed in bowl and mixed thoroughly. The
researcher, then blind-folds and picks numbered tags from the bowl. All the
numbers picked are the subjects of the study. Another method is to use computer
for random selection from the population. For smaller population first method is
useful and for larger population computer-aided method is preferred.
23
ii) Systematic Random Sampling
In systematic random sampling, the researcher first randomly picks the first item or
the subject from the population. Then he selects each nth subject from the list. The
procedure involved in this sampling is easy and can be done manually. The sample
drawn using this procedure is representative unless certain characteristics of the
population are repeated for every nth member, which is highly risky.
Suppose a researcher has a population of 100 individuals and he needs 12 subjects.
He first picks his starting number 7. He then picks his interval 8. The members of
his sample will be individual 7, 15, 23, 31, 39, 47, 55, 63, 71, 79, 87, and 95
Advantages
This type of sampling is appropriate when the population has diversified social or ethnic
subgroups.
Disadvantages
While using this type of sampling, there is greater chance of overrepresentation of
subgroups in the sample.
24
or cluster of students such as classes, and then, select the sample of students from among
the clusters.
Advantages
This type of sampling is appropriate for larger population. It saves time and resources.
Disadvantages
In this type of sampling, there is a greater chance of selecting a sample that is not
representative of the whole population.
Non-Probability Sampling or Judgmental Sampling
This technique depends on subjective judgment. It is a process where probabilities cannot
be assigned to the individuals objectively. It means that in this technique samples are
gathered in a way does not give all individuals in the population equal chances of being
selected. Choose these methods could result in biased data or a limited ability to make
general inferences based on the findings. But there are also many situations in which
choosing this kind of sampling techniques is the best choice for a particular research
question or the stage of research.
25
iii) Snowball Sample
This type of sampling is appropriate when the members of the population are
difficult to locate, such as homeless industry workers, undocumented immigrants
etc. a snowball sample is one in which the researcher collects data on a few
members of the target population he or she can locate, then asks to locate those
individuals to provide information needed to locate other members of that
population whom they know. For example, if a researcher wants to interview
undocumented immigrants from Afghanistan, he might interview a few
undocumented individuals he knows or can locate, and would then rely on those
subjects to help locate more undocumented individuals. This process continues
until the researcher has all the interviews he needed, until all contacts have been
exhausted. This technique is useful when studying a sensitive topic that people
might not openly talk about, or if talking about the issue under investigation could
jeopardize their safety.
26
2.4 Activities
1. Suppose a scientist is conducting an experiment to test the what extant a vitamin
could extend a person’s life expectancy. Identify:
i) Independent Variable of the experiment.
ii) Dependent Variable of the experiment.
27
2.5 Bibliography
Bartz, A. E. (1981). Basic Statistical Concepts (2nd Ed.). Minnesota: Burgess Publishing
Company
Deitz, T., & Kalof, L. (2009). Introduction to Social Statistics. UK: Wiley_-Blackwell
Gay, L. R., Mills, G. E., & Airasian, P. W. (2010). Educational Research: Competencies
for Analysis and Application, 10th Edition. Pearson, New York USA.
Gravetter, F. J., & Wallnau, L. B. (2002). Essentials of Statistics for the Behavioral
Sciences (4th Ed.). Wadsworth, California, USA.
28