0% found this document useful (0 votes)
45 views

Research Methods Notes

The document discusses research methods and variables. It defines key research terms like research, research proposal, research project, variables, and types of variables. The types of variables discussed are independent variables, dependent variables, moderating variables, and intervening variables. Relationships between these variables are illustrated with examples.

Uploaded by

Mary Citizen
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
45 views

Research Methods Notes

The document discusses research methods and variables. It defines key research terms like research, research proposal, research project, variables, and types of variables. The types of variables discussed are independent variables, dependent variables, moderating variables, and intervening variables. Relationships between these variables are illustrated with examples.

Uploaded by

Mary Citizen
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 51

Jane Muchembi

RESEARCH METHODS
Research: Is a process of enquiry and investigation that is systematic, methodical and ethical
(www.brad.ac.uk/intrduction-to-research)
According to Orodho and Kombo (2002) research is a process of arriving at dependable solutions
to problems through planned systematic collection, analysis and interpretation of data.

Research proposal: Is plan, suggestion or request to conduct research which comprises of


introduction, literature review and research methodology (Kombo and Tromp 2006). It consists
of three chapters:-
Chapter one: Introduction
Chapter two: Literature review
Chapter three: Research methodology

Research project or thesis: Is a written scientific report that deals with the concerns related to
the problem or series of problems in one’s area of research (Kombo and Tromp 2006). It
comprises of five chapters:-
Chapter one: Introduction
Chapter two: Literature review
Chapter three: Research methodology
Chapter four: Research findings
Chapter five: Summary, Conclusions, Recommendations

VARIABLES
 You won't be able to do very much in research unless you know what variables are. A variable is
any entity that can take on different values. The term variable is derived from variations. It
refers to differences, attributes, qualities of the cases that we measure or record (Kombo and
Tromp 2006).
 A variable is anything that takes differing or varying values e.g. Age, height, price, income, level
of education, motivation, performance, etc.
 Anything that can vary can be considered a variable. For instance, age can be considered a
variable because age can take different values for different people or for the same person at
different times. Similarly, country can be considered a variable because a person's country can
be assigned a value.

TYPES OF VARIABLES
1. Independent variables (also known as predictor variable)
 An independent variable is what the researcher (or nature) manipulates, a treatment or
program or cause.
 It is the intervention or a condition implemented by the researcher to see if it will create
change on the dependent variable. This could be a program, method, system, or other action.
 Independent variables are predictors or explanatory variables, those that cause changes to
the dependent variables. They influence the dependent variables positively or negatively.

1
Jane Muchembi

e.g. The relationship of availability Computer Aided Design (CAD) software (independent
variable) on the number of defective product i.e. rejects (dependent variable) in fashion
manufacturing firm may be affected by the designers ability to use the CAD technology
(moderating variable).
 Whenever there is independent variable there is also a dependent variable.
 An increase or decrease in units of independent variable will lead to increase or decrease in
dependent variable i.e. variance in dependent variable is accounted for by variance in
independent variables.

2. Dependent variables (also known as criterion variable)


 Dependent variable: A dependent variable is what may change as a result of the independent
variable or intervention. 
 The dependent variable is what is affected by the independent variable – the researchers’
effects or outcomes. For example, if the researcher is studying the effects of a new
educational program on student achievement, the program is the independent variable and
your measures of achievement are the dependent ones.
 A dependent variable could be a behavior, outcome, or other condition.  Introduction of
modern cutting and stitching machines (independent variable) in a garment making firm, for
example, may increase the output or production (dependent variable).
 It is the outcome that the researcher is attempting to predict e.g. performance of a designer. It
is the variable that the researcher is primarily interested in. The researcher’s goal is to
understand, describe it, to find out what influences its behavior or to predict it.
 Dependent variable is the variable of primary interest to the researcher. The researcher’s goal
is to understand and describe the dependent variable or to explain its variability or to predict
it.
 Understanding the dependent variable and the other variables that influence it will help the
researcher to recommend solutions to problems e.g. understanding factors (independent
variables) that hinder many Kenyan fashion designers from starting their own fashion design
businesses (dependent variable) will help the researcher recommend possible ways of
removing those hindrances.

Relationship between independent and dependent


variable

Computer aided design Number of defective products / rejects

Independent variable Dependent variable

3. MODERATING VARIABLE
 Moderating variable is one that has a strong contingent effect on the independent variable-
dependent variable relationship.

2
Jane Muchembi

 Its presences modifies the relationship between the independent and dependent variables e.g. The
designers ability or skills in using CAD technology will interfere with the relationship between
availability of CAD software and number of defective products (rejects).

Relationship between independent, dependent variables modulated by moderating variable

Computer aided design Number of defective products / rejects

Independent variable Dependent variable

Designer’s CAD skills

Moderating variable

4. INTERVENING VARIABLE
 Intervening variable is one that surfaces between the time independent variable starts operating
to the time their impact is felt on it. There is therefore a temporal quality or time dimension to
the intervening variable.
 It surfaces as a function of the independent variable(s) operating in any situation and helps to
conceptualize and explain the influence of independent variable(s) on the dependent variable.
e.g. in a study where the independent variable is workforce diversity which influences the
dependent variable is organizational effectiveness, creative synergy (collaboration, cooperation,
interaction) may surface as the intervening variable. Creative synergy may surface due to
multiracial, multiethnic and multifaceted expertise in problem solving.
 NB: creative synergy in the above example surfaces at time t2 as a function of workforce
diversity which was in place at time t1 to bring about organizational effectiveness at time t3.
 The intervening variable of creative synergy helps us to understand how workforce diversity
brings about organizational effectiveness.

Relationship between independent, intervening and dependent variables

Time t2 t3
t1
:me

Workforce diversity Organizational effectiveness


Creative synergy

Independent variable Intervening variable Dependent variable


3
Jane Muchembi

Relationship between independent, intervening, moderating and dependent variables

Time t1 t2 t3
:me
Independent variable
Workforce diversity Creative synergy Organizational effectiveness

Intervening variable Dependent variable

Managerial expertise

Moderating variable

NB: Inclusion of a moderating variable will affect the relationship between the other variables
e.g. creative synergy may not occur if the management is not able to harness the different
problem solving skills in the diverse workforce.

Abstract: is a summary of the whole research project including the objectives, methodology
used, findings and recommendations. It should not exceed 500 words.

THE PURPOSE CONDUCTING RESEARCH is to:-


i. Review or synthesize existing knowledge
ii. Investigate existing situations or problems
iii. Provide solutions to problems
iv. Explore and analyze more general issues
v. Construct or create new procedures, or systems or technologies
vi. Explain a new phenomenon
vii. Generate new knowledge
viii. Combination of any of the above
(Collis & Hussey, 2003). (www.brad.ac.uk/intrduction-to-research)

RESEARCH TOPIC: Refers to the subject, issue or area under discussion or investigation e.g.
fashion marketing, African dress, interior design.

4
Jane Muchembi

A researcher should choose a topic which interests him/her as this will help sustain the
motivation to complete the research. This is because one will enjoy reading materials in the area,
collecting and analyzing data.
RESEARCH TITLE: is the heading, label or tag. It describes what the research is about. It
should be formulated after selecting the topic.
QUALITIES OF A GOOD TITLE
i. It should be brief i.e. not more than 15 words
ii. It should be specific i.e. it should be easy to identify independent and dependent variable from
the title.
iii. It should be in line with the objectives i.e. it should portray the aims and objectives of the
research.
iv. It should be clear and not ambiguous i.e. it should not be subject to different interpretations. It
should reflect a relationship between independent and dependent variables.
v. It should portray an issue that is researchable i.e. the aspects in question should be
measurable.

REFERENCING OTHER PEOPLES’ WORK WHEN CONDUCTING RESEARCH


A researcher should acknowledge all the sources of information used when conducting the
research by citing the author’s surname and year of publication in the text and writing its full
details in the reference list at the back of the document.

REASONS FOR CITING OTHER PEOPLES WORK


i. The body of knowledge is not built on vacuum but on accumulated knowledge over
generations and so the researcher should try to fit his/her research within already existing
knowledge.
ii. It is courteous to acknowledge use of other peoples’ work.
iii. It provides evidence of thorough examination of literature and shows that the researcher has
read.
iv. It gives support to the points that the researcher has stated in the research.
v. Helps to identify gaps that should be met by the researcher.

FOUR IMPORTANT POINTS WHEN REFERENCING


i. Credit must be given when quoting or citing other peoples.
ii. Adequate information must be provided to enable a reader to locate each reference.
iii. References must be complete and consistent.
iv. References must be recorded in precise format as per the requirement of the University of
Study.

REFERENCING STYLES/SYSTEMS
There are several referencing styles to choose from e.g.
i. Vancouver which uses numbers in the text and footnotes. Each number in the text represents a
reference or publication whose details are given in the footnotes.

5
Jane Muchembi

ii. Modern Language Association (MLA)


iii. Harvard system which is an author –date system
iv. The American Psychological Association (APA) system which is also a an author –date
system which is recommended for social sciences.

CHOICE OF REFERENCING STYLE/SYSTEMS


The choice is influenced by the discipline of study and the university.
NB: A researcher should familiarize himself or herself with all the styles and thoroughly study
the style that he/she will adopt.
The American Psychological Association (APA) system
 It is an author –date system which is recommended for social sciences.
 It requires that the researcher identifies or acknowledges or writes the authors’ surname/names
and date (year) of publication at the point where they are cited in the text (proposal or project)
and full details of the publication in the reference list at the end of the publication.

Referencing in the text


 This can be done at the beginning or end of the sentence.
 When done at the end of the sentence the authors name and date are put into bracket and full stop
at the after the bracket e.g. teenage girls prefer wearing high heeled shoes (Kelly, 2014). If there
are two authors write both authors surnames names separated by & and then year e.g. (Kelly &
Allan, 2014).
 When used at the beginning of the sentence only the date is in bracket e.g. According to Kelly
(2014) teenage girls prefer wearing high heeled shoes.
 In case the author is not known cite a (short title, date)
 If the date is not known cite (Author, n.d.).
 Corporate authors like UN Cite in full first used and acronyms thereafter e.g. (United Nations
(UN), 2001) first time and (UN, 2001) thereafter.
 Page numbers are only written in citations for direct quotations but not paraphrased ones.

Referencing in the reference list


 This is placed immediately after chapter three for the proposal and chapter five for the project.
 For text books, one starts with the authors surname/names initials (date). Then title of the book
(in Italics). The city of publication: publisher. E.g. Kelly, J. M. & Allan, B. S. (2014).
Introduction to Research Methods. Nairobi: Acts Publishers.
 For journal articles, start with surname/s (date). Title of the article, Journal name (in Italics),
Volume No. Part no. Pages. E.g. Kelly, J.M. & Allan, B.S. (2014). University dressing Code,
Fashion and Design Journal, 43 (3), 72-90
 For journal published in the internet < URL: http://www. remainder of full internet electronic
conference/bulletin board >
e.g. <URL:http://www.stingray.ivision.co.uk/groups/emu/frindex.htm>
 For Journals article published in the internet start with the surname, Initials (date). ‘Title of the
article’, Journal Name (in Italics). Volume No. Part no. (online) (cited day, month, year)

6
Jane Muchembi

Available from <URL: http://www. remainder of full internet address>e.g. Illingworth, N. (2001)
‘The internet matters’ Sociological Research Online 6:2, (online) (cited 20 March 2002)
Available from < URL: http://www.scoresonline.org.uk/6/2/illing th.html>
NB:
i. All the authors cited in the text should be included in the reference list.
ii. Authors not cited in the text should not be in the reference list.
iii. Single space within a reference and double space between references.
iv. Use hanging indents in references
v. The publication in the reference list should be arranged in alphabetical order by the
authors’ names i.e. authors whose surnames start with letter “A” should be listed before
those whose surnames start with “B” and so on.

An example of reference list

Babbie, E. (2001). The practice of social Research: 9th Ed. CA: Wadworth Thomson.

Neville C. (2007). An introduction to research and research methods. Bradford University


school of Management.

Saunders, M, Lewis, P. & Thornhill, A. (2007) Research Methods for Business Students (4th
Edition.) Harlow: Prentice Hall.

NB: Refer to the APA Manual for more details.

7
Jane Muchembi

CHAPTER ONE: INTRODUCTION

1.0 BACKGROUND TO THE STUDY


 Refers to the setting or position of the study. It is a brief overview of the problem the researcher
plans to investigate. It should tell the readers why the researcher feel that the research is worth the
effort. It should be a highlight of the key literature sources that the researcher plans to draw from.
 It provides background to the problem from international, national and local point of view based
on existing literature. It is similar to literature review in that it cites some literature but different in
that it is much more brief (not more than ½ pages). It is a mini literature.

ROLE OF BACKGROUND TO THE STUDY IN RESEARCH


i. It helps to clarify what brought about the need for the study.
ii. It points out the challenges faced due to the identified issue
iii. It indicates the opportunities for improvement
iv. It demonstrates the researcher’s view of the research problem.
v. It helps to convince the readers that a problem exists and need to be addressed.

QUALITIES OF A GOOD BACKGROUND TO THE STUDY


It should be:-
i. Brief and specific, although it is very similar to the literature review it is less detailed.
ii. Engages the interest of the reader. It should make the reader excited about the opportunity of
having the research problem addressed.
iii. It gives the reader a glimpse of the research problem
iv. It gives the reader an idea of how the proposal is structured
v. The language used is simple and straight forward
vi. Informative and persuasive since it attempts to enlighten the reader on the research problem and
the urgency of addressing it.
(Kombo and Tromp 2006)

1.2 RESEARCH PROBLEM


 A research problem is an issue or concern that puzzles the researcher because of its effects or
consistence despite the measures taken e.g. use of indecent dressing among fashion and design
students despite continued training on importance of dressing decently.
 It can also be defined as situation where gap exists between the actual and desired ideal situation
or state e.g. the desired ideal situation may be zero defects in production but the actual may be that
there are a few defects.
 It is the heart or core of the research project and if a researcher does not understand the problem
he/she will not succeed in the research.

QUALITIES OF A GOOD OR EFFECTIVE RESEARCH PROBLEM


i. It is concise and clearly stated. It makes the reader aware that there is a definite issue that need to
be addressed or problem that needs to solved. It should answer questions such as:-

8
Jane Muchembi

a) What is the problem?


b) Why is it a problem?
c) Who is getting affected or will be affected if the situation is not addressed through research?
d) Where/ location does the problem exist?
ii. The research problem has impact in the whole topic being investigated.
iii. It indicates clearly that there is urgent need of the research.
iv. It is researchable i.e. the problem can be investigated through collection and analysis of data.
v. Relevant literature is available i.e. the problem has supportive statements.
vi. Language used is simple and objective i.e. not emotional, comic or poetic.
(Kombo and Tromp 2006)

1.3 RESEARCH OBJECTIVES


These are the intentions or purposes of research stated in specific and measurable terms. They
provide opportunities for measuring end results e.g. to establish the effects of age on the height
of shoes used by women in Nairobi city?

QUALITIES OF GOOD OR EFFECTIVE RESEARCH OBJECTIVES


i. Specific: clearly state what the researcher will do in order to fulfil the purpose of the research.
ii. Measurable: the researcher should be able to measure or tell whether they have been
achieved.
iii. Focused: Should narrow the study to essentials. They should cover the different aspects of the
problem and its contributing factors in a coherent and logical sequence.
iv. Operational: they should be clearly stated in operational terms, specifying exactly what the
researcher will do.
v. Realistic: Not too ambitious and therefore achievable.

1.5 RESEARCH QUESTIONS


These are issues that the researcher plans to answer and are related to the research objectives e.g.
what is effect of age on the height of shoes used by women in Nairobi city?

1.6 RESEARCH HYPOTHESES


Hypothesis: is an educated guess or assumption. It is a tentative explanation for certain behavior
pattern, phenomenon, or events that have occurred or will occur.
Research hypothesis is a statement about the expected relationship between observable or
measurable events.
An experimental research hypothesis states the expected relationship between independent and
dependent variables e.g. increase in consumers salaries will lead to increase in consumption of
fashion products.

FORMS OF STATING HYPOTHESIS


1. The null hypothesis: States that there is no difference between the variables studied. The aim
of testing it is to show that the hypothesis is false and accept the alternative hypothesis. It
symbolized as H0 e.g. H0 1: There is no significant difference in the performance of designers

9
Jane Muchembi

who have gone through a diploma course in design and those who have gone through a degree
program.

2. The alternative hypothesis: is the opposite of the null hypothesis and states that a value or
relationship in the null is not true. It symbolized as H1 e.g. H1 e.g. H1: there is a significant
difference in performance of designers who have gone through diploma and those who have
gone through a degree program.

1.7 SIGNIFICANCE/JUSTIFICATION OF THE STUDY


Outlines the importance of the issue under investigation. Explains who will benefit from the
research and how they will benefit from the findings of the research. This is important because
research is expensive in both money and time and therefore it is necessary to establish the need
or relevance of research.

1.8 LIMITATIONS OF THE STUDY


Refer to the challenges anticipated or faced by the researcher e.g. time, finances inaccessibility
of data non-cooperative respondents etc.

1.9 ASSUMPTIONS OF THE STUDY


What the researcher assumes to be true e.g. the information given by the respondents is true.

DEFINITION OF TERMS
Significant terms which are used whose meanings are not obvious are defined. Start this section
in a new page.

NB: Every chapter should be started in a new page.

CHAPTER TWO: LITERATURE REVIEW


The term literature refers to the analysis of publications such as books, journals, magazines,
manuscripts etc. Literature means the work/publications that the research has consulted or referred
to when conducting research to gain better understanding of the of the research problem. Literature
review is therefore an account or a critical look at what has been published on a topic by accredited
scholars.

REASONS FOR CRITICAL LITERATURE REVIEW WHE N CONDUCTING


RESEARCH
i. It sharpens theoretical foundation of the research. It enables the researcher to study different
theories related to the identified topic. By studying them the researcher gains clarity and better
understanding of the theoretical foundations related to the research.
ii. It gives the researcher insight into what has already been researched in the selected field,
pinpointing strengths and weaknesses. This information guides the researcher in theory
formulation and identifying the gaps.
iii. It enables the researcher to avoid duplication of research already done.

10
Jane Muchembi

iv. Helps the researcher to develop a significant problem of the study which will provide further
knowledge in the field of study.
v. It exposes the researcher to a variety of approaches or methods of conducting research. This
contributes to a well-designed methodology because the researcher can avoid methods that have
been reported in the literate to be weak or have failed.
vi. It helps in developing analytical framework or basis for analyzing and interpreting data. The
researcher will be able to compare and contrast their findings with the findings of the other
researchers.
vii. Helps to convince the reader that:-
a) The researcher is knowledgeable about the problem area and has done the preliminary home
work that is necessary to conduct research
b) Theoretical framework will be structured on work already done and will add to the solid
foundation of existing knowledge.
(Sekaran 2003)

QUALITIES OF GOOD LITERATURE REVIEW


i. It should be critical, organized and analytical in orientation.
-It is used to analyze issues such as whether the title is descriptive, well worded and reasonably
short.
-The researcher also analyzes whether the problem was well stated, objectives, questions and
hypotheses in the studies reviewed were logically formulated.
-The researcher also analyzes the methodology used was appropriate i.e. was the right design,
population, sample and sampling procedure, data collection methods and instruments and data
analysis techniques were used.
- The researcher also analyzes whether it provided the results are significant, applicable and
relevant to his/her research.
ii. It justifies the need for further study i.e.
- it identifies the gaps in the studies reviewed that should be filled with further research.
-It highlights controversies raised in the previous researches which need to be resolved by further
research.
-It points out how current research will positively contribute towards filling in the identified gaps.
iii. It highlights the relationship between the past and the current study i.e.
-It links the current study with the past studies
- It evaluates and shows the relationship between the work already done by other scholars and
the researcher’s work.
-This brings consistency and continuity in relation to identified topic.
iv. It puts the research into perspective
- Citing and analyzing various related studies helps to define the research problem.
- It also act as a guideline in assessment of the research questions.

THE STRUCTURE/COMPONENTS OF LITERATURE REVIEW


2.1 Introduction: It briefly (one paragraph) tells the reader what is discussed in the literature
review.

11
Jane Muchembi

2.2 The body: this may be structured into subtopics based on either the:- a) objectives or
b) Variables
2.3 Summary and identified gaps: summarizes the key and relevant findings from the reviewed
literature and brings out what the previous studies have not covered on the topic and needs to be
researched.
NB:
 When writing the literature review, read the publication critically and analytically, understand
and then summarize the idea you find relevant in your own words. Then cite the author.
 If there is need to use direct quotation put them into quotes and cite the authors name, date and
page number. If the quotation is from 40 words and above indent and single space the whole block
and do not put them in quotes, cite the author, date and page number. Desist from using quotes after
quotes and copy pasting.
 Bring out some discussions on every idea you write about based on what different authors have
written i.e. explain where different authors agree or disagree, contradictions and gaps of what they
have not explained about the idea (missing gaps)
 Only include relevant literature. Not every material you will read is relevant to your study.

Reasons for citing other peoples work when conducting research in clothing, textile and
interior design industry.
i. The body of knowledge is not built on vacuum but on accumulated knowledge over generations
and so the researcher should try to fit his/her research within already existing knowledge.
ii. It is courteous to acknowledge use of other peoples’ work.
iii. It provides evidence of thorough examination of literature and shows that the researcher has
read.
iv. It gives support to the points that the researcher has stated in the research.
v. Helps to identify gaps that should be met by the researcher.

2.4 THEORETICAL FRAMEWORK AND CONCEPTUAL FRAMEWORK


2.4.1 THEORETICAL FRAMEWORK
According to Sekaran (2004), theoretical framework explains the relationship between variables that
are deemed to be integral to the dynamics of the situation being investigated. It is the foundation on
which the entire research is based. It is based on existing relevant theories.

2.4.2 CONCEPTUAL FRAMEWORK


Is a set of broad ideas and principles taken from relevant fields of inquiry and used to structure a
subsequent presentation (Reichel and Ramey (1986) as cited by Kombo and Tromp 2006)

DIFFERENCES BETWEEN CONCEPTUAL AND THEORETICAL FRAMEWORK.


Theoretical framework Conceptual framework
-It is based on existing relevant theories. - It based on the way the researcher conceptualizes
12
Jane Muchembi

certain ideas, actions and relationships


- Since is based on existing theory it is likely -it is a hypothesized mode of identifying concepts or
to have been tested before variables under study and showing their relationships
-May not be supported by schematic diagram -Is better accompanied by schematic diagram

SIMILARITIES BETWEEN CONCEPTUAL AND THEORETICAL FRAMEWORK.


i. They both explain the relationship between dependent and independent variables.
ii. Both form the foundation on which the whole research is based.
iii. Help the researcher to hypothesize and test the relationship between dependent and independent
variables.
iv. Help to identify the existing networks of relationships among variables.
v. Both tend to focus the direction of research so that the researcher does not dwell on the trivial
matters that do not contribute to the body of knowledge or digress
vi. Help to provide a rational basis for explaining or interpreting results of the research.
vii. Enables the researcher to make predictions.
viii. Both may need to be tested or improved through research.

THE PURPOSE OF DEVELOPING A CONCEPTUAL FRAMEWORK WHEN


CONDUCTING RESEARCH
i. Provides a clear link from literature review, to the research problem, questions and objectives.
ii. Gives broad scope of the research and enables the researcher to conceptualize the research
problem from a broad perspective.
ii. It is conceptually aligned with the research methodology and objective and therefore provides an
overall view of the research.
By highlighting the independent and dependent variables, it enables the researcher to focus on the
research problem.

PLACEMENT OF CONCEPTUAL AND THEORETICAL FRAMEWORK


 According to Creswell (2004) placement of theory depends on whether one is conducting a
quantitative or qualitative study. In quantitative research it may be placed at the end of introduction
i.e. chapter one or at the beginning or end of chapter two i.e. literature.
 In qualitative research a researcher does not begin with theory but it emerges during data
collection and analysis and so it can be placed at the end of the research.

CHAPTER THREE: RESEARCH METHODOLOGY


RESEARCH APPROACHES
Quantitative research Qualitative research
- Quantitative research relies on the principle of - Qualitative research is research that involves
verifiability meaning that it seeks confirmation, description, it seeks to describe the behavior of
proof, corroboration or substantiation the population under investigation
- Emphasizes collecting and analyzing numerical - Does not emphasize use of numerical data.

13
Jane Muchembi

data.
Basic research Applied research
- Aims at improving or adding knowledge without - Is designed right from the start to apply its
any particular applied purpose in mind at the findings to a particular situation or to solve a
onset. specific problem.
Deductive research: testing theory Inductive research: building theory
- Moves from general ideas to specific, particular - Moves from particular situations to make or infer
situations. broad general ideas or theories.
- The particular is deduced from the general e.g. - E.g. if you wanted to know the meaning of the
broad theories word fashion to a range of people may be of
- E.g. if you wanted to know the meaning of the word different ages or gender, you could start by
fashion to a range of people may be of different checking the definitions with the range of
ages or gender, you could start by checking the people and then compare with definition given
definitions given by professional associations or by professional associations or sources and
sources and then test the definition with the range then test the definition
of people. - Then collate the data collected and the results
- Then collate the data collected and the results analyzed and presented.
analyzed and presented. - This may lead to new definition or not

3.1. RESEARCH DESIGNS


Research design: can be thought of as a structure of the research. It is the glue that holds all the
elements of research project together.
There are different types of research designs e.g.
I. Descriptive Survey:
i. Describes the state of affairs as it exists.
ii. May lead to formulation of important principles of knowledge and solutions to significant
problems.
iii. Involve collection, classification, analysis, comparison and interpretation of data.
iv. Can be used for collecting information about people’s attitudes, opinions, and habits.

II. Correlational design:


i. Enables the researcher to assess the degree of relationship that exists between two or more
variables e.g. age of the consumer and the height of the shoes used.
ii. Tries to establish how given independent variables affect or cause changes to the dependent
variables.
iii. Tend to compare similarities and differences between groups of population.
iv. The researcher is required to control all other variables to be able to establish a causal
relationship.

III. Evaluative design:


i. Investigates the effects or impacts of a program or project.
ii. Involve collection and analysis of data to facilitate decision making.

14
Jane Muchembi

iii. Assists in meeting accountability demands.


iv. Helps to determine the worth of a program or project or activity.

IV. Experimental design:


i. Involve manipulation or treatment of some variables and then measuring the outcomes.
ii. Two groups of respondents are used i.e. the experimental group and control group.
iii. The experimental group receives the treatment but the control group does not and their
performance or behavior is tested after the experiment.
iv. The researcher assumes that the two groups were the same before the experiment.

V. Case study
i. Seeks to study a unit in detail, in context and holistically.
ii. Enables the researcher to deep information and understanding on the unit being studied
iii. The findings may not be easily generalized.
iv. The researcher should give clear justification for using this design.

VI. Naturalistic/ethnographical design


i. Studies a phenomenon in its ordinary or natural environment
ii. Attempt to study ordinary happenings
iii. Uses common language in reporting
iv. Is mostly qualitative in nature

VII. Historical /longitudinal design


i. Involves integration and description of the true record of past events and their developmental
trends.
ii. It is a longitudinal, genetic or developmental approach attempting to get the actual meaning of
the primary and secondary sources pertinent to the problem under investigation by subjecting the
evidence to external and internal criticism.
iii. Involves collection and analysis of primary sources such as manuscripts, memoirs, diaries,
autobiographies
iv. The major focus is construction of conceptual frameworks, collection, analysis and organization
and demonstration meaning of historical evidence

VIII. CROSS-SECTIONAL STUDIES


This is a study involving different organizations or groups of people to look at similarities or
differences between them at any one particular time, e.g. a survey of the IT skills of managers in
one or a number of organizations at any particular time. Cross-sectional studies are done when
time or resources for more extended research, e.g. longitudinal studies, are limited. It involves a
close analysis of a situation at one particular point in time to give a ‘snap-shot’ result.

3.2 THE LOCALE OR LOCATION OF THE STUDY


Describe the location where the research will be carried out giving justifications.

15
Jane Muchembi

3.3 POPULATION/TARGET POPULATION


It is the group of people or objects or items with at least one thing in common from which the
sample will be taken for measurements and the results generalized to e.g. All the small scale
fashion traders in Nakuru county.
The major question that motivates sampling in the first place is: "Who do you want to generalize
to?" Or should it be: "To whom do you want to generalize?" In most social research we are
interested in more than just the people who directly participate in our study. We would like to be
able to talk in general terms and not be confined only to the people who are in our study.
There are times when we aren't very concerned about generalizing. Maybe we're just evaluating a
program in a local agency and we don't care whether the program would work with other people
in other places and at other times. In that case, sampling and generalizing might not be of
interest.
In other cases, we would really like to be able to generalize almost universally. When
psychologists do research, they are often interested in developing theories that would hold for all
humans. But in most applied social research, we are interested in generalizing to specific groups.
The group you wish to generalize to is often called the population in your study. This is the
group you would like to sample from because this is the group you are interested in generalizing
to.

Let's imagine that you wish to generalize to urban homeless males between the ages of 30 and 50
in the Kenya. If that is the population of interest, you are likely to have a very hard time
developing a reasonable sampling plan. You are probably not going to find an accurate listing of
this population, and even if you did, you would almost certainly not be able to mount a national
sample across hundreds of urban areas. So we probably should make a distinction between the
population you would like to generalize to (theoretical population) and the population that will
be accessible to you (the accessible population). In this example, the accessible population
might be homeless males between the ages of 30 and 50 in six selected urban areas across the
Kenya

SAMPLING FRAME: is the list of all the members of the population under investigation. The
researcher can obtain one from the relevant bodies or develop one in case where such a list does
not exist.
Once you've identified the theoretical and accessible populations, you have to do one more thing
before you can actually draw a sample -- you have to get a list of the members of the accessible
population. (Or, you have to spell out in detail how you will contact them to assure
representativeness). The listing of the accessible population from which you'll draw your sample
is called the sampling frame.

3.4 SAMPLE POPULATION


Sample population: is a sub-set of population or a small part of the target population which is
thought to be a true representative of the whole population. It comprises of all the members
selected from the population from which data will be collected and then generalized to the whole
population.

16
Jane Muchembi

You actually draw your sample (using one of the many sampling procedures). The sample is the
group of people who you select to be in your study. Notice that I didn't say that the sample was
the group of people who are actually in your study. You may not be able to contact or recruit all
of the people you actually sample, or some could drop out over the course of the study. The group
that actually completes your study is a subsample of the sample -- it doesn't include non-
respondents or dropouts. The problem of nonresponse and its effects on a study will be addressed
when discussing "mortality" threats to internal validity.
People often confuse what is meant by random selection with the idea of random assignment.
You should make sure that you understand the distinction between random selection and random
assignment.
At this point, you should appreciate that sampling is a difficult multi-step process and that there
are lots of places you can go wrong. In fact, as we move from each step to the next in identifying
a sample, there is the possibility of introducing systematic error or bias. For instance, even if you
are able to identify perfectly the population of interest, you may not have access to all of them.
And even if you do, you may not have a complete and accurate enumeration or sampling frame
from which to select. And, even if you do, you may not draw the sample correctly or accurately.
And, even if you do, they may not all come and they may not all stay.

3.5 SAMPLING/ SAMPLING TECHNIQUES/PROCEDURES /METHODS


Sampling refers to the process of selecting the individuals or objects from which to gather the
information from the population. It is the process of selecting units (e.g., people, organizations)
from a population of interest so that by studying the sample we may fairly generalize our results
back to the population from which they were chosen.
SOME DEFINITIONS
Before I can explain the various probability methods we have to define some basic terms. These
are:
 N = the number of cases in the sampling frame
 n = the number of cases in the sample
 NCn = the number of combinations (subsets) of n from N
 f = n/N = the sampling fraction
That's it. With those terms defined we can begin to define the different probability sampling
methods.

SAMPLING FORMULAR
N
n =______________________
1 + Ne2

Where;
N- the population/Number of cases in the sampling frame
e- the margin of error
n- the sample size/number of cases in the sample

17
Jane Muchembi

SAMPLING DESIGNS
Are categorized as:-
1. PROBABILITY SAMPLING
Probability sampling techniques: apply random selection where each member of population is
given equal chances of being in the sample. They don’t give all the members of the population
equal chances of being in the sample.
A probability sampling method is any method of sampling that utilizes some form of random
selection. In order to have a random selection method, you must set up some process or procedure
that assures that the different units in your population have equal probabilities of being chosen.
Humans have long practiced various forms of random selection, such as picking a name out of a
hat, or choosing the short straw. These days, we tend to use computers as the mechanism for
generating random numbers as the basis for random selection.

2. NON-PROBABILITY SAMPLING
Non-probability sampling techniques: The researcher is interested in the representativeness of
the concepts in their varying forms maximizing the scope or range of variations of respondents in
the sample.

Probability sampling techniques Non-probability sampling techniques


i. Simple random sampling i. Quota sampling
ii. Systematic random sampling ii. Convenience sampling
iii. Stratified random sampling iii. Judgmental sampling
iv. Cluster random sampling iv. Purposive sampling
v. Multistage random sampling v. Snowballing
vi. Volunteer/ self-selection

i. Simple random sampling


It has the least bias and produces most generalized results because it gives every member of
population a chance of being in the sample.
Procedure: Use a table of random numbers, a computer random number generator, or a mechanical
device to select the sample. One may use the following procedure.

1. First, we have to get the sampling frame organized. The researcher obtains or makes a list of all
the members of the population and assigns to each of them a number or letter i.e. if the whole
population comprises of 1,000 people make a list of 1,000.
2. Decide on the number of people you would like to have in the final sample depending on the size
of the population and the research design. For example, if you want to select 100 clients to survey
and that there are 1000 members of the population. Then, the sampling fraction is f = n/N =
100/1000 =10 or 10%.
3. Write those numbers in small papers (tickets) similar to those used in a raffle i.e. 1-1,000.
4. Fold each of them and mix them nicely in a container.

18
Jane Muchembi

5. Pick the number of tickets equal to the sample size chosen e.g. if a sample of 10% is required
pick 100 tickets and individuals represented by the numbers in the 100 tickets picked will be the
ones to make up the sample.

But this mechanical procedure would be tedious and the quality of the sample would depend on
how thoroughly you mixed them up and how randomly you reached in.

With the development of inexpensive computers there is a much easier way. Here's a simple
procedure that's especially useful if you have the names of the clients already on the computer.
Many computer programs can generate a series of random numbers. Let's assume you can copy and
paste the list of client names into a column in an EXCEL spreadsheet. Then, in the column right
next to it paste the function =RAND() which is EXCEL's way of putting a random number between
0 and 1 in the cells. Then, sort both columns -- the list of names and the random number -- by the
random numbers. This rearranges the list in random order from the lowest to the highest random
number. Then, all you have to do is take the first hundred names in this sorted list. Pretty simple.
You could probably accomplish the whole thing in under a minute.
Simple random sampling is simple to accomplish and is easy to explain to others. Because simple
random sampling is a fair way to select a sample, it is reasonable to generalize the results from the
sample back to the population. Simple random sampling is not the most statistically efficient
method of sampling and you may, just because of the luck of the draw, not get good representation
of subgroups in a population. To deal with these issues, we have to turn to other sampling methods.
ii. Systematic random sampling
It involves drawing every kth elements in the population starting with a randomly selected element
between 1 and N.
Here are the steps you need to follow in order to achieve a systematic random sample:
 number the units in the population from 1 to N (N= number the units in the population)
 decide on the sample size that you want or need (n=sample size)
 determine the interval size (k ). The interval size =N/n
 randomly select an integer between 1 to N
 then take every kth unit

e.g. if 10% sample is required from a population of 1,000 people (i.e. 100 people)
1. The researcher obtains or makes a list of all the members of the population and assigns to each
of them a number or letter i.e. if the whole population comprises of 1,000 people make a list of
1,000.
Pick the first element (number) at random
2. Pick every 10th (1000/100) number from the randomly selected number until you get the 100
required e.g. if 1 (one)is the randomly selected number then the next number picked is, 10,
then 20, then 30, the 40 etc. until a sample of 100 is picked.
1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22,23, 24, 25, 26,27,27,
29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53,
54, 55-----------1,000

19
Jane Muchembi

iii. Stratified random sampling


Stratified Random Sampling, also sometimes called proportional or quota random sampling,
involves dividing your population into homogeneous subgroups and then taking a simple random
sample in each subgroup. In more formal terms:
The researcher first divides the population into groups based on relevant characteristics and then
randomly selects participants from those groups e.g. If the population comprises of both male and
female, make a list of all the male and all the female. Then randomly select the number of males
required and the number of females required.
The steps:-
1. Determine the strata that the population will be divided into e.g. according to sex (male and
female), or according to the age, or geographical location such as rural and urban etc.
2. Determine the number of participants required from each stratum e.g. If a sample of 100 is
required from a population 1000 where there is 50% male (500) and 50% female (500), the
sample may also have 50% (50 ) male and 50% (50) female i.e. Proportionate random sampling.
3. Randomly select the participants from within the group/ strata.
NB: The sample can also be selected using a computer program.

Objective: Divide the population into non-overlapping groups (i.e., strata) N1, N2, N3, ... Ni, such
that N1 + N2 + N3 + ... + Ni = N. Then do a simple random sample of f = n/N in each strata.

There are several major reasons why you might prefer stratified sampling over simple random
sampling. First, it assures that you will be able to represent not only the overall population, but
also key subgroups of the population, especially small minority groups. If you want to be able to
talk about subgroups, this may be the only way to effectively assure you'll be able to. If the
subgroup is extremely small, you can use different sampling fractions (f) within the different
strata to randomly over-sample the small group (although you'll then have to weight the within-
group estimates using the sampling fraction whenever you want overall population estimates).

When we use the same sampling fraction within strata we are conducting proportionate stratified
random sampling. When we use different sampling fractions in the strata, we call this
disproportionate stratified random sampling. Second, stratified random sampling will generally
have more statistical precision than simple random sampling. This will only be true if the strata
or groups are homogeneous. If they are, we expect that the variability within-groups is lower
than the variability for the population as a whole. Stratified sampling capitalizes on that fact. For
example, let's say that the population of clients for our agency can be divided into three groups:
Caucasian, African-American and Hispanic-American. Furthermore, let's assume that both the
African-Americans and Hispanic-Americans are relatively small minorities of the clientele (10%
and 5% respectively).

If we just did a simple random sample of n=100 with a sampling fraction of 10%, we would
expect by chance alone that we would only get 10 and 5 persons from each of our two smaller
groups. And, by chance, we could get fewer than that! If we stratify, we can do better.

20
Jane Muchembi

First, let's determine how many people we want to have in each group. Let's say we still want to
take a sample of 100 from the population of 1000 clients over the past year. But we think that in
order to say anything about subgroups we will need at least 25 cases in each group.

So, let's sample 50 Caucasians, 25 African-Americans, and 25 Hispanic-Americans. We know


that 10% of the population, or 100 clients, are African-American.

If we randomly sample 25 of these, we have a within-stratum sampling fraction of 25/100 =


25%. Similarly, we know that 5% or 50 clients are Hispanic-American. So our within-stratum
sampling fraction will be 25/50 = 50%. Finally, by subtraction we know that there are 850
Caucasian clients.

Our within-stratum sampling fraction for them is 50/850 = about 5.88%. Because the groups are
more homogeneous within-group than across the population as a whole, we can expect greater
statistical precision (less variance). And, because we stratified, we know we will have enough
cases from each group to make meaningful subgroup inferences.

iv. Cluster/Area random sampling


The problem with random sampling methods when we have to sample a population that's
disbursed across a wide geographic region is that you will have to cover a lot of ground
geographically in order to get to each of the units you sampled. Imagine taking a simple random
sample of all the residents of New York State in order to conduct personal interviews. By the
luck of the draw you will wind up with respondents who come from all over the state. Your
interviewers are going to have a lot of traveling to do. It is for precisely this problem that cluster
or area random sampling was invented.

In cluster sampling, we follow these steps:


 divide population into clusters (usually along geographic boundaries)
 randomly sample clusters
 measure all units within sampled clusters
For instance, in the figure we see a map of the counties in New York State. Let's say that we
have to do a survey of town governments that will require us going to the towns personally. If we
do a simple random sample state-wide we'll have to cover the entire state geographically.
May be used when the list of the population is impossible or impractical to make e.g. in a study
on the factors that influence the choice clothing by Christians in the churches in Kenya, it may be
impossible to make a list of all of them.
1. The researcher can create a list of churches in Kenya
2. Randomly select the churches to be in the sample
3. Obtain the list of members in those churches
4. Randomly select the members to put in the sample from that list.

v. Multistage random sampling


The four methods we've covered so far -- simple, stratified, systematic and cluster -- are the
simplest random sampling strategies. In most real applied social research, we would use

21
Jane Muchembi

sampling methods that are considerably more complex than these simple variations. The most
important principle here is that we can combine the simple methods described earlier in a variety
of useful ways that help us address our sampling needs in the most efficient and effective manner
possible. When we combine sampling methods, we call this multi-stage sampling. The sample
is selected in multiple steps e.g. in a study on the factors that influence the choice of shoes by the
university students in Kenya, The researcher can follow the following steps:-
1. Cluster sampling of counties in which the universities are located as the first stage of the
process.
2. We could then have a two-stage sampling process with stratified samples within cluster
samples to select the counties.
3. Randomly select the universities in each sub-county in the sample
4. Randomly select the schools in each university sampled
5. Randomly select the faculties
6. Randomly select the departments
7. Randomly select the classes
8. Randomly select the individuals to be n the sample in each class

Non-probability sampling techniques


i. Quota sampling
ii. A quota sample is one in which the units are selected on the basis of pre-specified
characteristics so that the total sample has the same distribution of characteristics assumed to
exist in the whole population being studied e.g. if from a university the population of the male
is known to be 60% and female is 40% the researcher ensures that the 60% of the sample is
male and 40% is female. However, the individuals in the sample are not selected randomly.

iii. Convenience sampling


Selecting the sample from the most accessible population e.g. shoppers from the nearest
shopping mall if one is collecting data on attitudes of women towards a certain product like
ladies cossets.
Advantages Disadvantages
 Easy access  Can be biased
 Requires less effort  It may not be a true or accurate
 Save time to select representation of the population
 Saves money

iv. Judgmental sampling or Purposive sampling


The sample is selected based on the knowledge the population has on the research topic and the
purpose of the study e.g. a study on choice of residential space by university students in Kenya
only the students know or have information and so the researcher has to deliberately use them in
the sample.

v. Snowballing
Used when the population is difficult to locate or identify e.g. when studying a sensitive
population like prostitutes, gays or lesbians.

22
Jane Muchembi

i. The researcher identifies a few individuals, gathers information from them and requests
them to help identify others in the same category or group.
ii. The researcher then collects information from the ones referred by the previous group
and then asks them to help identify others and the process continues until the sample is
enough.

vi. Volunteer/ self-selection: The research collect information from anybody willing to participate.
This may introduce bias since volunteers are usually different from non-volunteers.

VALIDITY: The validity is the extent to which the data accurately measures what they were
intended to measure.

THE FOUR TYPES OF VALIDITY.


i. Internal Validity: It is the approximate truth about inferences regarding cause-effect or causal
relationships. Thus, internal validity is only relevant in studies that try to establish a causal
relationship. It's not relevant in most observational or descriptive studies, for instance. But for
studies that assess the effects of social programs or interventions, internal validity is perhaps the
primary consideration. In those contexts, you would like to be able to conclude that your program
or treatment made a difference -- it improved test scores or reduced symptomology. But there may
be lots of reasons, other than your program, why test scores may improve or symptoms may
reduce. The key question in internal validity is whether observed changes can be attributed to your
program or intervention (i.e., the cause) and not to other possible causes (sometimes described as
"alternative explanations" for the outcome)

ii. Construct Validity: Construct validity refers to the degree to which inferences can legitimately
be made from the operationalization in your study to the theoretical constructs on which those
operationalization were based.
Like external validity, construct validity is related to generalizing. But, where external validity
involves generalizing from your study context to other people, places or times, construct validity
involves generalizing from your program or measures to the concept of your program or measures.
You might think of construct validity as a "labeling" issue. When you implement a program that
you call a "Head Start" program, is your label an accurate one? When you measure what you term
"self esteem" is that what you were really measuring?

iii. Conclusion Validity: Conclusion validity is the degree to which conclusions we reach about
relationships in our data are reasonable. Conclusion validity is only concerned with whether there
is a relationship. For instance, in a program evaluation, we might conclude that there is a positive
relationship between our educational program and achievement test scores -- students in the
program get higher scores and students not in the program get lower ones. Conclusion validity is
essentially whether that relationship is a reasonable one or not, given the data. But it is possible
that we will conclude that, while there is a relationship between the program and outcome, the
program didn't cause the outcome. Perhaps some other factor, and not our program, was

23
Jane Muchembi

responsible for the outcome in this study. For instance, the observed differences in the outcome
could be due to the fact that the program group was smarter than the comparison group to begin
with. Our observed posttest differences between these groups could be due to this initial difference
and not be the result of our program. This issue -- the possibility that some other factor than our
program caused the outcome -- is what internal validity is all about. So, it is possible that in a
study we can conclude that our program and outcome are related (conclusion validity) and also
conclude that the outcome was caused by some factor other than the program (i.e., we don't have
internal validity).

iv. External Validity: External validity is related to generalizing. That's the major thing you need
to keep in mind. Recall that validity refers to the approximate truth of propositions, inferences, or
conclusions. So, external validity refers to the approximate truth of conclusions the involve
generalizations. Put in more pedestrian terms, external validity is the degree to which the
conclusions in your study would hold for other persons in other places and at other times.

THREATS TO EXTERNAL VALIDITY


A threat to external validity is an explanation of how you might be wrong in making a
generalization. For instance, you conclude that the results of your study (which was done in a
specific place, with certain types of people, and at a specific time) can be generalized to another
context (for instance, another place, with slightly different people, at a slightly later time). There
are three major threats to external validity because there are three ways you could be wrong --
people, places or times. Your critics could come along, for example, and argue that the results of
your study are due to the unusual type of people who were in the study. Or, they could argue that it
might only work because of the unusual place you did the study in (perhaps you did your
educational study in a college town with lots of high-achieving educationally-oriented kids). Or,
they might suggest that you did your study in a peculiar time. For instance, if you did your
smoking cessation study the week after the Surgeon General issues the well-publicized results of
the latest smoking and cancer studies, you might get different results than if you had done it the
week before.

IMPROVING EXTERNAL VALIDITY


How can we improve external validity? One way, based on the sampling model, suggests that you
do a good job of drawing a sample from a population. For instance, you should use random
selection, if possible, rather than a nonrandom procedure. And, once selected, you should try to
assure that the respondents participate in your study and that you keep your dropout rates low. A
second approach would be to use the theory of proximal similarity more effectively. How? Perhaps
you could do a better job of describing the ways your contexts and others differ, providing lots of
data about the degree of similarity between various groups of people, places, and even times. You
might even be able to map out the degree of proximal similarity among various contexts with a
methodology like concept mapping. Perhaps the best approach to criticisms of generalizations is
simply to show them that they're wrong -- do your study in a variety of places, with different
people and at different times. That is, your external validity (ability to generalize) will be stronger
the more you replicate your study.

24
Jane Muchembi

RELIABILITY: Reliability is the extent to which the data collection method will yield consistent
findings if replicated by others.
NB: Validity and reliability of the data you collect depend on the design of the questionnaire and
the words that that are used by the researcher.

3.6. DATA COLLECTION


It refers to the gathering of specific information to serve or prove some facts e.g. data may be
required to prove the need for changing the current price of given products.

THE PURPOSE OF DATA COLLECTION


They include to:-
i. Stimulate new ideas e.g. helping to identify areas related to the research that need improvement
or further evaluation.
ii. Highlight a situation and create awareness and recommend improvement e.g. on the effects of
increased imported second hand “Mutumba” clothes on the local clothing and textile industry and
the economy as a whole.
iii. Influence legislative policies e.g. Minimum wages, taxation, import, export, education etc.
iv. Provide justification for an existing program or need for a new program e.g. need for training
African craft producers on marketing.
v. Promote decision making e.g. decision to increase production level.

TYPES AND SOURCES OF DATA


1. PRIMARY DATA: Is gathered directly from primary sources or the respondents through
questionnaires, interviews, observation or Focus Group discussion. It involves creating new
data.

2. SECONDARY DATA: Is gathered from secondary sources such as books, journals, magazines
etc. it is often referred to as desk research.

DATA COLLECTION INSTRUMENTS AND METHODS OR TECHNIQUES


Data collection methods include Questionnaires, Interviews, Observations and Focus Group
Discussions.

1. QUESTIONNAIRE
A questionnaire is a data collection method that uses an instrument or tool called a
questionnaire.
 This instrument comprises of a list of written down questions based on the research problem,
objectives and questions which if answered will help the researcher to answer the research
questions and meet the research objectives.
 The researcher gives the questionnaire to the respondents to fill on their own either in print form
or online and then analysis the data to make conclusions.

25
Jane Muchembi

 To ensure effectiveness of the instrument a pre-test or pilot test should be carried out on a small
representative sample to enable the researcher find out if the questions are measuring what they
are supposed to measure and if the wording is clear i.e. All questionnaires should be piloted, if
possible, with a small group before the main research to assess their value, validity and
reliability.
The questions used may be:-
a) Closed questions: where a limited number of alternative responses to the set question are
provided. These can be in list, category, ranking, scale/rating, grid or other quantitative form.
They can be pre- coded on a questionnaire to facilitate analysis. e.g. Please tick the right answer
from the boxes shown below i.e. (specific choices of answers to choose from are provided) or

Advantages of closed questions Disadvantages of closed questions


I. Often easier for respondents (particularly i. They limit the choices (of answers) to
those who are busy) to answer respondents and gives them less control over
ii. Easier to collate than open questions their responses
iii. The questionnaire can be easily ii. It is harder to get below the
reproduced by other researchers who wanted surface of an issue
to test your findings

b) Open ended questions i.e. a question is posed, but space is left for the respondent’s own answer
i.e. no suggested answers.
Advantages of open ended questions Disadvantages of open ended questions
I. Enable you to get below the surface, I. The responses can be hard to collate
explore and probe ii. The research may be difficult for others to
ii. Encourages respondents to think and offer reproduce, so your findings may be open to
considered answers doubt or
iii. Encourages respondents to give honest
opinions

There are five types of questionnaire approaches:


1. On-line (electronic)
2. Postal (printed)
3. Delivery & collection (printed)
4. Telephone (electronic)
5. Interview face to face/group (electronic or printed)

SOME GENERAL RULES FOR DESIGNING QUESTIONNAIRES


(Source: Collis & Hussey, 2003)
1. Explain the purpose of the questionnaire to all participants
2. Keep your questions as simple as possible
3. Do not use jargon or specialist language (unless the recipients really prefer and understand it)
4. Phrase each question so that only one meaning is possible

26
Jane Muchembi

5. Avoid vague, descriptive words, such as ‘large’ and ‘small’


6. Avoid asking negative questions as these are easy to misinterpret
7. Only ask one question at a time
8. Include relevant questions only
9. Include, if possible, questions which serve as cross-checks on the answers to other questions
10. Avoid questions which require participants to perform calculations
11. Avoid leading or value-laden questions which imply what the required answer might be
12. Avoid offensive questions or insensitive questions which could cause embarrassment
13. Avoid asking ‘difficult’ questions, e.g. where the respondent may struggle to answer (people
hate to look stupid by not knowing the ‘answer’).
14. Keep your questionnaire as short as possible, but include all the questions you need to cover
your purposes

INCREASING RESPONSE RATES


 As a general rule, a response rate of 30 per cent or greater for a postal/externally sent
questionnaire is generally regarded as reasonable.
 However, a goal of 50 per cent or more responses should be attempted in any questionnaire that
involved face-to-face interviews.
 There are techniques that can help improve response rates to postal or electronic questionnaires:
i. Follow-up calls (especially telephone reminders and special delivery letters)
ii. Pre-contact with respondents(telling them about the questionnaire)
iii. Type of postage (special delivery is superior to ordinary mail; there is also some evidence that
hand-written white envelopes are more likely to be opened than brown/typed!)
iv. Rewards: prizes, or better still, cash incentives.
v. Personalizing the questionnaire: writing to the person by name, e.g. ‘Dear John’ etc.
vi. Emphasizing Confidentiality: ensuring that all views to be published remain anonymous, if
appropriate
vii. Appeals to the respondent: based on the social, personal or other benefits that might flow from
the participation of a respondent.
viii. Postal questionnaires should always include a stamped return envelope and have a covering
letter explaining the purpose of the questionnaire and the use intended for the findings in the
future.
ix. The researcher should include full contact details and the offer to discuss the questionnaire with
any respondent who has doubts or queries about it.
x. The researcher should always offer to share the research findings with any participant, if
requested, and this offer is best made in the covering letter.

QUALITIES OF A GOOD QUESTIONNAIRE


i. It should be easy to understand. The language used is clear and straight forward. This helps to
reduce misconception.
ii. Instructions are clearly given. There are few words of explanation in each new section.
iii. The questions are focused and limited to a single idea. Sentences are short and precise.

27
Jane Muchembi

iv. Each question included has a specific purpose, and contributes to the study.
v. There are no leading questions.
vi. There is a balance of questions per topic.
BASIC PRINCIPLES TO BE OBSERVED WHEN PREPARING A QUESTIONNAIRE
i.Begin by giving a short rationale explaining the objectives of the study.
ii.Explain the potential benefits to the target population in terms they can easily understand.
iii.Make it as brief and simple to fill as possible and avoid irrelevant questions.
iv. Begin with straightforward non-threatening questions which the respondents will find easy to
answer. Personal questions should be left until the end.
v. Make the questionnaire attractive and spacious as a crowded questionnaire will discourage
responses.

Advantages of using questionnaire Disadvantages of using questionnaire


i. Can be used to collect a lot of i. No direct contact with the respondents and therefore
information cheaply. difficult to clarify misconceptions and
ii. No bias on the side of the misunderstandings
researcher ii. Responses can be low because of lack of direct contact
iii. Data analysis is easy because of use iii. No opportunity for probing or asking for further
of closed-ended questions information
iv. Confidentiality can be achieved iv. If some questions are not answered there is no chance
of asking why

2. INTERVIEW
An interview is a method of collecting data in which questions are asked orally and answers
recorded by the researcher or research assistant. This gives the interviewer an opportunity to
probe or ask for further clarification to get deeper understanding of the topic. The interviewer
can also clarify questions that may be misunderstood or misinterpreted.

FORMS OF INTERVIEWS

a) UNSTRUCTURED INTERVIEW
It is a method of collecting data in which:-
 The researcher has some idea and may be a list of the topics to be covered but no specific
questions to be asked to the respondents.
 They are informal and conversational with aim of encouraging the respondents to feel free to
open up and give information freely.
 It is suitable when studying sensitive topics such as sexuality, some cultural issues etc.
 Since no possible answers are suggested respondents give a wide variety of answers and
therefore data analysis can be difficult.

Advantages of unstructured interview Disadvantages of unstructured interview


i. They are flexible because there are no i. They are time consuming because the

28
Jane Muchembi

predetermined questions respondents may dwell too much on one on one


ii. Respondents freely give information issue
iii.Information given is spontaneous and so ii. They are not systematic
reliable iii. The respondents may get become emotional ,
iv. A lot of information can be gathered personal and get out of control
v. The interviewer is able to respond to iv. Responses may be irrelevant
individual differences and situational v.It may be difficult to analyze data
characteristics.

b) SEMI STRUCTURED INTERVIEW


It is a method of collecting data in which:-
 The interviewer is guided by an interview guide or schedule. This a list of topics and questions to
be asked to the respondents.

Advantages of semi-structured interview Disadvantages of semi-structured interview


i. They are flexible because they consist of i. They are time consuming because the of open
both open and closed questions ended questions
ii. In-depth Information given ii. It may be difficult to analyze data
iii. A lot of information can be gathered iii. The respondents may be cautious of the
through closed-ended questions answers given in closed-ended questions

c) STRUCTURED INTERVIEW
It is a method of collecting data in which:-
 The interviewer is guided by an instrument called interview guide or schedule.
 All the respondents are subjected to the same questions.

Advantages of structured interview Disadvantages of structured interview


i. High reliability of information gathered i. The rigidity displayed by the researcher can
because all the subjects are subjected to the affect the responses given. The respondents
same questions. may feel as if he/she is under investigation and
ii. It gives in-depth information about particular being probed. This may affect the responses as
cases of interest to the researcher. This is some may become hostile.
because the researcher seeks information on ii. It is too formal. Since the researcher does the
specific issues. questioning and the respondents simply
iii. It is systematic. Researchers intensively answers, the respondent may be too cautious of
investigate a particular issue before moving to the answers given. The respondents may give
the next. answers he/she thinks are acceptable or will
iv. It is time saving since the respondents simply impress the researcher.
answer what has been asked by researcher. iii. The researcher may miss out on some
v. The researcher gets a complete and detailed important points that are not included in the
understanding of the issue from the questions formulated.
respondents.
vi. It is comprehensive and systematic since

29
Jane Muchembi

questions are formulated before the interview.


vii. The data collected is quantifiable.

STEPS IN CONDUCT OF THE INTERVIEW


The opening stages of an interview are particularly important.
Saunders, Lewis and Thornhill (2003) make the following suggestions on conducting a semi-
structured interview:
i. The interviewee is thanked for agreeing to the meeting
ii. The purpose of the research, its funding (if relevant) and progress to date are briefly outlined
iii. The interviewee is given an assurance regarding confidentiality
iv. The interviewee’s right not to answer questions is emphasized and the interview could be
terminated at any time by the interviewee
v. The interviewee is told about the use intended to be made of the data collected during and after
the project
vi. The offer of any written documentation to the interviewee promised in advance of the meeting
should be emphasized
vii. The interviewer describes the process of the interview, e.g. approximate number and range of
questions to be asked and the time it is likely to take.
viii. The researcher would want to intervene in the following situations:
a) If one group member was dominating the discussion
b) If the group strayed from discussing the topic in question
c) To encourage quieter members of the group to contribute to the discussion
d) To resolve any conflicts that arose between group members

PROBLEM AREAS ASSOCIATED WITH INTERVIEWS


There are a number of potential problem areas associated with interviews:
1. Bias
2. Demeanour (character, appearance, behaviour) of interviewer
3. Suspicion of the interviewer
4. Conduct of interview
5. Confidentiality
BIAS
 Bias is one of the most significant issues in interviewing, as it can affect the responses of the
interviewee to the interviewer, and vice versa.
 Interviews are human encounters and a range of issues can influence and colour the perceptions
of the people we encounter – including interviewers.
 We like or dislike someone, often without quite knowing why, and this can affect our responses
to them.
 A range of factors can come into play: gender, race, age, speech, appearance and attitude.
 Sexual bias is a particular significant factor. Rosenthal (1966) has suggested that there is the
possibility of sexual bias in interviewers and that both male and female researchers behave more
warmly towards female subjects than they do towards male subjects.

30
Jane Muchembi

DEMEANOUR OF INTERVIEWER
 Generally speaking, we often like people who appear to like us!
 The interviewer should then, appear to be interested in the interviewee, but in a neutral and
detached way: ‘neutrally interested’ is perhaps the best way of describing the best interviewer-
interviewee relationship.
 The tone of voice of the interviewer is important, as it should project an impression of quiet
confidence and quiet enthusiasm in the topic under discussion.
 Torrington (1991) suggests that an open posture is best, where the interviewer sits slightly
forward toward the interviewee, keeps regular eye contact, and avoids folded arms.
 The interviewer must avoid appearing shocked, disbelieving or astonished by comments made by
interviewees.
 The interviewer wear clothes similar to those of the interviewees: too scruffy, or too over-
dressed can affect the credibility of the interviewer.
SUSPICION OF THE INTERVIEWER
 People are increasingly suspicious of interviewers and their motives.
 Most people have experience of being stopped in the street by an interviewer who appears to be
asking questions in a neutral way but is really seeking to make a marketing contact for a
commercial organization.
 The true purpose of the interview should be carefully explained to the interviewee and how the
data collected will be used.
 Wherever possible, the student researcher should have a letter from a University tutor explaining
the research initiative, e.g. it is part of a legitimate first or post-graduate course.
 Wherever possible, the interviewer should send details of the interview process and agenda to
interviewees in advance, for example explaining the estimated length of time it will take, the
aim and purpose of the questions to be asked and the range of questions likely to be asked.

CONFIDENTIALITY
 Part of the suspicion shown to ward interviewers concerns the issue of confidentiality, and
interviewees may be worried about disclosing sensitive information in case it has negative
repercussions in some way against them.
 Interviewees may also be concerned that their personal details would be passed on to commercial
organisations and that they would be subsequently pestered to buy things.
 Interviewees must be given a complete reassurance about confidentiality and told who will see
the data obtained – and don’t forget this is likely to include at least two university tutors and
possibly one external examiner
 The interviewee should be assured that his or her views will be generalized in the final report or
that any direct quotes used would remain anonymous unless the interviewee wishes otherwise.
 The interviewer needs subsequently to take pains to avoid revealing the identity of respondents
by using false names and not giving any clues to the identity of any interviewee.

3. OBSERVATION

31
Jane Muchembi

It a method or technique of collecting data that uses tool or instrument called observation schedule
or guide. It provides information about the actual behavior of the respondents.

FORMS OF OBSERVATION
a) Participant observation: is a qualitative data collection method in which the researcher
becomes an active functioning member of the population being investigated e.g. a designer
wishing to study university students dressing behavior enrolls as a student. Its emphasis is on
discovering meanings that people attach to their actions (Saunders, Lewis and Adrian 2004 and
Kombo and Tromp 2006).

b) Structured observation: I quantitative and more concerned with frequency of those actions.
The observer/researcher is an onlooker (Saunders, Lewis and Adrian 2004).

c) Unstructured observation: The observer/researcher takes the position of an onlooker. Data


collected is in descriptive accounts (Kombo and Tromp 2006).

ADVANTAGES OF DISADVANTAGES OF PARTICIPANT OBSERVATION


PARTICIPANT
OBSERVATION
It allows the researcher to Access to an organization for overt participation may be very
observe at first hand and get difficult
below the surface of a particular
situation
It heightens the awareness of a There may be role-conflict. Researchers may lose their objectivity
researcher to social processes if they become too close emotionally to the people they are
that can influence behaviour observing.
It allows the researcher to The researcher may encounter suspicion and even hostility from
observe people, who suspect the motives for the research and may be
the relationship of different worried about their livelihoods.
variables
It can enable a researcher to gain The presence of a researcher (overt participation) can have an
insights into a situation that impact on the behaviour of the people being observed: the
otherwise would not have been ‘observer
recognized or observed effect’. They may change the way they behave if they are aware
they
are being observed.
It can allow, in the case of overt It is very time-consuming and can generate a large amount of
participation, the researcher to data/commentary that will need to be collated
gain the trust of the people being Recording and collating data from observations can be difficult
observed and the researcher may need to adopt structured observation,
where
tasks/responses etc. are identified and broken down into elements
and the frequency of actions or responses noted and later collated.
Taking notes in any covert situation presents real problems, as the
researcher may not be a position to openly record observations. In

32
Jane Muchembi

this situation, the researcher must record his or her observations


as
soon as possible afterwards.

4. FOCUS GROUP DISCUSSIONS


It a method or technique of collecting data in which data is collected through focus group
discussions.
According to Sekaran (2003) Focus Group Discussions:-
 Comprise of 8-10 members with homogeneous characteristics relevant to the study, a moderator
and recorder. The moderator leads the discussions on a specific and given topic for about two
hours and recorder records all the proceedings.
 Members are chosen on the basis of their expertise on the topic being discussed.
 The unstructured and spontaneous responses are expected to reflect genuine opinions ideas and
feelings of the members about the topic being discussed

STEPS TO FOLLOW WHEN COLLECTING DATA


i. Acquire an introduction letter from the body or institution commissioning him/her to go and do
research. This letter introduces the researcher to the people from whom assistance may be
needed.
ii. ii. Acquire a permit (permission to collect the data in the given area e.g. from the local
authorities to avoid conflicts with them.
iii. Arrange with the respondents when to meet (book appointment) i.e. do not ambush them as this
could make them fail to cooperate.
iv. Administer the instruments e.g. questionnaires
v. Follow up non-respondents to increase responses e.g. by visiting them or making phone calls.

ETHICAL ISSUES IN DATA COLLECTION


They include:-
i. The researcher must justify the research through balance of costs and benefits. Benefits should
outweigh the costs.
ii. The researcher must maintain confidentiality at all times. Only those who must know the
respondents should know them. Unless it is inevitable to avoid knowing the identity of the
respondents the research should avoid knowing them.
iii. The researchers should be responsible of their own work and their contribution to the whole
study. Accept individual responsibility and bear the consequences of the research.
iv. The researcher must obtain informed consent from the subjects used in the study and must
ensure that all the subjects participate voluntarily.
v. The researcher must be open and honest in dealing with other researchers and research
subjects i.e. must not exploit them by changing the agreements made with them.
vi. The researcher must take all reasonable measures to protect the respondents physically and
psychologically.
vii. The researcher must fully explain the research in advance and debrief the subjects afterwards.

Saunders, Lewis and Thornhill (2003, p. 131) summarise the main issues

33
Jane Muchembi

to consider, although the ethical issues surrounding these items are not
always clear-cut:
i. The rights of privacy of individuals
ii. Voluntary nature of participation – and the rights of individuals to withdraw partially or
completely from the process
iii. Consent and possible deception of participants
iv. Maintenance of the confidentiality of data provided by individuals or identifiable
participants and their anonymity
v. Reactions of participants to the ways in which researchers seek to collect data
vi. Effects on participants of the way in which data is analyzed and reported
vii. Behaviour and objectivity of the researcher

3.7 DATA ANALYSIS AND PRESENTATION


Raw data means nothing to the researcher without it being analyzed and interpreted using proper
tools. Data analysis refer to examining the data that has been collected and making deductions or
inferences or judgments or conclusions. It involves uncovering underlying structures, extracting
important variables, detecting any anomalies and testing any underlying assumptions. It involves
scrutinizing the acquired information and making inferences.

Data analysis involves examining data in ways that reveal the relationships, patterns, trends, etc. that
can be found within it. That may mean subjecting it to statistical operations that can tell you not only
what kinds of relationships seem to exist among variables, but also to what level you can trust the
answers you’re getting.  It may mean comparing your information to that from other groups (a
control or comparison group, statewide figures, etc.), to help draw some conclusions from the
data. The point, in terms of your evaluation, is to get an accurate assessment in order to better
understand your work and its effects on those you’re concerned with, or in order to better understand
the overall situation.

There are two kinds of data the researcher may be working with, although not all evaluations will
necessarily include both:-
i. Quantitative data
This refers to the information that is collected as, or can be translated into, numbers, which can then
be displayed and analyzed mathematically. Some examples include:
 The frequency (rate, duration) of specific behaviors or conditions
 Test scores (e.g., scores/levels of knowledge, skill, etc.)
 Survey results (e.g., reported behavior, or outcomes to environmental conditions; ratings
of satisfaction, stress, etc.)
 Numbers or percentages of people with certain characteristics in a population (e.g. have
design skills, have carpeted offices, wear high heeled shoes, buy second hand clothes etc.)

Data can also be collected in forms other than numbers, and turned into quantitative data for
analysis. Researchers can count the number of times an event is documented in interviews or
records, for instance, or assign numbers to the levels of intensity of an observed event or

34
Jane Muchembi

behavior. For instance, community initiatives often want to document the amount and intensity of
environmental changes they bring about – the new programs and policies that result from their
efforts. Whether or not this kind of translation is necessary or useful depends on the nature of what
you’re observing and on the kinds of questions your evaluation is meant to answer.

Quantitative data is usually subjected to statistical procedures such as calculating the mean or
average number of times an event or behavior occurs (per day, month, year). These operations,
because numbers are “hard” data and not interpretation, can give definitive, or nearly definitive,
answers to different questions. Various kinds of quantitative analysis can indicate changes in a
dependent variable related to – frequency, duration, timing (when particular things happen),
intensity, level, etc. They can allow you to compare those changes to one another, to changes in
another variable, or to changes in another population. They might be able to tell you, at a particular
degree of reliability, whether those changes are likely to have been caused by your intervention or
program, or by another factor, known or unknown. And they can identify relationships among
different variables, which may or may not mean that one causes another.

ii. Qualitative data


These are data collected as descriptions, anecdotes, opinions, quotes, interpretations, etc., and are
generally either not able to be reduced to numbers, or are considered more valuable or informative
if left as narratives. As you might expect, quantitative and qualitative information needs to be
analyzed differently.
Unlike numbers or “hard data,” qualitative information tends to be “soft,” meaning it can’t always
be reduced to something definite. That is in some ways a weakness, but it’s also a strength. A
number may tell you how well a student did on a test; the look on her face after seeing her grade,
however, may tell you even more about the effect of that result on her. That look can’t be translated
to a number, nor can a teacher’s knowledge of that student’s history, progress, and experience, all
of which go into the teacher’s interpretation of that look. And that interpretation may be far more
valuable in helping that student succeed than knowing her grade or numerical score on the test.

Qualitative data can sometimes be changed into numbers, usually by counting the number of times
specific things occur in the course of observations or interviews, or by assigning numbers or ratings
to dimensions (e.g., importance, satisfaction, ease of use). The challenges of translating qualitative
into quantitative data have to do with the human factor.  Even if most people agree on what 1
(lowest) or 5 (highest) means in regard to rating “satisfaction” with a program, ratings of 2, 3, and 4
may be very different for different people.  Furthermore, the numbers say nothing about why people
reported the way they did. One may dislike the program because of the content, the facilitator, the
time of day, etc. The same may be true when you’re counting instances of the mention of an event,
such as the onset of a new policy or program in a community based on interviews or archival
records.  Where one person might see a change in program he considers important another may
omit it due to perceived unimportance.

Qualitative data can sometimes tell you things that quantitative data can’t.  It may reveal why
certain methods are working or not working, whether part of what you’re doing conflicts with
participants’ culture, what participants see as important, etc.  It may also show you patterns – in
behavior, physical or social environment, or other factors – that the numbers in your quantitative
data don’t, and occasionally even identify variables that researchers weren’t aware of.

35
Jane Muchembi

NB: It is often helpful to collect both quantitative and qualitative information.

Quantitative analysis is considered to be objective – without any human bias attached to it –


because it depends on the comparison of numbers according to mathematical computations. 
Analysis of qualitative data is generally accomplished by methods more subjective – dependent
on people’s opinions, knowledge, assumptions, and inferences (and therefore biases) – than that of
quantitative data.  The identification of patterns, the interpretation of people’s statements or other
communication, the spotting of trends – all of these can be influenced by the way the researcher
sees the world.  Be aware, however, that quantitative analysis is influenced by a number of
subjective factors as well.  What the researcher chooses to measure, the accuracy of the
observations, and the way the research is structured to ask only particular questions can all
influence the results, as can the researcher’s understanding and interpretation of the subsequent
analyses.

STEPS OF DATA ANALYSIS


These days there are several user-friendly and interactive software programs that are used to
analyze data such as Excel, SPSS, SAS, STATPAK, SYSTAT etc. They have an ability to interface
with different databases and generate graphs and charts.

In most social research the data analysis involves three major steps, done in roughly this order:

i. Cleaning and organizing the data for analysis (Data Preparation)


ii. Describing the data (Descriptive Statistics)
iii. Testing Hypotheses and Models (Inferential Statistics)

i. Cleaning and organizing the data for analysis/Preparing Data


 Data Preparation involves checking or logging the data in; checking the data for accuracy;
entering the data into the computer; transforming the data; and developing and documenting a
database structure that integrates the various measures.
 Once all of the participants have completed the study measures and all of the data has been
collected, the researcher must prepare the data to be analyzed. 
 It involves:-
i. Editing data: This involve checking data for accuracy and consistency.
As soon as data is received you should screen it for accuracy. In some circumstances doing this
right away will allow you to go back to the sample to clarify any problems or errors. There are
several questions you should ask as part of this initial data screening:
 Are the responses legible/readable?
 Are all important questions answered? Sometimes a respondent may fail to answer a
question like do you have any children? But answers the following related question “If
yes how many? By writing five children. The researcher has to get back to the respondent
to find out why the first question was left out or assume that the answer is yes.
 Are the responses complete?
 Is all relevant contextual information included (e.g., data, time, place, researcher)?

36
Jane Muchembi

In most social research, quality of measurement is a major issue. Assuring that the data
collection process does not contribute inaccuracies will help assure the overall quality of
subsequent analyses.

ii. Handling blank responses: If many questions e.g. 25% of the questions are not answered you
may decide to discard the questionnaire and not include it in data set for analysis and report or
account for them in your report. If there are only a few blank responses then decide how to
handle them.

iii. Coding data: involves allocating codes to possible answers e.g. in a question where possible
answers are yes or no, you may assign number one (1) to “yes” and (2) to “No”. Where there are
five possible answers you can have codes 1-5.

iv. Categorizing data: This involve grouping questions addressing the same variable together.

v. Creating data file/ entering the Data into the Computer


Key in or enter the data in the computer using a program of your choice e.g. spreadsheet, SPSS
There are a wide variety of ways to enter the data into the computer for analysis. Probably the
easiest is to just type the data in directly. In order to assure a high level of data accuracy, the
analyst should use a procedure called double entry. In this procedure you enter the data once.
Then, you use a special program that allows you to enter the data a second time and checks each
second entry against the first. If there is a discrepancy, the program notifies the user and allows
the user to determine the correct entry. This double entry procedure significantly reduces entry
errors. However, these double entry programs are not widely available and require some training.
An alternative is to enter the data once and set up a procedure for checking the data for accuracy.
For instance, you might spot check records on a random basis. Once the data have been entered,
you will use various programs to summarize the data that allow you to check that all the data are
within acceptable limits and boundaries. For instance, such summaries will enable you to easily
spot whether there are persons whose age is 601 or who have a 7 entered where you expect a 1-
to-5 response

vi. Programming :

NB:
 Organizing the data correctly can save a lot of time and prevent mistakes. 
 Most researchers choose to use a database or statistical analysis program (Microsoft Excel,
SPSS) that they can format to fit their needs in order to organize their data effectively. 
 A good researcher enters all of the data in the same format and in the same database, as doing
otherwise might lead to confusion and difficulty with the statistical analysis later on. 
 Once the data has been entered, it is crucial that the researcher check the data for accuracy. 
 This can be accomplished by spot-checking a random assortment of participant data groups, but
this method is not as effective as re-entering the data a second time and searching for
discrepancies. 

37
Jane Muchembi

 This method is particularly easy to do when using numerical data because the researcher can
simply use the database program to sum the columns of the spreadsheet and then look for
differences in the totals. 
 Perhaps the best method of accuracy checking is to use a specialized computer program that
cross-checks double-entered data for discrepancies (as this method is free from error), though these
programs can be hard to come by and may require extra training to use correctly.

Developing a Database Structure


The database structure is the manner in which you intend to store the data for the study so that it
can be accessed in subsequent data analyses. You might use the same structure you used for
logging in the data or, in large complex studies, you might have one structure for logging data
and another for storing it. As mentioned above, there are generally two options for storing data
on computer:-
i. Database programs
ii. Statistical programs.
Usually database programs are the more complex of the two to learn and operate, but they allow
the analyst greater flexibility in manipulating the data.

In every research project, you should generate a printed codebook that describes the data and
indicates where and how it can be accessed. Minimally the codebook should include the
following items for each variable:
 variable name
 variable description
 variable format (number, data, text)
 instrument/method of collection
 date collected
 respondent or group
 variable location (in database)
 notes

The codebook is an indispensable tool for the analysis team. Together with the database, it
should provide comprehensive documentation that enables other researchers who might
subsequently want to analyze the data to do so without any additional information.

Data Transformations
Once the data have been entered it is almost always necessary to transform the raw data into
variables that are usable in the analyses. There are a wide variety of transformations that you
might perform. Some of the more common are:
 missing values
Many analysis programs automatically treat blank values as missing. In others, you need to
designate specific values to represent missing values. For instance, you might use a value of -99
to indicate that the item is missing. You need to check the specific program you are using to
determine how to handle missing values.
 item reversals

38
Jane Muchembi

On scales and surveys, we sometimes use reversal items to help reduce the possibility of a
response set. When you analyze the data, you want all scores for scale items to be in the same
direction where high scores mean the same thing and low scores mean the same thing. In these
cases, you have to reverse the ratings for some of the scale items. For instance, let's say you had
a five point response scale for a self-esteem measure where 1 meant strongly disagree and 5
meant strongly agree. One item is "I generally feel good about myself." If the respondent
strongly agrees with this item they will put a 5 and this value would be indicative of higher self
esteem. Alternatively, consider an item like "Sometimes I feel like I'm not worth much as a
person." Here, if a respondent strongly agrees by rating this a 5 it would indicate low self esteem.
To compare these two items, we would reverse the scores of one of them (probably we'd reverse
the latter item so that high values will always indicate higher self esteem). We want a
transformation where if the original value was 1 it's changed to 5, 2 is changed to 4, 3 remains
the same, 4 is changed to 2 and 5 is changed to 1. While you could program these changes as
separate statements in most program, it's easier to do this with a simple formula like:

New Value = (High Value + 1) - Original Value

In our example, the High Value for the scale is 5, so to get the new (transformed) scale value, we
simply subtract each Original Value from 6 (i.e., 5 + 1).
 scale totals
Once you've transformed any individual scale items you will often want to add or average across
individual items to get a total score for the scale.
 categories
For many variables you will want to collapse them into categories. For instance, you may want to
collapse income estimates (in dollar amounts) into income ranges.

ii. Describing the data (Descriptive Statistics) : Descriptive Statistics are used to describe the
basic features of the data in a study. They provide simple summaries about the sample and the
measures. Together with simple graphics analysis, they form the basis of virtually every quantitative
analysis of data. With descriptive statistics you are simply describing what is, what the data shows.

iii. Testing Hypotheses and Models (Inferential Statistics): Inferential Statistics investigate
questions, models and hypotheses. In many cases, the conclusions from inferential statistics extend
beyond the immediate data alone. For instance, we use inferential statistics to try to infer from the
sample data what the population thinks. Or, we use inferential statistics to make judgments of the
probability that an observed difference between groups is a dependable one or one that might have
happened by chance in this study. Thus, we use inferential statistics to make inferences from our
data to more general conditions; we use descriptive statistics simply to describe what's going on in
our data.

In most research studies, the analysis section follows these three phases of analysis. Descriptions of
how the data were prepared tend to be brief and to focus on only the more unique aspects to your
study, such as specific data transformations that are performed. The descriptive statistics that you
actually look at can be voluminous. In most write-ups, these are carefully selected and organized
into summary tables and graphs that only show the most relevant or important information. Usually,
the researcher links each of the inferential analyses to specific research questions or hypotheses that

39
Jane Muchembi

were raised in the introduction, or notes any models that were tested that emerged as part of the
analysis. In most analysis write-ups it's especially critical to not "miss the forest for the trees." If
you present too much detail, the reader may not be able to follow the central line of the results.
Often extensive analysis details are appropriately relegated to appendices, reserving only the most
critical analysis summaries for the body of the report itself.

TYPES OF STATISTICS USED IN DATA ANALYSIS


1. Descriptive Statistics
 They are used to describe the basic features of the data in a study. They provide simple
summaries about the sample and the measures. Together with simple graphics analysis, they form
the basis of virtually every quantitative analysis of data. With descriptive statistics you are simply
describing what is, what the data shows.
 Descriptive statistics describe the data but do not draw conclusions about the data. 
 Descriptive statistics are normally applied to a single variable at a time. 
 They can tell the researcher the central tendency of the variable, meaning the average score of a
participant on a given study measure. 
 The researcher can also determine the distribution of scores on a given study measure, or the
range in which scores appear. 
 Finally, descriptive statistics can be used to tell the researcher the frequency with which certain
responses or scores arise on a given study measure.  For example, in our imaginary study about
the effectiveness of corrective lenses on economic productivity, the researcher might observe that
the average dollars-per-week of a person with corrected vision is $500, whereas the average DPW
for a person without corrected vision is $450.  A good researcher will know that this is not enough
information to conclude that vision correction has an effect on economic productivity.
 Inferential statistics are necessary to draw conclusions of this kind. 

Examples of descriptive statistics


i. Frequency distribution is a simple data analysis technique which allows you to get a big
picture of the data. From frequency distribution, you can see how frequently the specific values
are observed and what their percentages are for the same variable. For example: for variable of
“age,” you can use frequency distribution to figure out how many people in the survey are aged
18 to 25, and how many are aged 26 to 33, etc… Histogram is a great tool in Excel to recognize
frequency distribution in data like this. From the frequency distribution we can figure out the
frequency of the values observed, as shown in the “age example” above.

ii. Measures of central tendency


We can use the measures of central tendency and dispersion to learn more about the data for
“age.” E.g. Mean, median and mode. Mean, median and mode are the three measures of central
tendency.
a) “Mean” is the average value. It equals to the sum of all the values divided by the numbers of
observations. It is the most popular measure of central tendency, especially when the data set
does not have an outlier.
b) “Median” is the value in the middle when all the values are lined in order (assuming there is
an odd number of values). If there are even numbers of values, the median is the average of the

40
Jane Muchembi

two numbers in the middle. It is useful when the data set has an outlier and values distribute very
unevenly.
c) “Mode” is the value which is observed most often. It is useful when the data is non-numeric
or when asked to find the most popular item.

iii. Measures of dispersion


Range and standard deviation are the basics measures of dispersion. The bigger the range and
bigger the standard deviation, the more dispersed the values are.

v. “Range” is the difference of the maximum value and the minimum value for the variable. For
the “age example”, the maximum value is 54 and the minimum value is 19. So the range is 35.

vi. “Standard deviation” shows how much variation the value exits from the mean. Variance is
the average of squared difference from the mean. Standard deviation is the square root of
variance.

2. Inferential Statistics
 Investigate questions, models and hypotheses.
 In many cases, the conclusions from inferential statistics extend beyond the immediate data
alone. For instance, we use inferential statistics to try to infer from the sample data what the
population thinks.
 Or, we use inferential statistics to make judgments of the probability that an observed difference
between groups is a dependable one or one that might have happened by chance in this study.
 Thus, we use inferential statistics to make inferences from our data to more general conditions;
we use descriptive statistics simply to describe what's going on in our data.
 Inferential statistics allow the researcher to begin making inferences about the hypothesis on the
basis of the data collected.  This means that, while applying inferential statistics to data, the
researcher is coming to conclusions about the population at large.  Inferential statistics seek to
generalize beyond the data in the study to find patterns that ostensibly exist in the target
population.  This course will not address the specific types of inferential statistics available to the
researcher, but a succinct and very useful summary of them, complete with step-by-step examples
and helpful descriptions, is available

Examples of Inferential statistics

i. T-Tests
T-Tests are used to test if the difference of means is statistically significant. It tests if the sample
is representative of the populations. For example, if the mean for variable 1 is 20 and the mean
for variable 2 is 28, you may say the means are different. T-Tests may show you that they are not
significantly different, however, and you can’t base your conclusion on the means’ difference
since the difference in the sample is not representative for the population.

41
Jane Muchembi

This analysis is appropriate whenever you want to compare the means of two groups, and
especially appropriate as the analysis for the posttest-only two-group randomized experimental
design.

ii. Correlation
 The correlation is one of the most common and most useful statistics. A correlation is a single
number that describes the degree of relationship between two variables. Let's work through an
example to show you how this statistic is computed.
 Correlations are used when you want to know about the relationship between two variables. For
example, you want to know consumers’ willingness to pay and their ratings for the product quality.
If the correlation is 1, meaning the willingness to pay and the ratings for the product quality are
completely positively correlated and if the correlation is 0, meaning there is no correlation between
these two variables. If the correlation is -1, it shows they are completely negatively correlated,
meaning the higher one variable, the lower the other variable. If the absolute value of the variables
is bigger than 0.5, they are usually significant.
 Correlation is one of the most often used (and most often misused) kinds of descriptive statistics. 
 It is perhaps best described as a single number that describes the degree of relationship between
two variables.
  If two variables tend to be “correlated,” that means that a participant’s score on one tends to
vary with a score on the other.  For example, people’s height and shoe size tend to be positively
correlated.  This means that for the most part, if a given man is tall, he is likely to have a large
shoe size.  If short, he is likely to have a smaller shoe size. 
 Correlation can also be negative. For example, the temperature outside in Fahrenheit may be
negatively correlated with the number of hot chocolates sold at a local coffee shop.  This is to
say that as the temperature goes down, hot chocolate sales tend to go up. 
 Although causality may seem to be implied in this situation, it is important to note that on a
statistical level, correlation does not imply causation. 
 A good researcher knows that there is no way to assess from correlation alone that a causal
relationship exists between two variables.
 In order to assert that “X caused Y”, a study should be experimental, with control groups and
random sampling procedures. 
 Determining causation is a difficult thing to do, and it is a common mistake to assert a cause-
and-effect relationship when the study methodology does not support this assertion.

iii. Linear regression


Regression is a more accurate way to test the relationship between the variables compared with
correlations since it shows the goodness of fit (Adjusted R Square) and the statistical testing for
the variables. The formulas for one-variable regressions is y = ax + b and for multiple
regressions is y = ax12 + bx2 + c.

For y = ax + b, y is the dependent variable, x is the causal variable and the intercept is a,
indicating the correlation between x and y. If “a” is 0.2 for example, it means when x variable
increases 1 unit, y increases 0.2 units. If “a” is negative, meaning y decreases as x increases.

For y = ax12 + bx2 + c, y is the dependent variable, x1 is causal variable 1 and x2 is causal variable
2. “a” is the intercept for variable 1 and “b” for variable 2. For example, if y = 0.6 x12 – 0.4 x2 +

42
Jane Muchembi

0.23, it means when x1 increases 1 unit, y increases 0.6 units and when x2 increases 1 unit, y
decreases 0.4 units. (Given the variables are statistically significant.)

Regression is a more accurate way to test the relationship between the variables compared with
correlations since it shows the goodness of fit (Adjusted R Square) and the statistical testing for
the variables. The formulas for one-variable regressions is y = ax + b and for multiple
regressions is y = ax12 + bx2 + c.

For y = ax + b, y is the dependent variable, x is the causal variable and the intercept is a,
indicating the correlation between x and y. If “a” is 0.2 for example, it means when x variable
increases 1 unit, y increases 0.2 units. If “a” is negative, meaning y decreases as x increases.

For y = ax12 + bx2 + c, y is the dependent variable, x1 is causal variable 1 and x2 is causal variable
2. “a” is the intercept for variable 1 and “b” for variable 2. For example, if y = 0.6 x12 – 0.4 x2 +
0.23, it means when x1 increases 1 unit, y increases 0.6 units and when x2 increases 1 unit, y
decreases 0.4 units. (Given the variables are statistically significant.)

Statistical Significance
 Researchers cannot simply conclude that there is a difference between two groups in a well-
constructed study. This difference must be due to the manipulation of the independent variable. 
 No matter how well a researcher designs the study, there always exists a degree of error in the
results. 
 This error can be due to individual differences both within and between experimental groups, or
the error can be due to systematic differences within the researcher’s sample. 
 Irrespective of its source, this error acts as a kind of “noise” in the data. It affects participants’
scores on study measures even though it is not the variable of interest. 
 Statistical significance is aimed at determining the probability that the observed result of a study
was due to the influence of the independent variable rather than by chance. 
 A result is “statistically significant” at a certain level.  For example, a result might be significant
at p<.05.  “P” represents the probability that the result was due to chance, and .05 represents a 5%
probability that the result was due to chance. 
 Therefore, p<.05 means that inferential statistical analysis has indicated that the observed results
have over a 95% probability of being due to the influence of the independent variable. 
 The 5% cutoff is generally thought of as the standard for most scientific research.   Note that it is
theoretically impossible to ever be entirely certain that one’s results are not due to chance, as the
nature of science is one of falsification, not immutable proof.

iv. Analysis of Variance (ANOVA): It helps to examine the significant mean differences among
more than two groups on an interval or ratio scaled dependent variable.

Data analysis procedures (DAP) / Techniques/Methods

43
Jane Muchembi

Data analysis procedures (DAP) help the researcher to arrive at the data analysis. The uses of such
procedures put the research project in perspective and assist the researcher in testing the
hypotheses with which he/she has started the research. Hence with the use of DAP, one can:-

i. Convert data into information and knowledge, and


ii. Explore the relationship between variables.

NB: Understanding of the data analysis procedures/techniques helps the researcher to:-

 appreciate the meaning of the scientific method, hypotheses testing and statistical significance in
relation to research questions
 realize the importance of good research design when investigating research questions
 have knowledge of a range of inferential statistics and their applicability and limitations in the
context of your research
 be able to devise, implement and report accurately a small quantitative research project
 be capable of identifying the data analysis procedures relevant to your research project
 show an understanding of the strengths and limitations of the selected quantitative and/or
qualitative research project
 demonstrate the ability to use word processing, project planning and statistical computer packages
in the context of a quantitative research project and report
 be adept of working effectively alone or with others to solve a research question/ problem
quantitatively.

STATISTICAL DATA ANALYSIS


Statistical data analysis divides data analysis methods into two categories:-
i. Exploratory methods: used to discover what data seems to be saying by using simple arithmetic
and easy-to-draw pictures to summarize data. This is mainly used in qualitative research.
ii. Confirmatory methods: Use ideas from probability theory in an attempt to answer specific
questions. These methods are mainly applicable in quantitative research.
NB: The methods used to analyze data are influenced by whether the research is qualitative or
quantitative.

DATA ANALYSIS IN QUALITATIVE RESEARCH


 Qualitative research involves intensive data collection (of several variables) over an extended
period of time in a natural setting i.e. variables are studied when and where they occur e.g.
investigating use of traditional jewelry by a traditional rural community in a remote area.
 The responses can be categorized into various classes which are called categorical variables.
 The analysis varies of qualitative data varies from simple descriptive analysis to more elaborate
deduction and multivariate associate techniques.
 The analysis varies with the purpose of research, complexity of the research design and extent to
which conclusions can be reached easily.
 In qualitative research designs the researcher should decide before going to the field, how data
will be analyzed.

44
Jane Muchembi

 The analytical techniques will determine the recording style that will be used during the data
collection.

ANALYTICAL TECHNIQUES USED IN QUALITATIVE RESEARCH


1. Quick Impressionistic Summary: This is a rapid data analysis technique mainly used in
situations that require urgent information to make decisions for a program e.g. in situations that
need urgent or immediate action. It can also be used when results already generalized are obvious
making further analysis unwarranted e.g. where 80% of the respondents give similar responses. It
does not require data transcription. The researcher records key issues of the discussion with the
respondents and the makes a narrative report.
It involves:-
i. Summarizing key findings e.g. in Focus Group discussions the researcher notes down the
frequency of responses of participants on various issues.
ii. Explanation
iii. Interpretation and conclusions

2. Thematic Analysis: Analyzing data using themes. Themes refer to topics or major subjects that
come up in discussions. In this method the researcher categorizes related topics and identifies
major concepts or themes and then does the following:-
i. Peruses the data collected and identifies the information that is relevant to the research questions
and objectives.
ii. Develops a coding system based on the samples of the collected data
iii. Classifies majors issues or topics covered
iv. Rereads the text and highlights key quotations/insights and interpretations.
v. Indicates major themes in the margins
vi. Places coded materials under major themes or topics identified
vii. Develops a summary report identifying major themes and associations between them
viii. Uses graphics and direct quotations to present the findings.
ix. Reports the intensity, which refers to the number of times certain words or phrases or
descriptions are used in the discussion. NB: the frequency with which an idea, or a word, or
description appears is used to interpret its importance, attention or emphasis.

DATA ANALYSIS IN QUANTITATIVE RESEARCH


Quantitative data analysis consists of measuring numerical values from which descriptions such as
mean, mode, and standard deviation are made. This data can be put in order and then grouped into
two:-
i. Discrete data: are countable data e.g. the number of defective items produced during a day’s
production.
ii. Continuous data: are parameters (variables) that are measured and expressed on a continuous
scale e.g. the height of a person.

NB: The analysis of quantitative data varies from simple to more elaborate analysis techniques. The
analysis varies with the objectives of the research, its complexity and extent to which conclusions
can be easily reached. Data analysis in quantitative research depends on the type of study for
example in:-
i. Correlation studies

45
Jane Muchembi

 Data is mainly analyzed using correlation coefficient.


 By using this tool the researcher indicates the degree of relationship between to variables. i.e.
The researcher is looking for degree of relationship not the effect of one variable on another.
 The correlation coefficient is a number ranging from 1(a perfect positive correlation) through
0(zero) (no relationship between variables) to -1(a perfect relationship between variables).
 In this method the researcher attempts to indicate the proportion of sameness between variables.
They include:-
a) Pearson product moment correlation: This is one of the tools used correlation coefficient. It
is used to analyze the relationship between isolated independent and dependent variables.
b) Reliability studies: Are analysis conducted to provide information about validity and
reliability of tests. In these studies the same group is given a test and the somewhat at a later date
is given the same test again. Then the researcher analyses the two scores for each subject (the test
scores and retests scores) and a correlation coefficient between the two sets of scores is
calculated. This type of correlation coefficient is referred to as reliability coefficient.
c) Prediction studies: the researcher uses the degree of relationship that exists between two
variables to predict one variable from another. E.g. if reading and spelling are correlated then the
researcher can use the information to predict a student’s score on spelling test if the student has
only taken a reading test and vice versa.

ii. Causal comparative studies


 Seek to identify the relationship between an independent and dependent variable.
 However the relationship is more suggestive than proven because the researcher has no complete
control over independent variable.
 On carrying out causal comparative research the researcher compares two selected groups on
dependent variable e.g. in a fashion production firm the researcher allows some designers to use
Computer Aided Design (CAD) and others hand design to find out the effects of CAD on
quality of designs produced.
 The inferential statistics used is t-test.
 Independent t-Test is used to test the significance of a difference between the means of
experimental and control group in the study.
 In research designs where the influence of an extraneous variable has been controlled or in
designs utilizing pre-test – post- test procedure the appropriate t-test to use to compare the two
groups would be the dependent t-test.
 When the researcher has three or more groups to compare, the appropriate inferential statistic to
use would be one way analysis of variance. This statistic shows the significance of differences
in the means of three or more groups.
 In cases where the researcher uses frequency counts for dependent variable the appropriate
inferential statistic to use would be chi-Square test. This statistic tests the significance of
differences between two more groups (independent variable) in frequencies for dependent
variable.

iii. Experimental research


 The major difference in data analysis between causal comparative and experimental research is
that the researcher has control over the independent variable in experimental research and can
manipulate this variable at will.

46
Jane Muchembi

 In the case of causal-comparative research the independent variable is established by the identity
of the groups chosen and not under experimental control.
 In experimental designs the observer should decide before carrying out the experiment the
analytical process.
 The analytical process in experimental studies mainly involves the calculation of effect size.
 Effect size is the mean of experimental group minus the control group.
The idea is to calculate the effect size across a number of studies to determine the relevance of
the test, treatment, or method.

CHAPTER FOUR: FINDINGS/RESULTS

INTERPRETING THE RESULTS


Once you’ve organized your results and run them through whatever statistical or other analysis
you’ve planned for, it’s time to figure out what they mean for your research.  In research terms,
that often translates to “What were the effects of the independent variable on the dependent
variable(s) (the behavior, conditions, or other factors it was meant to change)?”  

There are a number of possible answers to this question:


 Independent variables had exactly the effects on the dependent variable(s) you expected
and hoped it would. Statistics or other analysis showed clear positive effects at a high
level of significance
 Your program had no effect.  The independent variables produced no significant results
on the dependent variable.
 Independent variable had a negative effect. 
 Independent variable had the effects you hoped for and other effects as well.

DATA PRESENTATION
Data may be presented using frequency tables, graphs or charts to make it easier for the readers
to understand what has been found out by the research.
All the tables, graphs should be clearly labeled to make the readers understand them at a glance.
Data can be presented under in line with the objectives or research questions.

Frequency Distribution
Frequency distribution is a simple data analysis technique which allows you to get a big picture
of the data. From frequency distribution, you can see how frequently the specific values are
observed and what their percentages are for the same variable. For example: for variable of
“age,” you can use frequency distribution to figure out how many people in the survey are aged
18 to 25, and how many are aged 26 to 33, etc.

CHAPTER FIVE: SUMMARY, DISCUSSION, CONCLUSIONS AND


RECOMMENDATIONS
5.1 Summary
Should start by considering the findings especially with the results of statistical tests. Be as
precise as possible in terms of what the findings show and do not show. Comment on the results
that were not consistent with the rest of the participants’ data.

47
Jane Muchembi

5.2 Discussions
Explain how the results of your research relate with the results of other studies in the literature.
Explain whether they support or refute or contradict the previous theories.

Identify any weaknesses or limitations in your study and indicate how they could be eliminated
in future studies e.g. an ethical issue that became apparent after you had already stated the
research, or a weakness in the research design etc.

5.3 Conclusions
Make inferences on based on the findings.

5.4 Recommendations
Make recommendations or suggestions on what can be done to make the situation you
investigated better. Also make recommendations or suggestions on areas or topics that need
further research.

Ethical Issues
There are a number of key phrases that describe the system of ethical protections that the
contemporary social and medical research establishment have created to try to protect better the
rights of their research participants.

 The principle of voluntary participation requires that people not be coerced into participating
in research. This is especially relevant where researchers had previously relied on 'captive
audiences' for their subjects -- prisons, universities, and places like that.

 Closely related to the notion of voluntary participation is the requirement of informed


consent. Essentially, this means that prospective research participants must be fully informed
about the procedures and risks involved in research and must give their consent to participate.

 Ethical standards also require that researchers not put participants in a situation where they
might be at risk of harm as a result of their participation. Harm can be defined as both
physical and psychological.

 There are two standards that are applied in order to help protect the privacy of research
participants. Almost all research guarantees the participants confidentiality -- they are assured
that identifying information will not be made available to anyone who is not directly involved
in the study.

 The stricter standard is the principle of anonymity which essentially means that the participant
will remain anonymous throughout the study -- even to the researchers themselves. Clearly,
the anonymity standard is a stronger guarantee of privacy, but it is sometimes difficult to
accomplish, especially in situations where participants have to be measured at multiple time
points (e.g., a pre-post study).

48
Jane Muchembi

 Increasingly, researchers have had to deal with the ethical issue of a person's right to service.
Good research practice often requires the use of a no-treatment control group -- a group of
participants who do not get the treatment or program that is being studied. But when that
treatment or program may have beneficial effects, persons assigned to the no-treatment
control may feel their rights to equal access to services are being curtailed.

TIME/WORK PLAN: outlines the main activities of the research and when each will be
carried out. It should be on its own page.
EXAMPLE OF TIME PLAN
Activity YEAR 2021 YEAR 2022
March April may June July Aug Sep Oct Nov Dec Jan Feb March
Identify topic and write
title
Background to the study
Problem statement
Research objectives and
questions
Complete chapter one
Literature review
Research methodology
Compiling and handing
in research proposal
Corrections on the
proposal
Handing in corrected
proposal
Data collection and
analysis
Research findings,
conclusions and
recommendations

49
Jane Muchembi

BUDGET
Shows the approximated cost of the research project from the time one starts writing the proposal
to the time the project is completed. It should be as detailed and realistic as possible i.e. Show all
the items you will require to complete the project, quantities, cost per item and total cost. You
can also include 10% to cater for inflation. It should be on its own page.

Example of a budget
Item No. of units price@ item Kshs) Total cost (Kshs)
Printing papers 2 realms 400 800
Printing the proposal 30 pages 10 300
Photocopying proposals 30 pages 2 60
Internet services 1000
Transport 2 trips 500 1000
Total 3160

APPENDICES
Includes all the materials that do not fit well within the project or proposal e.g. sample
Instruments.

REFERENCES
It refers to materials that have been cited in the proposal or project. Lists all the Publications
cited in the research proposal or project. It should be arranged in alphabetical order. There are
several referencing styles but for social sciences APA style is recommended (Refer to the most
current edition of APA Manual). It should start on a fresh page.

REFERENCES
Babbie, E. (2001). The practice of social Research: 9th Ed. CA: Wadworth Thomson.

Neville C. (2007). An introduction to research and research methods. Bradford University


school of Management.

50
Jane Muchembi

Saunders, M., Lewis, P. & Thornhill, A. (2007). Research Methods for Business Students (4th
edition) Harlow: Prentice Hall.

Trochim, W. M. K.  “Data Preparation” Research Methods Knowledge Base 2nd Edition.
Accessed 2/24/09.

Trochim, W. M. K.  “Descriptive Statistics” Research Methods Knowledge Base 2nd Edition.
Accessed 2/24/09.

Trochim, W. M. K.  “Descriptive Statistics” Research Methods Knowledge Base 2nd Edition.
Accessed 2/24/09.

Trochim, W. M. K.  “Inferential Statistics” Research Methods Knowledge Base 2nd Edition.
Accessed 2/24/09.

Multivariate Statistics: concepts, models, and applications by David W. Stockburger WWW


Version 1.0 First Published July 1997 Revised March, 1998

PSY2005 Statistics and Research Methods: Quantitative data analysis component


http://www.soc.surrey.ac.uk/sru/SRU22.html by Janet Heaton

Pitfalls of Data Analysis (or How to Avoid Lies and Damned Lies) by Clay Helberg, M.S.,
Research Design and Statistics Unit

51

You might also like