Topic 5_ Data Analysis (1)
Topic 5_ Data Analysis (1)
Data Management
Data
• What do we mean by data?
Making Decisions on
Demonstrate trends in how to Improve
key indicators Interventions Appropriate policy formulation
• Data cleaning is the first step in the overall data preparation Good
process
analysis rests
• It involves analyzing, identifying and correcting messy raw
data on clean data
Coding and Code book
• A code: Short word, phrase (qualitative research) or a number
(quantitative research)describing the meaning and context of the
whole sentence or paragraph
• Why coding?
Reduces large quantities of
Data are collected in Working with original data
information to a form that is
different formats may be cumbersome easily handled
The 1,2,3,4,5
1 = Christian are Codes for
2 = Muslim each response
Variable name: category
Religion 3 = Pagan
4 = Non- religious
5 = Other
Examples of codes in qualitative research
He’s always been there for me, even when my parents were
not. He’s one of the few things that I hold as a constant in STABILITY
my life. So it’s nice
Suppose we
wanna move
column G
between E and F
Column G will now Right click at the
have a flashing green top of column G
box around it
Excel is waiting for
you to move it A list of options appears;
somewhere new
Select “Cut”
Right Click at the top of column
F and select “Insert Cut Cells”
Highlight
the cells 3
A drop-
you want to down
4 menu
Sort by
will
Click on Sort
Largest to
appear
Smallest
A Sort warning will
appear
Click on
Sort &
Filter
Click
Highlight ANY cell within
the row that you want to
on
add filters to (Usually the “Filter”
column headings)
A filter button will appear in each of
the cells in that row
Let us say we want to filter the data we
have to only look at data for the African
ethnic background only, and hide the
other ethnics
Click on a filter button at the bottom right
corner of the cell E1 where variable ethnic
is found
A drop
down menu
will appear
Tick only the box (es) for the data that you want
to remain visible (Africa for our case), then click
OK
After Filtering
To develop a frequency distribution for these data, we count the number of times each soft drink
appears! So that we have a drink and number of times it appears (frequency): See Table 1
Table 1
Drink Type frequency
Coke 13
Pepsi 8
Mirinda 5
Fanta 11
7up 6
Sprite 7
50
50 is the total frequency– the total number of the drinks purchased
Intuition from Table 1
Intuition: The top three soft drinks purchased were Coke (26%), Fanta
(22%) and Pepsi (22%)
Using Excel for Relative frequency
The formula:
E8
= E2/$E$8
Bar Graphs and Pie Charts
Step 2:
On the tool bar, Click
“Insert”, and select
Step 1:
Select the data set
you want to draw Step 3:
a bar chart Obtain the graph
The Graph
Pie Chart
Intuition:
21 persons have a poverty rate between 10 – 14 percent; Only 2 persons have a poverty rate
between 30 – 34 percentage!!! If the standard poverty rate is known, the feeling becomes more
realistic
Percentage distribution
Sample message:
The fourth class contains 5 out of a total of 51 observations (n= 51). The percent in this class is (5
divide by 51) times 100 gives 9.8% Implies that 1 out of every 10 persons had a poverty rate
between 25 and 29 percent
Analyzing frequency distributions
Step 1
Highlight cells
Type a formula for E2:E6
FREQUENCY in
the Cell E2: =
FREQUENCY(A2:A
21,D2:D6) without Press
enter CTRL+SHIFT+ENTER Step 3
to have the frequencies
Step 2
After ctrl+shift+enter
The required
frequencies
Histogram
Select cells 3
E2:E6 Go to Chart
Wizard and
Format the chart Choose Clustered
obtained Column
Formatting Chart
Good 42 40 2 0 84
Excellent 2 14 28 22 66