Chapter 2
Chapter 2
INTRODUCTION TO STATISTICS
• SUBTITLE: SUMMARISING DATA
• LECTURER: DR ADEBAYO
• LEC 03
Unit 3: summarizing data
• OUTLINE
Learning objectives
INTRODUCTION
Frequency distributions
GRAPHICAL PRESENTATION OF DATA
Summarizing data from CONTINUOUS
variables
summary
Learning objectives
After studying this lesson you will be able to:
o Routine surveillance
o Experimental study
o conducting a study and so on
QUALITATIVE/
Categorical
Discrete Continuous
Nominal Ordinal
Interval Ratio
Frequency distributions
Frequency table:
o Thus,
- A frequency distribution is the organization of a
data set into nearby, mutually exclusive
intervals so that the number or proportion of
observations falling in each interval is
apparent/clear.
Frequency distributions cont...
Frequency table:
Frequency table:
Female 35 58
Male 25 42
Total (n) 60 100
1, 1, 2, 3, 2, 1, 2, 2, 2, 1, 3, 1, 3, 1, 1, 2, 1, 1, 2, 2, 2, 2,
4, 2, 3, 2, 2, 1, 1, 2, 1, 1, 2, 2, 2, 1, 2, 1, 1 and 4
Where
1= Married, 2= Single, 3 = Divorced and 4=Widowed
Construct the frequency distribution for this data and calculate the
percentage of patients who were Single.
Frequency distributions cont...
Frequency table:
ANS: 45% of the patients
o Solution were single.
-
Frequency distributions cont...
- Table 3: Line Listing of HIV/AIDS Cases in Hospital X
ID Date of Age Sex HIV/ AIDS Hospitalized ARV
Diagnosis (Years) Drugs
01 05/ 01 74 M N Y N
02 06/ 01 29 M Y N Y
03 08/ 01 39 M Y Y N
04 19/ 01 23 F N N N
05 30/ 01 39 M Y N Y
06 02/ 02 23 M Y Y Y
07 03/ 02 19 M Y Y Y
08 05/ 02 40 M Y N Y
09 19/ 02 28 M N Y N
10 22/ 02 29 F Y N N
11 23/ 02 23 F Y Y N
12 24/ 02 40 M Y N Y
13 26/ 02 49 F N N N
14 26/ 02 40 F N N N
15 27/ 02 29 F Y Y N
16 27/ 02 18 M N Y N
17 27/ 02 19 M Y N Y
18 28/ 02 29 F Y Y Y
19 28/ 02 40 F Y Y Y
20 29/ 02 40 M Y N N
- First, list all the values that the variable age can take, from the
lowest possible value to the highest.
Step 1: Figure out how many classes (called class Intervals) you need.
You can use Sturge’s Rule. Let K = number of classes then,
N.B: The number found here should be rounded to the next integer
(whole number).
Frequency distributions cont...
Data from Continuous Variables
o Main steps:
Step 2: Find the minimum and the maximum values. Then find the
range by subtracting the minimum value from the maximum value.
Range = Maximum value – Minimum value
o Main steps:
Step 4: Begin with the minimum value in the data then add the class
width from step 3 to get the next lower class limit.
Step 6: Write down the upper class limits by subtracting 1 from the
class width then add that value to the lower class limits
Frequency distributions cont...
Continuous Variables
Step 9: Determine the class marks. The number in the middle of the
class is called class mark of the class. The number in the middle of
the upper class limit of one class and the lower class limit of the other
class is called the class boundary.
Frequency distributions cont...
Frequency Table
o Example of a Frequency Distribution of a Continuous Variable
118, 123, 124, 125, 127, 128, 129, 130, 130, 133,
136, 138, 141, 142, 149, 150, 154, 119, 122, 132,
145, 120, 151, 135, 152, 131, 121, 154, 151, 136,
144, 139, 123, 137, 147, 149, 119, 131, 142
Frequency distributions cont...
Frequency Table
o Example of a Frequency Distribution of a Continuous Variable
28, 28, 31, 29, 35, 33, 28, 31, 34, 29, 25, 27, 29, 33, 30, 31, 32, 26, 26, 21,
21, 20, 22, 24, 28, 30, 34, 33, 35, 29, 23, 21, 20, 19, 19, 18, 19, 17, 20,
19,18, 18, 19, 27, 17, 18, 20, 21, 18, and 19
Frequency distributions cont...
Solution
Step 1: k = 1+3.322log(n), n = 50
= 1+ 3.322log(50)
= 1+5.64398
= 6.64398
=7
= 2.57143
=3
Frequency distributions cont...
Solution