Lec 3 (Data Organization)
Use the different methods of data organization and presentation.
Descriptive statistics
02/09/2025 7
1. For categorical variables
A. Using a frequency distribution table
   1. Frequency counts
   2. Relative frequency
   3. Cumulative frequency
   4. Relative cumulative frequency
B. Using pictorial forms
   1. Bar charts (graphs)
   2. Pie charts
2. For quantitative variables
A. Using a grouped frequency distribution table
   1. Frequency counts
   2. Relative frequency
   3. Cumulative frequency
   4. Relative cumulative frequency
B. Using pictorial forms
   1. Histogram
   2. Frequency polygon
   3. Line graph…
Frequency table & Frequency Distributions
Frequency:
• The number of times a given value occurs in a data set.
Example 1: The blood types of 30 patients were recorded as follows:
A AB B B A O O AB AB B O A A B B
A AB A O AB B AB AB O A AB AB O A O
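As a minimal sketch, the frequency counts for Example 1 can be tallied with Python's standard `collections.Counter`. The variable names (`blood_types`, `frequency`) are illustrative and not part of the lecture.

```python
from collections import Counter

# Blood types of the 30 patients, in the order listed in Example 1.
blood_types = (
    "A AB B B A O O AB AB B O A A B B "
    "A AB A O AB B AB AB O A AB AB O A O"
).split()

# Counter tallies how many times each category occurs.
frequency = Counter(blood_types)

# Print a simple frequency table, one category per row.
for blood_type in sorted(frequency):
    print(f"{blood_type:<5} {frequency[blood_type]}")
print(f"Total {sum(frequency.values())}")
```

The total row is a quick check that every observation was counted exactly once.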
Relative Frequency
A relative frequency distribution shows the proportion (or
percentage) of the total number of observations that falls in
each category.
From the previous example
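Using the blood-type data of Example 1, the relative frequencies might be computed as in the sketch below: each count is divided by the total number of observations. The names used are illustrative assumptions.

```python
from collections import Counter

# Blood types of the 30 patients from Example 1.
blood_types = (
    "A AB B B A O O AB AB B O A A B B "
    "A AB A O AB B AB AB O A AB AB O A O"
).split()

frequency = Counter(blood_types)
n = len(blood_types)  # total number of observations

# Relative frequency = category count / total count.
relative_frequency = {
    blood_type: count / n for blood_type, count in sorted(frequency.items())
}

for blood_type, proportion in relative_frequency.items():
    print(f"{blood_type:<3} {proportion:.3f} ({proportion:.1%})")
```

The relative frequencies always sum to 1 (or 100%), which is a useful check on the table.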
Cumulative frequency
It is the number of observations in the category plus the
observations in all preceding categories.
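A running total gives the cumulative frequencies directly. The sketch below uses `itertools.accumulate` on the blood-type counts tallied from Example 1; the category order (A, AB, B, O) is an assumption, since blood type has no natural ordering.

```python
from itertools import accumulate

# Counts tallied from the 30 blood types in Example 1.
categories = ["A", "AB", "B", "O"]
counts = [8, 9, 6, 7]

# Each cumulative frequency is the count of the category
# plus the counts of all categories listed before it.
cumulative = list(accumulate(counts))
total = cumulative[-1]

# Relative cumulative frequency = cumulative frequency / total.
relative_cumulative = [c / total for c in cumulative]

for cat, freq, cum, rel in zip(categories, counts, cumulative, relative_cumulative):
    print(f"{cat:<3} {freq:>3} {cum:>3} {rel:.3f}")
```

The last cumulative frequency equals the total number of observations, and the last relative cumulative frequency equals 1.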
• To divide the data into groups (intervals or classes), we need
to determine the number of class intervals and the width of
each interval.
To determine the number of class intervals and the
corresponding width, we may use Sturges' rule:
K = 1 + 3.322 × log10(n)
W = (L − S) / K
where
K = number of class intervals
n = number of observations
W = width of the class interval
L = the largest value
S = the smallest value
Example: Leisure time (hours) per week for 40 college
students:
23 24 18 14 20 36 24 26 23 21 16 15 19 20 22 14 13
10 19 27 29 22 38 28 34 32 23 19 21 31 16 28 19 18
12 27 15 21 25 16
K = 1 + 3.322 × log10(n)
K = 1 + 3.322 × log10(40) = 6.32 ≈ 6
Maximum value (L) = 38, minimum value (S) = 10
W = (L − S) / K
W = (38 − 10) / 6 = 4.67 ≈ 5
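The same arithmetic can be checked with a short script. This is only a sketch: rounding K to the nearest integer and rounding W up to the next integer are assumptions chosen to match the worked values above.

```python
import math

# Leisure time (hours) per week for the 40 college students.
leisure_hours = [
    23, 24, 18, 14, 20, 36, 24, 26, 23, 21, 16, 15, 19, 20, 22, 14, 13,
    10, 19, 27, 29, 22, 38, 28, 34, 32, 23, 19, 21, 31, 16, 28, 19, 18,
    12, 27, 15, 21, 25, 16,
]

n = len(leisure_hours)

# Sturges' rule: K = 1 + 3.322 * log10(n).
k_exact = 1 + 3.322 * math.log10(n)
k = round(k_exact)  # assumption: round to the nearest whole class

# Class width: W = (L - S) / K, rounded up so the classes cover the range.
largest, smallest = max(leisure_hours), min(leisure_hours)
w_exact = (largest - smallest) / k
w = math.ceil(w_exact)

print(f"n = {n}, K = {k_exact:.2f} -> {k}, W = {w_exact:.2f} -> {w}")
```

With K = 6 classes of width 5 starting at 10, the classes run from 10 to 39 and therefore include the largest value, 38.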
• Classes should be mutually exclusive: successive classes
must not overlap, so no value belongs to more than one class.
• Classes should be exhaustive: the smallest and largest values
must fall within the classification, and no value may fall into
a gap between successive classes.
• I.e., class intervals should be continuous, non-overlapping,
mutually exclusive and exhaustive.
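These requirements amount to saying that every observation falls in exactly one class. A minimal sketch of such a check follows; the function names and the small data sets are hypothetical and used only for illustration.

```python
def classes_containing(value, classes):
    """Return every (lower, upper) class whose limits include the value."""
    return [(lower, upper) for lower, upper in classes if lower <= value <= upper]


def is_valid_classification(data, classes):
    """True if each value falls in exactly one class
    (mutually exclusive and exhaustive for this data)."""
    return all(len(classes_containing(x, classes)) == 1 for x in data)


# Hypothetical illustrative classes and data.
good_classes = [(10, 14), (15, 19), (20, 24)]
overlapping_classes = [(10, 15), (15, 19), (20, 24)]  # 15 is in two classes

data = [10, 14, 15, 24]

print(is_valid_classification(data, good_classes))         # every value in one class
print(is_valid_classification(data, overlapping_classes))  # 15 is counted twice
print(is_valid_classification(data + [25], good_classes))  # 25 is outside all classes
```

The check is written for data recorded in whole units; for continuous measurements the class boundaries (true limits) would be used instead of the class limits.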
Class boundaries (true limits): the limits that make the
intervals of a continuous variable continuous in both
directions, so that no gaps remain between successive classes.
Upper class boundary
Exercise
• These data represent the record high temperatures in degrees
Fahrenheit (°F) for each of the 50 states. Construct a grouped
frequency distribution.
112 100 127 120 134 118 105 110 109 112
110 118 117 116 118 122 114 114 105 109
107 112 114 115 118 117 118 122 106 110
116 108 110 121 113 120 119 111 104 111
120 113 120 117 105 110 118 112 114 114
Step 1: Determine the classes.
• Find the highest and lowest values, and compute the range:
R = 134 − 100 = 34
• Select the number of classes desired, using the
formula K = 1 + 3.322 × log10(n), where n is the
number of observations (n = 50):
K = 1 + 3.322 × log10(50) = 6.64 ≈ 7 classes
• Find the class width by dividing the range by the number of
classes: W = R/K = 34/7 = 4.9 ≈ 5
• Select a starting point for the lowest class limit (here, 100).
• Add the width to the starting point to get the lower limit of
the next class, and keep adding until all 7 lower limits are found:
100, 105, 110, 115, 120, 125, 130
• Subtract one unit from the lower limit of the second
class to get the upper limit of the first class:
105 − 1 = 104
• Then add the width to each upper limit to get all the
upper limits:
104, 109, 114, 119, 124, 129, 134
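The limit-building procedure can be sketched in a few lines. The sketch assumes the values from Step 1 (starting point 100, width 5, 7 classes); the variable names are illustrative.

```python
lowest = 100  # starting point for the lowest class limit
width = 5     # class width from Step 1
k = 7         # number of classes from Step 1

# Keep adding the width to the starting point to get the lower limits.
lower_limits = [lowest + i * width for i in range(k)]

# Upper limit of the first class: one unit below the second lower limit.
first_upper = lower_limits[1] - 1

# Keep adding the width to get the remaining upper limits.
upper_limits = [first_upper + i * width for i in range(k)]

for lower, upper in zip(lower_limits, upper_limits):
    print(f"{lower}-{upper}")
```

The last class, 130-134, contains the highest observed value (134), so the seven classes cover the whole range of the data.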
Step 2: Determine the midpoint of each interval.
Step 3: Tally the data to find the frequency of each class.
Class limit   Class boundary (true limit)   Midpoint   Frequency
100-104       99.5-104.5                    102        2
105-109       104.5-109.5                   107        8
110-114       109.5-114.5                   112        18
115-119       114.5-119.5                   117        13
120-124       119.5-124.5                   122        7
125-129       124.5-129.5                   127        1
130-134       129.5-134.5                   132        1
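The table can be reproduced from the raw data with a short script. This is a sketch under the assumptions used in the exercise: integer class limits, width 5, and class boundaries placed half a unit beyond each limit.

```python
# Record high temperatures (°F) for the 50 states, from the exercise.
temperatures = [
    112, 100, 127, 120, 134, 118, 105, 110, 109, 112,
    110, 118, 117, 116, 118, 122, 114, 114, 105, 109,
    107, 112, 114, 115, 118, 117, 118, 122, 106, 110,
    116, 108, 110, 121, 113, 120, 119, 111, 104, 111,
    120, 113, 120, 117, 105, 110, 118, 112, 114, 114,
]

width = 5
rows = []
for lower in range(100, 135, width):  # lower limits 100, 105, ..., 130
    upper = lower + width - 1                     # upper class limit
    lower_boundary = lower - 0.5                  # true lower limit
    upper_boundary = upper + 0.5                  # true upper limit
    midpoint = (lower + upper) / 2                # class midpoint
    frequency = sum(lower <= t <= upper for t in temperatures)
    rows.append((lower, upper, lower_boundary, upper_boundary, midpoint, frequency))

for lower, upper, lb, ub, mid, freq in rows:
    print(f"{lower}-{upper}  {lb}-{ub}  {mid:g}  {freq}")

frequencies = [row[5] for row in rows]
print("Total", sum(frequencies))
```

The frequencies sum to 50, confirming that every observation was assigned to exactly one class.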
Guidelines for constructing tables
or less)
All tables should be self-explanatory (include a clear title).
Show totals.
Quantitative data
• Histogram
• Frequency polygon
• Box plot
• Ogive curve
• Scatter plot
• Line graph
• Others
1. Bar charts (Graphs)
Categories are listed on the horizontal axis (X-axis).
Frequencies or relative frequencies are shown on the vertical
axis (Y-axis).
Multiple bar chart
In this type of chart the component
figures are shown as separate bars
adjoining each other.
Sub-divided (component) bar chart
Method of constructing bar chart
2. Pie chart
frequency.
Used for a single categorical variable
Steps to construct pie chart
Example: Distribution of deaths for females, in England
and Wales, 1989.
Cause of death          No. of deaths
Circulatory system      100 000
Neoplasm                 70 000
Respiratory system       30 000
Injury and poisoning      6 000
Digestive system         10 000
Others                   20 000
Total                   236 000
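A pie chart gives each category a slice proportional to its share of the total, so the slice angle is the relative frequency times 360°. The sketch below computes the percentages and angles for the table above; the variable names are illustrative.

```python
# Number of deaths by cause, from the table above.
deaths = {
    "Circulatory system": 100_000,
    "Neoplasm": 70_000,
    "Respiratory system": 30_000,
    "Injury and poisoning": 6_000,
    "Digestive system": 10_000,
    "Others": 20_000,
}

total = sum(deaths.values())

# Share of the total for each cause, as a rounded percentage
# and as the angle (in degrees) of the corresponding slice.
percentages = {cause: round(100 * count / total) for cause, count in deaths.items()}
angles = {cause: 360 * count / total for cause, count in deaths.items()}

for cause in deaths:
    print(f"{cause:<22} {percentages[cause]:>3}%  {angles[cause]:6.1f} degrees")
```

The angles sum to 360°, and the rounded percentages here happen to sum to exactly 100.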
Figure: Pie chart of the distribution of cause of death for females, in England and Wales, 1989
(Circulatory system 42%, Neoplasm 30%, Respiratory system 13%, Others 8%,
Digestive system 4%, Injury and poisoning 3%)
Histogram
Histograms are frequency distributions with continuous class
intervals that have been turned into graphs.
Constructed by choosing a set of non-overlapping class intervals.
Two problems with histograms
1. They are somewhat difficult to construct
2. The actual values within the respective groups are lost and
difficult to reconstruct (we "lose" the information about
individual data values when we group the data).
Figure: Frequency polygon for the ages of 2087 mothers with <5
children, Adami Tulu, 2003
It can also be drawn without erecting rectangles, by
joining the points plotted at the midpoint of each interval
at a height equal to the frequency of the class, as follows:
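As an illustration, the points of a frequency polygon can be listed from the midpoints and frequencies of a grouped table. The sketch below reuses the record-high-temperature table from the exercise; closing the polygon with zero-frequency points one class width beyond each end is a common convention and is an assumption here.

```python
# Midpoints and frequencies from the record-high-temperature table.
midpoints = [102, 107, 112, 117, 122, 127, 132]
frequencies = [2, 8, 18, 13, 7, 1, 1]
width = 5  # class width

# Add a zero-frequency point one class width before the first midpoint
# and one after the last, so the polygon returns to the horizontal axis.
xs = [midpoints[0] - width] + midpoints + [midpoints[-1] + width]
ys = [0] + frequencies + [0]

# Joining these points in order with straight lines gives the polygon.
points = list(zip(xs, ys))
for x, y in points:
    print(f"({x}, {y})")
```

Plotting these points and connecting them with line segments produces the same outline as joining the tops of the histogram bars at their midpoints.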
Scatter plot
Most studies in medicine involve measuring more than one
characteristic.
Line graph
• The line graph is especially useful for the study of
some variables according to the passage of time.
Reading assignment
Thank you