Ch2 Statistics
Ch2 Statistics
Having collected and edited the data, the next important step
3/17/2023
is to organize it.
That is to present it in a readily comprehensible condensed
form that aids in order to draw inferences from it.
It is also necessary that the like be separated from the unlike
ones.
The presentation of data is broadly classified in to the
following two categories:
Tabular presentation
1
CON’T…
The process of arranging data in to classes or categories
according to similarities technically is called classification.
3/17/2023
Classification is a preliminary and it prepares the ground for
proper presentation of data.
Definitions:
Raw data: recorded information in its original collected form,
whether it is counts or measurements, is referred to as raw data.
Frequency: is the number of values in a specific class of the
distribution.
Frequency distribution: is the organization of raw data in table2
form using classes and frequencies.
THREE BASIC TYPES OF FREQUENCY DISTRIBUTIONS
3/17/2023
•
3
Categorical frequency Distribution
Used for data that can be place in specific categories such
as nominal, or ordinal. e.g. marital status, blood type,….
3/17/2023
Example: a health worker collected the following data on
blood type for 25 persons.(A, B, O, AB)
A B AB O AB
B B A A A
O AB B A A
O AB AB B B
B O O AB AB
4
CON’T
Solution:
3/17/2023
There are four types of blood type A, B, O, and AB.
Step 3: Count the tally and place the result in column (3).
3/17/2023
𝑓
% = 100 Where f = frequency of the class, n= total number
𝑛
of value or observations.
3/17/2023
A //// / 6 24
B //// // 7 28
AB //// // 7 28
O //// 5 24
7
Ungrouped frequency Distribution
It is a table of all the potential raw score values that could
possible occur in the data along with the number of times each
3/17/2023
actually occurred.
It is often constructed for small set or data on discrete variable.
Steps in constructing ungrouped frequency distribution:
First find the smallest and largest raw score in the collected
data.
Arrange the data in order of magnitude and count the
frequency.
To facilitate counting one may include a column of tallies.
8
EXAMPLE:
THE FOLLOWING DATA REPRESENT THE MARK OF 20 STUDENTS.
80 76 90 85 80
70 60 62 70 85
3/17/2023
65 60 63 74 75
76 70 70 80 85
Construct a frequency distribution, which is ungrouped.
Solution:
3/17/2023
63 / 1
65 / 1
70 //// 4
74 / 1
75 / 1
76 // 2
80 /// 3
85 /// 3
90 / 1
10
Grouped frequency Distribution
When the range of the data is large, the data must be grouped
in to classes that are more than one unit in width.
3/17/2023
Definitions:
Grouped Frequency Distribution: a frequency distribution when
several numbers are grouped in one class.
Class limits: Separates one class in a grouped frequency
distribution from another.
The limits could actually appear in the data and have gaps
between the upper limits of one class and lower limit of the next.
Units of measurement (U): the distance between two possible
11
consecutive measures. It is usually taken as 1, 0.1, 0.01, 0.001,
CON’T
Class boundaries: Separates one class in a grouped frequency
distribution from another.
3/17/2023
The boundaries have one more decimal places than the row data
and therefore do not appear in the data. There is no gap
between the upper boundary of one class and lower boundary of the
next class
Class width: the difference between the upper and lower class
boundaries of any class.
It is also the difference between the lower limits of any two
consecutive classes or the difference between any two consecutive
class marks.
12
Class mark (Mid points): it is the average of the lower and upper
class limits or the average of upper and lower class boundary.
CON’T
Cumulative frequency: is the number of observations less
than/more than or equal to a specific value.
3/17/2023
Cumulative frequency above: it is the total frequency of all
values greater than or equal to the lower class boundary of a
given class.
cumulative frequencies.
STEPS FOR CONSTRUCTING GROUPED FREQUENCY DISTRIBUTION
1. Find the largest and smallest values
3/17/2023
2. Compute the Range(R) = Maximum - Minimum
3. Select the number of classes desired, usually between 5 and 20
or use Sturges rule K=1+3.32log n where k is number of classes
desired and n is total number of observation.
4. Find the class width by dividing the range by the number of
𝑅
classes and rounding up, not off. W =
𝐾
The starting point is called the lower limit of the first class.
CON’T
To find the upper limit of the first class, subtract U from the lower
limit of the second class.
3/17/2023
Then continue to add the class width to this upper limit to find the
rest of the upper limits.
Find the boundaries by subtracting U/2 units from the lower limits
and adding U/2 units from the upper limits.
The boundaries are also half-way between the upper limit of one
class and the lower limit of the next class.
3/17/2023
Depending on what you're trying to accomplish,
frequencies.
3/17/2023
18 17 22 38 23 21 26 34 39 27
Solutions:
Step 1: Find the highest and the lowest value H=39, L=6
3/17/2023
Step 6: Find the upper class limit; e.g. the first upper class=12-
U=12-1=11
11, 17, 23, 29, 35, 41 are the upper class limits.
3/17/2023
29.5 – 35.5
35.5 – 41.5
CON’T
3/17/2023
Step 9: Write the numeric values for the tallies in the frequency
column.
20
Class Class Class Tally Fre Cf Cf rf. rcf
limit boundary Mark q. (less (more (less
than than than
type) type) type
6 – 11 5.5 – 11.5 8.5 // 2 2 20 0.10 0.10
21
DIAGRAMMATIC AND GRAPHIC PRESENTATION OF DATA.
3/17/2023
visual displays using geometric and pictures.
Importance:
22
CON’T…
Diagrams are appropriate for presenting discrete data.
3/17/2023
The three most commonly used diagrammatic presentation
for discrete as well as qualitative data are:
Pie charts
Bar charts
23
CON’T
Pie chart
A pie chart is a circle that is divided in to sections or wedges
3/17/2023
according to the percentage of frequencies in each category of
the distribution.
The angle of the sector is obtained using:
𝑓
𝑃𝑒𝑟𝑐𝑒𝑛𝑡𝑎𝑔𝑒 = × 100
𝑛
𝑓
𝑛𝑔𝑙𝑒 𝑜𝑓 𝑆𝑒𝑐𝑡𝑜𝑟 = × 360
𝑛
3/17/2023
17
Solutions: 5 2 1
25
CON’T
3/17/2023
fingers 17 68 244.8
eyes 5 20 72
arm 2 8 28.8
leg 1 4 14.4
Total 25 100 360
26
CON’T
Pie Chart
3/17/2023
leg
4% eyes
20%
fingers arm
68% 8%
27
BAR GRAPH
Bars can be drawn either vertically or horizontally.
3/17/2023
There are different types of bar charts.
28
SIMPLE BAR CHART
Are used to display data on one categorical variable.
They are thick lines (narrow rectangles) having the same range.
3/17/2023
The magnitude of a quantity is represented by the height /length
of the bar.
3/17/2023
30
COMPONENT BAR CHART
Desire to show how a total (or aggregate) is divided in to its
3/17/2023
component parts
3/17/2023
100
80
Sales in $
Product C
60
Product B
40
Product A
20
0
1957 1958 1959
Year of production
3/17/2023
continuous data.
Represent the class boundaries for the histogram or ogive or the mid
points for the frequency polygon on the X axes.
34
Plot the points and draw the bars or lines to connect the points.
HISTOGRAM
A graph which displays the data by using vertical bars of
3/17/2023
various height to represent frequencies.
35
CONT…
Class limit
15-19
Class boundaries
14.5-19.5
Mid point
17
Frequency
20-24 19.5-24.5 22 8
25-29 24.5-29.5 27 6
30-34 29.5-34.5 32 12
3/17/2023
35-39 34.5-39.5 37 7
40-44 39.5-44.5 42 6
45-49 44.5-49.5 47 4
50-54 49.5-54.5 52 3
55-59 54.5-59.5 57 1
60-64 59.5-64.5 62 1
36
FREQUENCY POLYGON:
A line graph.
3/17/2023
The frequency is placed along the vertical axis and classes mid
points are placed along the horizontal axis.
CON’T
15-19 17 2
20-24 22 8
25-29 27 6
3/17/2023
30-34 32 12
35-39 37 7
40-44 42 6
45-49 47 4
50-54 52 3
55-59 57 1
60-64 62 1
38
CON’T…
Ogive (cumulative frequency polygon)
3/17/2023
- A graph showing the cumulative frequency (less than or more
than type) plotted against upper or lower class boundaries
respectively.
23-26
CONT… 3 ≤ 26.5 ≤ 26 = 3 ≥ 22.5 ≥ 23 = 20
3/17/2023
35-38 5 ≤ 38.5 ≤ 38 = 15 ≥ 34.5 ≥ 35 = 10
40
Thank you!!!
3/17/2023
Have you
Question
41