Intro Stat Session 4 Final 22-11-23


Intro-to-Stat.

Session 4
Categorical Data Tabular Presentation

Categorical raw data can also be summarized using a table.
Example: Consider data on the marital status of a
sample of 100 employees selected from all
colleges of AAU. S := single; M := married; D := divorced

SMMMSDMMDMSMMMSDMMDMSMMMSDMMDMSMMSMMM
SMMMMMSMMMSSMMSMMSSMMDMSMMMSSMMMMSMMM
SSMMMMSMMSMMMSSMMMMSMMMSSMMMMMSDMMDM
Example Cont’d
Variable: marital status
Categories: three – single, married, divorced
Frequency distribution of marital status:

Marital Status   Frequency
Single           29
Married          62
Divorced         9
Total            100
Example on terms used in frequency distribution
Content of Statistical Table
Basic content of statistical table:
1. Table Number and Title of the Table
2. Captions: column headings
3. Stubs: row names/headings
4. Body of the table
5. Head note
6. Footnotes and references
7. Source of data
Example
(Slide callouts: Table Number and Title; Sub-title; Captions / Column Headings; Rows)

Table 1: Value of Coffee Export (2012 – 2015 E.C.)
(in Million USD)

S/N    Description  Quarter I  Quarter II  Quarter III  Quarter IV  Total
1      2012         230        245         165          300         940
2      2013         315        235         277          350         1177
3      2014         120        450         200          300         1070
4      2015         320        250         400          100         1070
Total               985        1180        1042         1050

Source: National Bank Bulletin, 20--- E. C.
Note: Coffee grades: 1 to 3
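
The row and column totals in Table 1 can be checked with a few lines of code. The following is a minimal Python sketch, not part of the original slides; the variable names (`exports`, `row_totals`, `column_totals`) are chosen here for illustration only.

```python
# Quarterly coffee export values from Table 1 (million USD), keyed by year (E.C.).
exports = {
    2012: [230, 245, 165, 300],
    2013: [315, 235, 277, 350],
    2014: [120, 450, 200, 300],
    2015: [320, 250, 400, 100],
}

# Row totals: one total per year (the "Total" column of the table).
row_totals = {year: sum(values) for year, values in exports.items()}

# Column totals: one total per quarter (the "Total" row of the table).
column_totals = [sum(quarter) for quarter in zip(*exports.values())]

# Both margins must add up to the same grand total.
grand_total = sum(row_totals.values())
assert grand_total == sum(column_totals)

for year, total in row_totals.items():
    print(year, total)
print("Quarter totals:", column_totals)
```

Checking that both margins give the same grand total is a quick way to catch a typing error in the body of a table.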
Graphical Presentation of Data

Why graphical presentation?
Raw data are usually not easy to use for decision making.
Therefore, there is a need for organizing data.
Data can be organized in the form of
a) Table, e.g. frequency distribution table
b) Graph
The type of graph to use depends on the variable/data type being summarized.
Graphical Presentation of Data
Types of Variables and Graphical/Tabular Presentation

Categorical Variables
• Frequency distribution
• Bar chart
• Pie chart
• Pictograms

Numerical Variables
• Line chart
• Frequency distribution
• Histogram and ogive
• Frequency polygon
Tables and Graphs for Categorical Variables
Categorical Data
• Tabulating Data: Frequency Distribution Table
• Graphical Presentation: Bar Chart, Pie Chart, Pictograms
The Frequency Distribution Table
Summarizing data by category

Example: Hospital Patients by Unit

Hospital Unit    Number of Patients
Cardiac Care     1,052
Emergency        2,245
Intensive Care   340
Maternity        552
Surgery          4,630

(Variables are categorical)
Class work
The following data are taken from the records unit of a certain
hospital. The observations give the blood type of each donor,
with the Rh factor in brackets.
i. Construct a frequency distribution for blood type
ii. Construct a frequency distribution for Rh factor

A(+) A(+) A(+) O(+) A(+) O(+)

A(+) A(+) A(+) B(-) A(-) A(-)

B(+) A(+) A(+) A(+) B(+) O(+)

O(-) O(-) AB(+) AB(-) O(+) O(+)

A(+) O(-) O(+) O(+) O(+) O(-)

O(+) O(+) O(+) O(-) O(+) O(+)


Frequency Table

Blood Group   Rh Positive   Rh Negative   Total   % (blood group)
A             11            2             13      36%
B             2             1             3       8%
AB            1             1             2       6%
O             13            5             18      50%
Total         27            9             36
Percent       75%           25%
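
The frequency table can be reproduced by tallying the 36 class-work records. The following is a minimal Python sketch, not part of the original slides; it assumes each record is written as in the class work (blood type followed by the Rh sign in brackets), and the names `records`, `pairs`, `type_counts` and `rh_counts` are illustrative.

```python
from collections import Counter

# The 36 donor records from the class work, written as "type(Rh)".
records = """
A(+) A(+) A(+) O(+) A(+) O(+)
A(+) A(+) A(+) B(-) A(-) A(-)
B(+) A(+) A(+) A(+) B(+) O(+)
O(-) O(-) AB(+) AB(-) O(+) O(+)
A(+) O(-) O(+) O(+) O(+) O(-)
O(+) O(+) O(+) O(-) O(+) O(+)
""".split()

# Split each record into blood type and Rh sign, e.g. "AB(-)" -> ("AB", "-").
pairs = [(r[:-3], r[-2]) for r in records]

# Tally each variable separately.
type_counts = Counter(blood for blood, _ in pairs)
rh_counts = Counter(rh for _, rh in pairs)
n = len(pairs)

# Print frequency and percentage for each category.
for blood in ["A", "B", "AB", "O"]:
    share = 100 * type_counts[blood] / n
    print(f"{blood:<3}{type_counts[blood]:>4}{share:>6.0f}%")
for rh in ["+", "-"]:
    share = 100 * rh_counts[rh] / n
    print(f"Rh{rh}{rh_counts[rh]:>4}{share:>6.0f}%")
```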
Bar and Pie Charts

• Bar charts and pie charts are often used for qualitative (category) data
• Height of bar shows the frequency for each category
• Size of pie slice is proportional to the percentage of the corresponding category
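
To illustrate how bar size encodes frequency, the following minimal Python sketch prints a text bar chart of the blood group counts from the class-work frequency table. It is not part of the original slides; the function name `text_bar_chart` is hypothetical, and the bars are drawn horizontally, so bar length plays the role of bar height.

```python
# Blood group frequencies taken from the class-work frequency table.
frequencies = {"A": 13, "B": 3, "AB": 2, "O": 18}

def text_bar_chart(counts):
    """Return one line per category; the number of '#' equals the frequency."""
    width = max(len(label) for label in counts)
    return [f"{label:<{width}} | {'#' * n} {n}" for label, n in counts.items()]

for line in text_bar_chart(frequencies):
    print(line)
```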
Example: Bar Chart (Type 1): Blood Group
[Figure: bar chart of frequency by blood group; x-axis: Group A, Group B, Group AB, Group O; y-axis: 0 to 20]
Example: Bar Chart (Type 2): Blood Group
[Figure: bar chart by blood group with separate series for Rh + and Rh -; x-axis: Group A, Group B, Group AB, Group O; y-axis: 0 to 14]
Example: Bar Chart (Type 3): Blood Group
[Figure: bar chart by blood group; legend: Rh -, Rh +; x-axis: Group A, Group B, Group AB, Group O; y-axis: 0 to 20]
Example: Pie Chart
Blood Group
[Figure: pie chart with slices for Group A, Group B, Group AB, Group O]

The entire circle: 360°
Every 1% of the observations should correspond to
(0.01)(360) = 3.6 degrees.
The angle between the lines demarcating Group O is therefore
(50)(3.6) = 180 degrees.
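
The same rule gives the angle for every slice. The following is a minimal Python sketch, not part of the original slides; it works from the blood group counts rather than the rounded percentages, so the angles add up to exactly 360 degrees. The names `frequencies` and `angles` are illustrative.

```python
# Blood group frequencies from the class-work frequency table.
frequencies = {"A": 13, "B": 3, "AB": 2, "O": 18}
total = sum(frequencies.values())

# Each slice gets the same share of 360 degrees as its share of the observations.
angles = {group: 360 * count / total for group, count in frequencies.items()}

for group, angle in angles.items():
    print(f"Group {group}: {angle:.0f} degrees")
```

Using the rounded 36% for Group A would give (36)(3.6) = 129.6 degrees instead of 130, which is why the sketch starts from the counts.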
Pictograms

• Represent the data by means of picture symbols
• Decide on a suitable picture to represent a definite number of units
Pictograms
Example: Number of students by year

Year                 2000   2001   2002   2003
Number of Students   2000   3000   5000   7000
Tables and Graphs for Numerical Variables

Numerical Data
• Frequency Polygon
• Cumulative Frequency Polygon
• Histogram

Note: A line chart/graph is used to plot a variable against time to show trend (time series data).
Frequency Polygon
Example: Histogram and Frequency Polygon
[Figures: Histogram; Frequency Polygon; Cumulative frequency polygon (Ogive)]
Histogram
Description of Histogram:
❑ It is a chart that shows the frequency
distribution of data points for numerical
(quantitative) values.
❑ It represents data with rectangular bars.
❑ The height of each bar depicts the frequency
of the data values. A histogram differs from a
bar graph in that the vertical columns are
drawn with no space in between them.
Example: Histogram

Age Group   Frequency
15 - 20     3
21 - 25     21
26 - 30     15
31 - 35     1
Total       40

Continue on Whiteboard
Difference between Bar Graph and Histogram
❑ A bar graph is for categorical data, while a
histogram is for numerical data.
❑ A histogram differs from a bar graph in that
the vertical columns/bars are drawn with no
space in between them.
Frequency Polygon
Description of Frequency Polygon:
✓ It is used for grouped data.
✓ It is a frequency diagram that plots
the midpoints of the class intervals against the
frequencies, i.e. x-axis: midpoints; y-axis:
frequencies.
✓ Finally, we connect the frequency points with
straight lines.
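
The steps above can be sketched in code using the age-group data from the histogram example. The following is a minimal Python sketch, not part of the original slides; it assumes each midpoint is the average of the stated class limits, and it also computes the running totals (lowest class first) that a cumulative frequency polygon would plot. The names `classes`, `polygon_points` and `cumulative` are illustrative.

```python
# Age groups (lower limit, upper limit, frequency) from the histogram example.
classes = [(15, 20, 3), (21, 25, 21), (26, 30, 15), (31, 35, 1)]

# Midpoint of each class interval: (lower limit + upper limit) / 2.
midpoints = [(low + high) / 2 for low, high, _ in classes]
frequencies = [freq for _, _, freq in classes]

# Points of the frequency polygon: (midpoint, frequency), joined by straight lines.
polygon_points = list(zip(midpoints, frequencies))

# Running totals, lowest class first, for a cumulative frequency polygon.
cumulative = []
running = 0
for freq in frequencies:
    running += freq
    cumulative.append(running)

print(polygon_points)
print(cumulative)
```

The last running total equals the total number of observations, which is a useful check on the frequency column.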
Group Assignment
Use hypothetical quantitative data from the
field of chemistry and construct:
1. Frequency Distribution
2. Bar Chart
3. Pie Chart
4. Histogram
5. Cumulative Frequency Polygon (highest to lowest)
6. Cumulative Frequency Polygon (lowest to highest)
