Chapter 2 - Graphical Descriptive I
Chapter 2 - Graphical Descriptive I
Graphical
Descriptive Techniques I
2.1
Introduction & Re-cap…
Descriptive statistics involves arranging, summarizing, and
presenting a set of data in such a way that useful information
is produced.
Statistics
Data Information
2.2
Populations & Samples
Population Sample
Subset
The graphical & tabular methods presented here apply to both entire
populations and samples drawn from populations.
2.3
Definitions…
A variable is some characteristic of a population or sample.
E.g. student grades.
Typically denoted with a capital letter: X, Y, Z…
2.4
Types of Data & Information
Data (at least for purposes of Statistics) fall into three main
groups:
Interval Data
Nominal Data
Ordinal Data
2.5
Interval Data…
Interval data
• Real numbers, i.e. heights, weights, prices, etc.
• Also referred to as quantitative or numerical.
2.6
Nominal Data…
Nominal Data
• The values of nominal data are categories.
E.g. responses to questions about marital status, coded
as:
Single = 1, Married = 2, Divorced = 3, Widowed = 4
2.7
Ordinal Data…
Ordinal Data appear to be categorical in nature, but their
values have an order; a ranking to them:
2.9
Example
Example: Identify each of the following as example of nominal, ordinal
or interval data.
2.11
Example 2.1Work Status in the GSS 2008 Survey
[GSS2008*] In the 2008 General Social Survey, 2021 people were asked the
following questions.
“Last week, were you working full time, part time, going to school, keeping
house, or what?” The responses were
1. Working full time
2. Working part time
3. Temporarily not working
4. Unemployed, laid off
5. Retired
6. School
7. Keeping house
8. Other
The responses were recorded using the codes 1, 2, 3, 4, 5, 6, 7, and 8,
respectively.
2.12
Frequency and Relative Frequency Distributions
2.13
GSS 2008 Excel File-Nominal Data (Frequency-Excel
Instructions on pp 20-22)
Bar chart is used for frequency, but also can be used for relative measures.
These techniques are also valid for ordinal data. (bar charts and wedges in
pie charts are arranged in ascending or descending order).
2.16
Nominal Data
2.17
Relationship between Two Nominal Variables
So far we’ve looked at tabular and graphical techniques for
one variable.
2.18
Example 2.4
In a major North American city there are four competing
newspapers: the Post, Globe and Mail, Sun, and Star.
2.19
Example 2.4
A sample of newspaper readers was asked to report which
newspaper they read: Globe and Mail (1) Post (2), Star (3),
Sun (4), and to indicate whether they were blue-collar
worker (1), white-collar worker (2), or professional (3).
2.20
Example 2.4
By counting the number of times each of the 12
combinations occurs, we produced the Table 2.5.
Excel instructions are on page 34.
Occupation
Newspaper Blue Collar White Collar Professional Total
G&M 27 29 33 89
Post 18 43 51 112
Star 38 21 22 81
Sun 37 15 20 72
Total 120 108 126 354
2.21
Example 2.4
If occupation and newspaper are related, then there will be differences
in the newspapers read among the occupations. An easy way to see this
is to convert the frequencies in each column to relative frequencies in
each column. That is, compute the column totals and divide each
frequency by its column total.
Occupation
Newspaper Blue Collar White Collar Professional
G&M 27/120 =.23 29/108 = .27 33/126 = .26
Post 18/120 = .15 43/108 = .40 51/126 = .40
Star 38/120 = .32 21/108 = .19 22/126 = .17
Sun 37/120 = .31 15/108 = .14 20/126 = .16
2.22
Graphing the Relationship Between Two Nominal Variables
(Excel instructions are on page 35)
Use the data from the cross-classification table to create bar charts…
Professionals tend
to read the Post
more than twice as
often as the Star or
Sun…
2.23