Chapter 01-01
Exploring Data
1.1 Analyzing Categorical Data
The Practice of Statistics, 5th Edition
Starnes, Tabor, Yates, Moore
Categorical Variables
Categorical variables place individuals into one of several groups or
categories.
Frequency Table

Variable: Format. Values: the format categories. The first table gives the count for each value; the second gives the percent.

Format               Count of Stations
Adult Contemporary   1556
Adult Standards      1196
Contemporary Hit     569
Country              2066
News/Talk            2179
Oldies               1060
Religious            2014
Rock                 869
Spanish Language     750
Other Formats        1579
Total                13838

Relative Frequency Table

Format               Percent of Stations
Adult Contemporary   11.2
Adult Standards      8.6
Contemporary Hit     4.1
Country              14.9
News/Talk            15.7
Oldies               7.7
Religious            14.6
Rock                 6.3
Spanish Language     5.4
Other Formats        11.4
Total                99.9
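Each value in the Percent of Stations column is the count for that format divided by the total number of stations. A minimal Python sketch of that arithmetic, using the counts from the table above; the names `counts` and `relative_frequencies` are illustrative choices, not something the slides define:

```python
# Counts of stations by format, copied from the frequency table.
counts = {
    "Adult Contemporary": 1556,
    "Adult Standards": 1196,
    "Contemporary Hit": 569,
    "Country": 2066,
    "News/Talk": 2179,
    "Oldies": 1060,
    "Religious": 2014,
    "Rock": 869,
    "Spanish Language": 750,
    "Other Formats": 1579,
}


def relative_frequencies(counts):
    """Return each category's share of the total, as a percent."""
    total = sum(counts.values())
    return {category: 100 * n / total for category, n in counts.items()}


percents = relative_frequencies(counts)

# Print the percents rounded to one decimal place, as in the table.
for category, pct in percents.items():
    print(f"{category:<20}{pct:5.1f}")
```

The unrounded percents add to exactly 100. The rounded values shown in the table add to 99.9 because of roundoff error.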
[Bar graph: Count of Stations by Format]
[Pie chart: Percent of Stations by Format]
Response           Female   Male   Total
Almost no chance   96       98     194
Some chance        426      286    712
A 50-50 chance     696      720    1416
A good chance      663      758    1421
Almost certain     486      597    1083
Total              2367     2459   4826
Response           Percent
Almost no chance   194/4826 = 4.0%
Some chance        712/4826 = 14.8%
A 50-50 chance     1416/4826 = 29.3%
A good chance      1421/4826 = 29.4%
Almost certain     1083/4826 = 22.4%
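The percents above are the marginal distribution of the response variable: each row total of the two-way table divided by the grand total. A short Python sketch of that calculation, assuming the table is stored as a dictionary of (female, male) counts; the layout and the names `table`, `row_totals`, and `marginal` are illustrative only:

```python
# Two-way table from the survey: response -> (female count, male count).
table = {
    "Almost no chance": (96, 98),
    "Some chance": (426, 286),
    "A 50-50 chance": (696, 720),
    "A good chance": (663, 758),
    "Almost certain": (486, 597),
}

# Row totals are the counts in the "Total" column of the two-way table.
row_totals = {response: sum(cells) for response, cells in table.items()}

# The grand total is the number of individuals in the whole table.
grand_total = sum(row_totals.values())

# Marginal distribution: each row total as a percent of the grand total.
marginal = {
    response: 100 * total / grand_total
    for response, total in row_totals.items()
}

for response, pct in marginal.items():
    print(f"{response:<18}{row_totals[response]}/{grand_total} = {pct:.1f}%")
```

The marginal distribution ignores gender entirely; it uses only the totals in the margin of the table.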
[Bar graph: Percent by Survey Response]
Response           Male                Female
Almost no chance   98/2459 = 4.0%      96/2367 = 4.1%
Some chance        286/2459 = 11.6%    426/2367 = 18.0%
A 50-50 chance     720/2459 = 29.3%    696/2367 = 29.4%
A good chance      758/2459 = 30.8%    663/2367 = 28.0%
Almost certain     597/2459 = 24.3%    486/2367 = 20.5%
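These percents are conditional distributions: within each gender, every cell count is divided by that gender's column total rather than by the grand total. A minimal Python sketch under the same assumed layout, a dictionary of (female, male) counts; `conditional_distribution` is a hypothetical helper name:

```python
# Two-way table from the survey: response -> (female count, male count).
table = {
    "Almost no chance": (96, 98),
    "Some chance": (426, 286),
    "A 50-50 chance": (696, 720),
    "A good chance": (663, 758),
    "Almost certain": (486, 597),
}

FEMALE, MALE = 0, 1  # positions of each gender within a row's tuple


def conditional_distribution(table, column):
    """Distribution of responses among one gender, as percents."""
    column_total = sum(cells[column] for cells in table.values())
    return {
        response: 100 * cells[column] / column_total
        for response, cells in table.items()
    }


female = conditional_distribution(table, FEMALE)
male = conditional_distribution(table, MALE)

for response in table:
    print(f"{response:<18}{male[response]:5.1f}%  {female[response]:5.1f}%")
```

Comparing the two columns side by side is what reveals an association: for example, a larger share of females than males answered "Some chance" (18.0% versus 11.6%).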
[Segmented bar graph: Percent of each Response (Almost no chance, Some chance, 50-50 chance, Good chance, Almost certain), for Males and Females]
Caution!
Even a strong association between two categorical variables can
be influenced by other variables lurking in the background.
deceptive
CALCULATE and DISPLAY the marginal distribution of a
categorical variable from a two-way table