0% found this document useful (0 votes)
16 views14 pages

L02 - Ch4 - Data Presentation

Uploaded by

sarangjeon08
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
16 views14 pages

L02 - Ch4 - Data Presentation

Uploaded by

sarangjeon08
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 14

Statistics 1

Chapter 4:
Data Presentation

By Caleb Tan

Objectives
 Calculate the following:
- arithmetic mean, median, mode,
- standard deviation, variance, quartiles, range and
interquartile range
 Explain the use and limitations of the above quantities
 Draw and interpret: histograms, stem-and-leaf diagrams,
boxplots and cumulative frequency distributions
 Incorporate labels and titles correctly in your diagrams
and give the units you have used.

1
Data Presentation
 Dot Plot (self-study)

 Stem-and-Leaf Diagram

 Histogram

 Boxplot (later part)

 Cumulative Frequency Diagram (self-study)

Stem and Leaf Diagram


A stem and leaf display provides information about
the pattern of data. It assists in the discovery of

 Concentrations or clusters of particular values,

 Outlying or extreme observations (outliers), and

 The extent or lack of symmetry in the


distribution of the observations

2
Stem and Leaf Diagram
Example:

Consider the following mock exam marks of students in Statistics 1:


69 84 52 93 61 74 79 65 88 63
57 64 67 72 74 55 82 61 68 77

Draw a stem and leaf diagram for the data.

Note:
1. Title
2. Centre
3. Align
4. Gap

Stem and Leaf Diagram


SG Section 4.7.4

3
Histogram
To draw a Histogram, the first step is to
construct a frequency distribution table.

 Organise data into class intervals


(also called bins)

 Class intervals are continuous non-


overlapping

 Upper limits are not included

Histogram
SG Section 4.7.3

for Cumulative
Freq Density = Freq / Width Frequency
Diagram

4
Histogram

Vertical Axis is
Freq Density
Freq = Area

Horizontal Axis
is the Variable

Class Activity 2.1


Suppose a consumer organization was interested in studying weekly food &
living expenses of college students living on college campuses in New York
State. A survey of 40 students yielded the following (ordered array) to the
nearest dollar. Construct a histogram.

Weekly Living Expenses ($) of 40 NYS College Students

38 50 54 60
39 51 56 61
40 51 56 62
41 52 57 64
44 52 57 64
45 52 57 64
48 52 58 65
48 54 59 65
49 54 59 66
50 54 60 67

10

5
Measures of Location

 Mean

 Median

 Mode

11

Sample Mean SG Section 4.8.1

• Mean is the average of the dataset

Example:

32 28 67 39 19 48 32 44 37 24

Sample Mean,

12

6
Sample Median SG Section 4.8.2

• Median is the middle value of the ordered dataset

• Median Position = (n + 1) / 2

Example:

19 24 28 32 32 37 39 44 48 67

Median Position = (10 + 1) / 2 = 5.5

Median = (32 + 37) / 2 = 34.5

13

Sample Mode SG Section 4.8.3

• Mean is most frequent value in the dataset

Example:

19 24 28 32 32 37 39 44 48 67

Sample Mode = 32

14

7
Shape of Distribution (Skewness)

15

Class Activity 2.2


The following values represent the heights, in cm, of a random selection of
20 students:

172.5 162.5 166.5 174.0 180.0 176.5 186.5 179.0 164.5 196.5

174.0 177.0 169.5 172.5 192.0 181.5 171.0 169.0 171.0 172.5

a) Find the mean, median and mode.

b) Determine the shape of the distribution.

175.425, 173.25, 172.5

16

8
Measures of Spread

 Range

 Standard Deviation

 Interquartile Range

17

Sample Range SG Section 4.9.1

• The range is simply the

Largest value - Smallest value.

Example:

32 28 67 39 19 48 32 44 37 24

Range = 67 – 19 = 48

18

9
Sample Standard Deviation

Sample Variance, s2 =

Sample Standard Deviation, s =

Alternative Computational Formula:

19

Sample Standard Deviation


Example:

32 28 67 39 19 48 32 44 37 24

20

10
Sample Interquartile Range
SG Section 4.9.1
• Quartiles

25% 25% 25% 25%

Q1 Q2 Q3

Q1 position = (n+1)/4
Q2 position = 2(n+1)/4
Q3 position = 3(n+1)/4

Interquartile Range, IQR = Q3 – Q1

21

Sample Interquartile Range


Example:

19 24 28 32 32 37 39 44 48 67

Q1 Position = (10 + 1) / 4 = 2.75 so Q1 = 24 + 0.75(28-24) = 27

Q2 Position = 2(10 + 1) / 4 = 5.5 so Q2 = 32 + 0.5(37-32) = 34.5

Q3 Position = 3(10 + 1) / 4 = 8.25 so Q3 = 44 + 0.25(48-44) = 45

IQR = 45 - 27 = 18

22

11
Class Activity 2.3
The following values represent the heights, in cm, of a random selection of
20 students:

162.5 164.5 166.5 169.0 169.5 171.0 171.0 172.5 172.5 172.5

174.0 174.0 176.5 177.0 179.0 180.0 181.5 186.5 192.0 196.5

a) Find the quartiles.

b) Determine the interquartile range.

IQR: 9.875

23

Boxplot SG Section 4.9.2

24

12
Boxplot
Example:
162.5 164.5 166.5 169.0 169.5 171.0 171.0 172.5 172.5 172.5
174.0 174.0 176.5 177.0 179.0 180.0 181.5 186.5 192.0 196.5

Q1 = 169.875, Q2 = 173.25, Q3 = 179.75, IQR = 9.875.

 To find outliers:
◦ Q1 – 1.5(IQR) = 169.875 - (1.5)(9.875) = 155.0625
◦ Q3 + 1.5(IQR) = 179.75 + (1.5)(9.875) = 194.5625

 To find extreme outliers:


◦ Q1 - 3(IQR) = 169.875 - (3)(9.875) = 140.25
◦ Q3 + 3(IQR) = 179.75 + (3)(9.875) = 209.375

25

Boxplot
Example:
162.5 164.5 166.5 169.0 169.5 171.0 171.0 172.5 172.5 172.5
174.0 174.0 176.5 177.0 179.0 180.0 181.5 186.5 192.0 196.5

26

13
Grouped Data

 Mean → Use Midpoint (see Example 4.5)

 Median → Interpolation Method (see Example 4.7)

 Mode → Usually Use Modal Class/Group

 Standard Deviation → Use Midpoint

 Quartiles → Interpolation Method

27

Homework
VLE Activities

Do SG Section 4.13: Question 2 to 12

Do SG Section 4.15: Question 1 to 3

Do Self-Practice L02: All Questions

28

14

You might also like