Chap02 1

Chapter 2 of 'Statistics for Business and Economics' focuses on numerical data description, covering measures of central tendency such as mean, median, and mode, as well as measures of variation including range, variance, and standard deviation. It also discusses the empirical rule, weighted mean, and least squares regression for analyzing relationships between variables. Key concepts include the importance of understanding data distribution shapes and the use of quartiles and box plots for data visualization.


Statistics for

Business and Economics

7th Edition

Chapter 2

Describing Data: Numerical

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1
Chapter Goals
After completing this chapter, you should be able to:
■ Compute and interpret the mean, median, and mode for a
set of data
■ Find the range, variance, standard deviation, and
coefficient of variation and know what these values mean
■ Apply the empirical rule to describe the variation of
population values around the mean
■ Explain the weighted mean and when to use it
■ Explain how a least squares regression line estimates a
linear relationship between two variables

Chapter Topics
■ Measures of central tendency, variation, and shape
  ■ Mean, median, mode, geometric mean
  ■ Quartiles
  ■ Range, interquartile range, variance and standard deviation, coefficient of variation
  ■ Symmetric and skewed distributions
■ Population summary measures
  ■ Mean, variance, and standard deviation
  ■ The empirical rule and Bienaymé-Chebyshev rule
■ Five number summary and box-and-whisker plots
■ Covariance and coefficient of correlation
■ Pitfalls in numerical descriptive measures and ethical considerations

Describing Data Numerically
■ Central Tendency
  ■ Arithmetic Mean
  ■ Median
  ■ Mode
■ Variation
  ■ Range
  ■ Interquartile Range
  ■ Variance
  ■ Standard Deviation
  ■ Coefficient of Variation

2.1
Measures of Central Tendency
Overview
■ Mean: arithmetic average
■ Median: midpoint of ranked values
■ Mode: most frequently observed value

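Arithmetic Mean
■ The arithmetic mean (mean) is the most common measure of central tendency
■ For a population of N values:

$$\mu = \frac{\sum_{i=1}^{N} x_i}{N} = \frac{x_1 + x_2 + \cdots + x_N}{N}$$

where the x_i are the population values and N is the population size
■ For a sample of size n:

$$\bar{x} = \frac{\sum_{i=1}^{n} x_i}{n} = \frac{x_1 + x_2 + \cdots + x_n}{n}$$

where the x_i are the observed values and n is the sample size
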
Arithmetic Mean
(continued)

■ The most common measure of central tendency
■ Mean = sum of values divided by the number of values
■ Affected by extreme values (outliers)

[Figure: two dot plots on a 0 to 10 scale, one with Mean = 3 and one with Mean = 4]

Median
■ In an ordered list, the median is the “middle” number (50% above, 50% below)

[Figure: two dot plots on a 0 to 10 scale, both with Median = 3]

■ Not affected by extreme values
■ The location of the median:

$$\text{Median position} = \frac{n+1}{2} \text{ position in the ordered data}$$

■ If the number of values is odd, the median is the middle number
■ If the number of values is even, the median is the average of the two middle numbers
■ Note that $\frac{n+1}{2}$ is not the value of the median, only the position of the median in the ranked data

Mode
■ A measure of central tendency
■ Value that occurs most often
■ Not affected by extreme values
■ Used for either numerical or categorical data
■ There may be no mode
■ There may be several modes

[Figure: two dot plots, one on a 0 to 14 scale with Mode = 9 and one on a 0 to 6 scale with No Mode]

Review Example
■ Five houses on a hill by the beach

House Prices:
$2,000,000
500,000
300,000
100,000
100,000
Sum 3,000,000

■ Mean: ($3,000,000/5) = $600,000
■ Median: middle value of ranked data = $300,000
■ Mode: most frequent value = $100,000
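
The three measures in the review example can be checked with a short script. This is a minimal sketch using Python's standard `statistics` module; the variable names are illustrative and not part of the slides.

```python
from statistics import mean, median, mode

# House prices from the review example
prices = [2_000_000, 500_000, 300_000, 100_000, 100_000]

avg = mean(prices)     # sum of values divided by the number of values
mid = median(prices)   # middle value of the ranked data
most = mode(prices)    # most frequently observed value

print(f"mean = {avg}, median = {mid}, mode = {most}")
```

The single $2,000,000 house pulls the mean well above the median, which is why the median is often preferred when outliers are present.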

■ Mean is generally used, unless extreme values (outliers) exist . . .
■ Then median is often used, since the median is not sensitive to extreme values.
■ Example: Median home prices may be reported for a region, since the median is less sensitive to outliers

■ Describes how data are distributed
■ Measures of shape
■ Symmetric or skewed

Left-Skewed: Mean < Median
Symmetric: Mean = Median
Right-Skewed: Median < Mean

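Geometric Mean
■ Geometric mean
■ Used to measure the rate of change of a variable over time

$$\bar{x}_g = \sqrt[n]{x_1 \times x_2 \times \cdots \times x_n} = (x_1 \times x_2 \times \cdots \times x_n)^{1/n}$$

■ Geometric mean rate of return
■ Measures the status of an investment over time

$$\bar{r}_g = \left[(1 + x_1)(1 + x_2) \cdots (1 + x_n)\right]^{1/n} - 1$$

■ Where x_i is the rate of return in time period i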

An investment of $100,000 rose to $150,000 at the end of year one and increased to $180,000 at the end of year two:

Year one: 50% increase
Year two: 20% increase

What is the mean percentage return over time?

Use the 1-year returns to compute the arithmetic mean and the geometric mean:

Arithmetic mean rate of return (misleading result):

$$\bar{x} = \frac{50\% + 20\%}{2} = 35\%$$

Geometric mean rate of return (more accurate result):

$$\bar{r}_g = (1.50 \times 1.20)^{1/2} - 1 = \sqrt{1.80} - 1 \approx 34.16\%$$
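
To see why the arithmetic mean is misleading here, the sketch below compounds each average rate over the two years. It assumes the geometric mean rate of return is computed from growth factors (1 + rate); the variable names are illustrative.

```python
# Year-end values of the investment from the example
values = [100_000, 150_000, 180_000]

# One-year rates of return: 0.50 and 0.20
returns = [values[i + 1] / values[i] - 1 for i in range(len(values) - 1)]

arithmetic = sum(returns) / len(returns)

# Geometric mean rate of return: n-th root of the product of (1 + r_i), minus 1
growth = 1.0
for r in returns:
    growth *= 1 + r
geometric = growth ** (1 / len(returns)) - 1

# Compound each average rate over the full period and compare with $180,000
final_arith = values[0] * (1 + arithmetic) ** len(returns)
final_geo = values[0] * (1 + geometric) ** len(returns)

print(f"arithmetic = {arithmetic:.4f}, compounds to {final_arith:,.0f}")
print(f"geometric  = {geometric:.4f}, compounds to {final_geo:,.0f}")
```

Only the geometric rate reproduces the actual ending value of $180,000; the arithmetic rate overstates it.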
2.2
Measures of Variability
Variation
■ Range
■ Interquartile Range
■ Variance
■ Standard Deviation
■ Coefficient of Variation

■ Measures of variation give information on the spread or variability of the data values.

[Figure: distributions with the same center but different variation]

Range

■ Simplest measure of variation
■ Difference between the largest and the smallest observations:

$$\text{Range} = X_{\text{largest}} - X_{\text{smallest}}$$

Example:

[Figure: dot plot on a 0 to 14 scale]

Range = 14 - 1 = 13

Disadvantages of the Range
■ Ignores the way in which data are distributed

[Figure: two dot plots on a 7 to 12 scale with different distributions; in both, Range = 12 - 7 = 5]

■ Sensitive to outliers
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,5
Range = 5 - 1 = 4

1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,120
Range = 120 - 1 = 119

■ Can eliminate some outlier problems by using the interquartile range
■ Eliminate high- and low-valued observations and calculate the range of the middle 50% of the data
■ Interquartile range = 3rd quartile – 1st quartile

IQR = Q3 – Q1

Example:

X minimum = 12
Q1 = 30
Median (Q2) = 45
Q3 = 57
X maximum = 70

(25% of the observations fall in each of the four segments)

Interquartile range = 57 – 30 = 27

Quartiles
■ Quartiles split the ranked data into 4 segments with an equal number of values per segment (25% each, separated by Q1, Q2, and Q3)
■ The first quartile, Q1, is the value for which 25% of the observations are smaller and 75% are larger
■ Q2 is the same as the median (50% are smaller, 50% are larger)
■ Only 25% of the observations are greater than the third quartile

Find a quartile by determining the value in the appropriate position in the ranked data, where

First quartile position: Q1 = 0.25(n+1)
Second quartile position: Q2 = 0.50(n+1) (the median position)
Third quartile position: Q3 = 0.75(n+1)

where n is the number of observed values

■ Example: Find the first quartile

Sample Ranked Data: 11 12 13 16 16 17 18 21 22 (n = 9)

Q1 is in the 0.25(9+1) = 2.5 position of the ranked data, so use the value halfway between the 2nd and 3rd values: Q1 = 12.5
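
The position rule above can be sketched as a small function. The helper `quartile` is hypothetical, and linear interpolation for fractional positions is an assumption that generalizes the "halfway between" step in the example.

```python
import math

def quartile(ranked, q):
    """Value at the 1-based position q * (n + 1) in ranked data.

    Fractional positions are resolved by linear interpolation between
    the two neighbouring ranked values (an assumption of this sketch).
    """
    n = len(ranked)
    pos = q * (n + 1)
    lower = math.floor(pos)
    frac = pos - lower
    if lower < 1:          # position falls before the first value
        return ranked[0]
    if lower >= n:         # position falls at or after the last value
        return ranked[-1]
    return ranked[lower - 1] + frac * (ranked[lower] - ranked[lower - 1])

data = [11, 12, 13, 16, 16, 17, 18, 21, 22]   # ranked sample, n = 9

q1 = quartile(data, 0.25)
q2 = quartile(data, 0.50)
q3 = quartile(data, 0.75)
iqr = q3 - q1

print(f"Q1 = {q1}, Q2 = {q2}, Q3 = {q3}, IQR = {iqr}")
```

Other quartile conventions exist and can give slightly different values. On this sample, `statistics.quantiles(data, n=4)` with its default method returns the same three cut points.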

■ Average of squared deviations of values from the mean
■ Population variance:

$$\sigma^2 = \frac{\sum_{i=1}^{N} (x_i - \mu)^2}{N}$$

Where μ = population mean
N = population size
x_i = ith value of the variable x

■ Average (approximately) of squared deviations of values from the mean
■ Sample variance:

$$s^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1}$$

Where $\bar{x}$ = arithmetic mean
n = sample size
x_i = ith value of the variable x

Population Standard Deviation
■ Most commonly used measure of variation
■ Shows variation about the mean
■ Has the same units as the original data

■ Population standard deviation:

$$\sigma = \sqrt{\frac{\sum_{i=1}^{N} (x_i - \mu)^2}{N}}$$

Sample Standard Deviation
■ Most commonly used measure of variation
■ Shows variation about the mean
■ Has the same units as the original data

■ Sample standard deviation:

$$s = \sqrt{\frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1}}$$

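Calculation Example:
Sample Standard Deviation
Sample Data (x_i): 10 12 14 15 17 18 18 24
n = 8, Mean = $\bar{x}$ = 16

$$s = \sqrt{\frac{(10-16)^2 + (12-16)^2 + (14-16)^2 + \cdots + (24-16)^2}{8-1}} = \sqrt{\frac{130}{7}} \approx 4.3095$$

A measure of the “average” scatter around the mean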
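
The calculation can be reproduced step by step. This is a minimal sketch using the sample data from the example; the intermediate variable names are illustrative.

```python
import math
from statistics import stdev

data = [10, 12, 14, 15, 17, 18, 18, 24]   # sample data from the example

n = len(data)
x_bar = sum(data) / n                                # sample mean
sum_sq_dev = sum((x - x_bar) ** 2 for x in data)     # sum of squared deviations
sample_variance = sum_sq_dev / (n - 1)               # divide by n - 1 for a sample
s = math.sqrt(sample_variance)                       # sample standard deviation

print(f"mean = {x_bar}, sum of squares = {sum_sq_dev}, s = {s:.4f}")

# Cross-check against the standard library's sample standard deviation
print(f"statistics.stdev = {stdev(data):.4f}")
```

Dividing by n instead of n - 1 would give the population standard deviation, which `statistics.pstdev` computes.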
Measuring variation

[Figure: two distributions, one with a small standard deviation and one with a large standard deviation]

[Figure: three dot plots on an 11 to 21 scale with the same mean and different standard deviations]
Data A: Mean = 15.5, s = 3.338
Data B: Mean = 15.5, s = 0.926
Data C: Mean = 15.5, s = 4.570

Advantages of Variance and
Standard Deviation

■ Each value in the data set is used in the calculation
■ Values far from the mean are given extra weight (because deviations from the mean are squared)

Coefficient of Variation
■ Measures relative variation
■ Always in percentage (%)
■ Shows variation relative to mean
■ Can be used to compare two or more sets of data measured in different units

$$CV = \left(\frac{s}{\bar{x}}\right) \cdot 100\%$$

Comparing Coefficient
of Variation
■ Stock A:
  ■ Average price last year = $50
  ■ Standard deviation = $5
■ Stock B:
  ■ Average price last year = $100
  ■ Standard deviation = $5

Both stocks have the same standard deviation, but stock B is less variable relative to its price
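
The comparison can be made concrete by computing each coefficient of variation, taken here as the standard deviation divided by the mean, expressed as a percentage. The function name is a hypothetical helper.

```python
def coefficient_of_variation(std_dev, mean):
    """Standard deviation as a percentage of the mean."""
    return 100 * std_dev / mean

cv_a = coefficient_of_variation(5, 50)    # Stock A
cv_b = coefficient_of_variation(5, 100)   # Stock B

print(f"Stock A: CV = {cv_a}%")
print(f"Stock B: CV = {cv_b}%")
```

Stock B's coefficient of variation is half of Stock A's, even though both have the same standard deviation of $5.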

■ Descriptive statistics can be obtained from Microsoft® Excel
■ Select: data / data analysis / descriptive statistics
■ Enter details in dialog box

Using Excel
■ Select data / data analysis / descriptive statistics
■ Enter input range details
■ Check box for summary statistics

[Figure: Excel worksheet containing the house price data: $2,000,000; 500,000; 300,000; 100,000; 100,000]

■ For any population with mean μ and standard deviation σ, and k > 1, the percentage of observations that fall within the interval

[μ ± kσ]

is at least

$$100\left[1 - \frac{1}{k^2}\right]\%$$

■ Regardless of how the data are distributed, at least (1 - 1/k²) of the values will fall within k standard deviations of the mean (for k > 1)
■ Examples:

At least                  within
(1 - 1/1.5²) = 55.6%      k = 1.5 (μ ± 1.5σ)
(1 - 1/2²) = 75%          k = 2 (μ ± 2σ)
(1 - 1/3²) = 89%          k = 3 (μ ± 3σ)
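
The three example bounds follow directly from the formula. A minimal sketch, with `chebyshev_lower_bound` as a hypothetical helper name:

```python
def chebyshev_lower_bound(k):
    """Minimum fraction of values within k standard deviations of the mean."""
    if k <= 1:
        raise ValueError("the bound is only informative for k > 1")
    return 1 - 1 / k**2

bounds = {k: chebyshev_lower_bound(k) for k in (1.5, 2, 3)}

for k, bound in bounds.items():
    print(f"k = {k}: at least {bound:.1%} of values within mu ± {k} sigma")
```

These are guaranteed minimums for any distribution, so actual coverage is often much higher.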

The Empirical Rule
■ If the data distribution is bell-shaped, then the interval:
■ μ ± 1σ contains about 68% of the values in the population or the sample
■ μ ± 2σ contains about 95% of the values in the population or the sample
■ μ ± 3σ contains almost all (about 99.7%) of the values in the population or the sample
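
The percentages in the empirical rule match the normal distribution. The sketch below assumes "bell-shaped" means exactly normal, which is an idealization, and uses `statistics.NormalDist` from the Python standard library.

```python
from statistics import NormalDist

z = NormalDist()   # standard normal: mean 0, standard deviation 1

# Probability mass within k standard deviations of the mean
coverage = {k: z.cdf(k) - z.cdf(-k) for k in (1, 2, 3)}

for k, p in coverage.items():
    print(f"within {k} standard deviation(s): {p:.2%}")
```

Compare these with the Bienaymé-Chebyshev bounds: for k = 2 the guaranteed minimum is 75%, while a normal distribution gives about 95%.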

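■ The weighted mean of a set of data is

$$\bar{x}_w = \frac{\sum_{i=1}^{n} w_i x_i}{\sum_{i=1}^{n} w_i} = \frac{w_1 x_1 + w_2 x_2 + \cdots + w_n x_n}{\sum_{i=1}^{n} w_i}$$

■ Where w_i is the weight of the ith observation and $\sum w_i$ is the sum of the weights
■ Use when data are already grouped into n classes, with w_i values in the ith class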
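
A weighted mean can be sketched as follows. The values and weights are made up for illustration and do not come from the slides; `weighted_mean` is a hypothetical helper.

```python
def weighted_mean(values, weights):
    """Sum of weight * value, divided by the sum of the weights."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    total_weight = sum(weights)
    return sum(w * x for w, x in zip(weights, values)) / total_weight

# Hypothetical data: three class values with 2, 3, and 5 observations each
values = [4, 5, 6]
weights = [2, 3, 5]

wm = weighted_mean(values, weights)
simple = sum(values) / len(values)

print(f"weighted mean = {wm}, unweighted mean = {simple}")
```

The weighted mean sits above the unweighted mean here because the largest value carries the most weight.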

Approximations for Grouped Data
Suppose data are grouped into K classes, with frequencies f_1, f_2, . . ., f_K, and the midpoints of the classes are m_1, m_2, . . ., m_K

■ For a sample of n observations, the mean is

$$\bar{x} = \frac{\sum_{i=1}^{K} f_i m_i}{n}, \quad \text{where } n = \sum_{i=1}^{K} f_i$$

■ For a sample of n observations, the variance is

$$s^2 = \frac{\sum_{i=1}^{K} f_i (m_i - \bar{x})^2}{n - 1}$$
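
The grouped-data approximations can be sketched with a small frequency table. The midpoints and frequencies below are hypothetical, and the formulas assume the usual definitions: the mean is the frequency-weighted average of the class midpoints, and the sample variance divides by n - 1.

```python
# Hypothetical frequency table: class midpoints and class frequencies
midpoints = [5, 15, 25]
frequencies = [2, 5, 3]

n = sum(frequencies)   # total number of observations

# Approximate mean: each midpoint stands in for every value in its class
grouped_mean = sum(f * m for f, m in zip(frequencies, midpoints)) / n

# Approximate sample variance: frequency-weighted squared deviations over n - 1
grouped_variance = sum(
    f * (m - grouped_mean) ** 2 for f, m in zip(frequencies, midpoints)
) / (n - 1)

print(f"n = {n}, mean = {grouped_mean}, variance = {grouped_variance:.3f}")
```

These are approximations because the individual values inside each class are replaced by the class midpoint.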

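■ The population covariance:

$$\text{Cov}(x, y) = \sigma_{xy} = \frac{\sum_{i=1}^{N} (x_i - \mu_x)(y_i - \mu_y)}{N}$$

■ The sample covariance:

$$\text{Cov}(x, y) = s_{xy} = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{n - 1}$$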

■ Only concerned with the strength of the relationship

■ No causal effect is implied

■ Covariance between two variables:

Cov(x,y) > 0: x and y tend to move in the same direction
Cov(x,y) < 0: x and y tend to move in opposite directions
Cov(x,y) = 0: x and y have no linear relationship (this alone does not establish independence)

■ Population correlation coefficient:

$$\rho = \frac{\text{Cov}(x, y)}{\sigma_x \sigma_y}$$

■ Sample correlation coefficient:

$$r = \frac{\text{Cov}(x, y)}{s_x s_y}$$
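
The sample covariance and sample correlation coefficient can be computed from their usual definitions: the covariance divides the summed cross-products of deviations by n - 1, and r divides the covariance by the product of the two sample standard deviations. The paired data below are hypothetical and the helper names are illustrative.

```python
import math

def sample_covariance(x, y):
    """Sum of cross-products of deviations, divided by n - 1."""
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    return sum((a - x_bar) * (b - y_bar) for a, b in zip(x, y)) / (n - 1)

def sample_correlation(x, y):
    """Sample covariance divided by the product of sample standard deviations."""
    s_x = math.sqrt(sample_covariance(x, x))   # covariance of x with itself is its variance
    s_y = math.sqrt(sample_covariance(y, y))
    return sample_covariance(x, y) / (s_x * s_y)

# Hypothetical paired observations
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

s_xy = sample_covariance(x, y)
r = sample_correlation(x, y)

print(f"covariance = {s_xy}, correlation = {r:.4f}")
```

The covariance carries the units of x times the units of y, while r is unit free and always lies between -1 and 1.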

Features of
Correlation Coefficient, r
■ Unit free
■ Ranges between –1 and 1
■ The closer to –1, the stronger the negative linear relationship
■ The closer to 1, the stronger the positive linear relationship
■ The closer to 0, the weaker the linear relationship

[Figure: scatter plots of Y against X for r = -1, r = -.6, and r = 0]

■ Choose Correlation from the selection menu
■ Click OK . . .
■ Input data range and select appropriate options
■ Click OK to get output

■ r = .733
■ There is a relatively strong positive linear relationship between test score #1 and test score #2
■ Students who scored high on the first test tended to score high on the second test

Chapter Summary
■ Described measures of central tendency
■ Mean, median, mode
■ Illustrated the shape of the distribution
■ Symmetric, skewed
■ Described measures of variation
■ Range, interquartile range, variance and standard deviation,
coefficient of variation
■ Discussed measures of grouped data
■ Calculated measures of relationships between
variables
■ covariance and correlation coefficient

Locating Extreme Outliers: Z-Score

■ To compute the Z-score of a data value, subtract the mean and divide by the standard deviation.
■ The Z-score is the number of standard deviations a data value is from the mean.
■ A data value is considered an extreme outlier if its Z-score is less than -3.0 or greater than +3.0.
■ The larger the absolute value of the Z-score, the farther the data value is from the mean.
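
The Z-score rule can be applied to the house prices from the review example. This sketch assumes the sample standard deviation (n - 1 divisor) is used, since the slides do not say which one; `z_scores` is a hypothetical helper.

```python
from statistics import mean, stdev

def z_scores(data):
    """Z-score of each value: (value - mean) / sample standard deviation."""
    m = mean(data)
    s = stdev(data)   # assumption: sample standard deviation
    return [(x - m) / s for x in data]

# House prices from the review example
prices = [2_000_000, 500_000, 300_000, 100_000, 100_000]

zs = z_scores(prices)

# Extreme outliers have a Z-score below -3.0 or above +3.0
extreme = [x for x, z in zip(prices, zs) if abs(z) > 3.0]

for x, z in zip(prices, zs):
    print(f"{x:>9,}: z = {z:+.3f}")
print("extreme outliers:", extreme)
```

Even the $2,000,000 house is not flagged under this rule, because in a sample of only five values the outlier itself inflates the standard deviation.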
The Five-Number Summary

Relationships among the five-number
summary and distribution shape

Distribution Shape and The Boxplot

Boxplot Example

Outliers

■ An outlier is an unusual score relative to the dataset. It is inconsistent with the rest of the data. It will influence the mean and the standard deviation.
■ A data set might have no outliers, one outlier, or several outliers.
■ Two types: mild and extreme
