0% found this document useful (0 votes)
61 views64 pages

Chapter 3

This document summarizes key concepts from Chapter 3 of a statistics textbook. It covers measures of location such as the mean, median, and mode. It then discusses measures of variability, including the range, interquartile range, variance, and standard deviation. Finally, it mentions exploring data analysis and measures of association between two variables. The chapter provides definitions and formulas for important statistical measures used to describe and analyze numerical data.

Uploaded by

Fadly Nurullah
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
61 views64 pages

Chapter 3

This document summarizes key concepts from Chapter 3 of a statistics textbook. It covers measures of location such as the mean, median, and mode. It then discusses measures of variability, including the range, interquartile range, variance, and standard deviation. Finally, it mentions exploring data analysis and measures of association between two variables. The chapter provides definitions and formulas for important statistical measures used to describe and analyze numerical data.

Uploaded by

Fadly Nurullah
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 64

Chapter 3

Desciptive Statistics: Numerical Measures

Ahmad Fadly Nurullah bin Rasedee


[email protected]

December 17, 2015


www.unikl.edu.my

Chapter 3

small-corne

Contents

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers

Measures of Location

Exploratory Data Analysis

Measures of Variability

Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Measures of Distribution Shape, Relative Location, and


Detection of Outliers
Exploratory Data Analysis
Measures of Association Between Two Variables
The Weighted Mean and Working with Grouped Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

2/64

Measures of Location

small-corne

Mean

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis

Mean or Average value is a measure of central location


computed by summing the data values and dividing by the
number of observations.

Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

The mean is denoted by x; if the data are for a population,


the mean is denoted by the Greek letter

Sample mean
P
x=

Ahmad Fadly Nurullah bin Rasedee

xi
n

Chapter 3

December 17, 2015

3/64

Measures of Location

small-corne

Mean

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

4/64

Measures of Location

small-corne

Mean

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis

Population mean

Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

P
=

xi
N

The sample mean x is a point estimator of the population


mean, .

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

5/64

Measures of Location

small-corne

Median

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers

The median is a measure of central location provided by


the value in the middle when the data are arranged in
ascending order.

Exploratory Data Analysis


Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Median
Arrange the data in ascending order (smallest value to
largest value)
For an odd number of observations, the median is the
middle value.
For an even number of observations, the median is the
average of the two middle values.

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

6/64

Measures of Location

small-corne

Median

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

7/64

Measures of Location

small-corne

Mode

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

A third measure of location is the mode.

Mode
The mode is a measure of location, defined as the value
that occurs with greatest frequency.

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

8/64

Measures of Location

small-corne

Percentile

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

A percentile provides information about how the data are


spread over the interval from the smallest value to the
largest value.

Percentile
The pth percentile is a value such that at least p percent
of the observations are less than or equal to this value
and at least (100-p) percent of the observations are
greater than or equal to this value.

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

9/64

Measures of Location

small-corne

Percentile

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

10/64

Measures of Location

small-corne

Percentile

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

11/64

Measures of Location

small-corne

Quartiles

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

It is often desirable to divide data into four parts, with each


part containing approximately one-fourth, or 25% of the
observations. The division points are referred to as the
quartiles and are defined as
Q1 = first quartile, or 25th percentile.
Q2 = second quartile, or 50th percentile (aslo median).
Q3 = third quartile, or 75th percentile.

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

12/64

Measures of Variability

small-corne

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

13/64

Measures of Variability

small-corne

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

14/64

Measures of Variability

small-corne

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

In addition to measures of location, it is often desirable


to consider measures of variability,or dispersion.
The variability in the delivery time creates uncertainty for
production scheduling. Methods in this section help
measure and understand variability.

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

15/64

Measures of Variability

small-corne

Range

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

The simplest measure of variability is the Range. Range


A measure of variability, defined to be the largest value
minus the smallest value.

Range
Range = Largest value Smallest Value

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

16/64

Measures of Variability

small-corne

Interquartile range (IQR)

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Interquartile range (IQR) a measure of variability, defined


to be the difference between the third and first quartiles.

Interquartile range (IQR)


IQR = Q3 Q1

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

17/64

Measures of Variability

small-corne

Variance

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Variance a measure of variability based on the squared


deviations of the data values (value of each observation,
xi ) about the mean.
The difference between each xi and the mean ( for a
sample, for a population) is called a deviation about
the mean. A deviation about the mean is written (xi x);
for a population, it is written (xi ).

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

18/64

Measures of Variability

small-corne

Variance

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

If the data are for a population, the average of the squared


deviations is called the population variance. For a
population of N observations and with denoting the
population mean, the definition of the population variance
is as follows.

Population variance
2 =

Ahmad Fadly Nurullah bin Rasedee

(xi )2
N

Chapter 3

December 17, 2015

19/64

Measures of Variability

small-corne

Variance

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers

the sample variance, denoted by s2 , is defined as follows

Exploratory Data Analysis


Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Sample variance
2

s =

Ahmad Fadly Nurullah bin Rasedee

(xi x)2
n1

Chapter 3

December 17, 2015

20/64

Measures of Variability

small-corne

Variance

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers

The variance is useful in comparing the variability of two


or more variables.

Exploratory Data Analysis


Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

21/64

Measures of Variability

small-corne

Variance

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

22/64

Measures of Variability

small-corne

Standard deviation

Contents
Measures of Location
Measures of Variability

Standard deviation a measure of variability computed by


taking the positive square root of the variance.

Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers

Standard deviation

Exploratory Data Analysis


Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

p
s2

Population standard deviation = = 2


Sample standard deviation = s =

The sample standard deviation s is the estimator of the


population standard deviation .
The standard deviation is easier to interpret than the
variance because the standard deviation is measured in
the same units as the data.
Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

23/64

Measures of Variability

small-corne

Coefficient of variation

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Coefficient of variation a measure of relative variability


computed by dividing the standard deviation by the mean
and multiplying by 100.

Coefficient of variation



Standard deviation
100 %
Mean

The coefficient of variation is a relative measure of


variability; it measures the standard deviation relative to
the mean.

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

24/64

Measures of Distribution Shape, Relative


Location, and Detection of Outliers

small-corne

Skewness
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Skewness a measure of the shape of a data distribution.


Data skewed to the left result in negative skewness; a
symmetric data distribution results in zero skewness; and
data skewed to the right result in positive skewness.

The formula for the skewness of sample data:


Skewness =

X
n
(n 1)(n 2)

xi x
s

3
%

The coefficient of variation is a relative measure of


variability; it measures the standard deviation relative to
the mean.
Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

25/64

Measures of Distribution Shape, Relative


Location, and Detection of Outliers

small-corne

Skewness
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

26/64

Measures of Distribution Shape, Relative


Location, and Detection of Outliers

small-corne

z-Scores
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers

z-Score a value computed by dividing the deviation about


the mean (xi x) by the standard deviation s. A z-score
is referred to as a standardized value and denotes the
number of standard deviations xi is from the mean.

Exploratory Data Analysis


Measures of Association
Between Two Variables

z-Scores

The Weighted Mean and


Working with Grouped
Data

zi =

xi x
s

where zi = the z-score for xi


x =the sample mean
s = the sample standard deviation
The coefficient of variation is a relative measure of
variability; it measures the standard deviation relative to
the mean.
Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

27/64

Measures of Distribution Shape, Relative


Location, and Detection of Outliers

small-corne

z-Scores
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

The z-score is often called the standardized value.


Can be interpreted as the number of standard deviations
xi is from the mean x.
The z-score for any observation can be interpreted as a
measure of the relative location of the observation in a
data set. Thus, observations in two different data sets
with the same z-score can be said to have the same

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

28/64

Measures of Distribution Shape, Relative


Location, and Detection of Outliers

small-corne

Chebyshevs theorem
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Chebyshevs theorem a theorem that can be used to


make statements about the proportion of data values that
must be within a specified number of standard deviations
of the mean.

Chebyshevs theorem
At least (1 1/z 2 ) of the data values must be within z
standard deviations of the mean, where z is any value
greater than 1.

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

29/64

Measures of Distribution Shape, Relative


Location, and Detection of Outliers

small-corne

Chebyshevs theorem
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Some of the implications of this theorem, with z = 2, 3


and 4 standard deviations, follow.
At least .75, or 75%, of the data values must be within
z = 2 standard deviations of the mean.
At least .89, or 89%, of the data values must be within
z = 3 standard deviations of the mean.
At least .94, or 94%, of the data values must be within
z = 4 standard deviations of the mean.
Chebyshevs theorem requires z > 1; but z need not be
an integer.

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

30/64

Measures of Distribution Shape, Relative


Location, and Detection of Outliers

small-corne

Chebyshevs theorem
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

31/64

Measures of Distribution Shape, Relative


Location, and Detection of Outliers

small-corne

Empirical rule
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Empirical rule rule that can be used to compute the


percentage of data values that must be within one, two,
and three standard deviations of the mean for data that
exhibit a bell-shaped distribution.
When the data are believed to approximate this
distribution, the empirical rule can be used to determine
the percentage of data values that must be within a
specified number of standard deviations of the mean.

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

32/64

Measures of Distribution Shape, Relative


Location, and Detection of Outliers

small-corne

Empirical rule
Contents
Measures of Location
Measures of Variability

Empirical rule

Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

For data having a bell-shaped distribution:


Approximately 68% of the data values will be within one
standard deviation of the mean.
Approximately 95% of the data values will be within two
standard deviations of the mean.
Almost all the data values will be within three standard
deviations of the mean.
The empirical rule is based on the normal probability
distribution, which will be discussed in Chapter 6. The
normal distribution is used extensively throughout the text.
Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

33/64

Measures of Distribution Shape, Relative


Location, and Detection of Outliers

small-corne

Empirical rule
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

34/64

Measures of Distribution Shape, Relative


Location, and Detection of Outliers

small-corne

Detection of Outliers
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Outlier extreme values unusually small or unusually large


data value.
An outlier may be a data value that has been incorrectly
recorded.
Standardized values (z-scores) can be used to identify
outliers.
It is a good idea to check for outliers before making
decisions based on data analysis. Errors are often made
in recording data and entering data into the computer.
Outliers should not necessarily be deleted, but their
accuracy and appropriateness should be verified

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

35/64

Exploratory Data Analysis

small-corne

Five-Number Summary

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Five-number summary An exploratory data analysis


technique that uses five numbers to summarize the data:
1. Smallest value.
2. First quartile, (Q1 ).
3. Median, (Q2 ).
4. Third quartile, (Q3 ).
5. Largest value

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

36/64

Exploratory Data Analysis

small-corne

Five-Number Summary

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

37/64

Exploratory Data Analysis

small-corne

Box Plot

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Box plot a graphical summary of data based on a


five-number summary. A key to the development of a box
plot is the computation of the median and the quartiles,Q1
and Q3 . The interquartile range, IQR = Q3 Q1 , is also
used.
Box plots provide another way to identify outliers. But they
do not necessarily identify the same values as those with
a z-score less than 3 or greater than +3. Either or both
procedures may be used.

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

38/64

Exploratory Data Analysis

small-corne

Box Plot

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

39/64

Exploratory Data Analysis

small-corne

Box Plot

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

40/64

Exploratory Data Analysis

small-corne

Box Plot

Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

41/64

Measures of Association Between Two


Variables

small-corne

Covariance
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Covariance a measure of linear association between two


variables. Positive values indicate a positive relationship;
negative values indicate a negative relationship. For a
sample of size n with the observations (x1 , y1 ), (x2 , y2 ),
and so on, the sample covariance is defined as follows:

Sample Covariance
sxy

Ahmad Fadly Nurullah bin Rasedee

P
(xi x)(yi y )
=
n1

Chapter 3

December 17, 2015

42/64

Measures of Association Between Two


Variables

small-corne

Covariance
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

43/64

Measures of Association Between Two


Variables

small-corne

Covariance
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

44/64

Measures of Association Between Two


Variables

small-corne

Covariance
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

45/64

Measures of Association Between Two


Variables

small-corne

Covariance
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

The formula for computing the covariance of a population


of size N

Population Covariance
xy

Ahmad Fadly Nurullah bin Rasedee

P
(xi x )(yi y )
=
N

Chapter 3

December 17, 2015

46/64

Measures of Association Between Two


Variables

small-corne

Interpretation of the Covariance


Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

47/64

Measures of Association Between Two


Variables

small-corne

Interpretation of the Covariance


Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

48/64

Measures of Association Between Two


Variables

small-corne

Interpretation of the Covariance


Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

49/64

Measures of Association Between Two


Variables

small-corne

Correlation Coefficient
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Correlation coefficient A measure of linear association


between two variables that takes on values between 1
and +1. Values near +1 indicate a strong positive linear
relationship; values near 1 indicate a strong negative
linear relationship; and values near zero indicate the lack
of a linear relationship.

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

50/64

Measures of Association Between Two


Variables

small-corne

Correlation Coefficient
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers

Pearson product moment correlation coefficient:


sample data

Exploratory Data Analysis


Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

rxy =

sxy
sx sy

where
rxy =sample correlation coefficient
sxy = sample covariance
sx = sample standard deviation of x
sy = sample standard deviation of y

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

51/64

Measures of Association Between Two


Variables

small-corne

Correlation Coefficient
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

the Pearson product moment correlation coefficient for


sample data (commonly referred to more simply as the
sample correlation coefficient) is computed by dividing the
sample covariance by the product of the sample standard
deviation of x and the sample standard deviation of y .

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

52/64

Measures of Association Between Two


Variables

small-corne

Correlation Coefficient
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers

Pearson product moment correlation coefficient:


population data

Exploratory Data Analysis


Measures of Association
Between Two Variables

xy =

The Weighted Mean and


Working with Grouped
Data

xy
x y

where
xy = sample correlation coefficient
xy = sample covariance
x = sample standard deviation of x
y = sample standard deviation of y
The sample correlation coefficient rxy provides an
estimate of the population correlation coefficient xy .
Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

53/64

Measures of Association Between Two


Variables

small-corne

Interpretation of the Correlation Coefficient


Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

54/64

Measures of Association Between Two


Variables

small-corne

Interpretation of the Correlation Coefficient


Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

55/64

The Weighted Mean and Working with


Grouped Data

small-corne

Weighted mean
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis

Weighted mean The mean obtained by assigning each


observation a weight that reflects its importance.

Weighted mean

Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

P
wi xi
x= P
wi
where
xi = value of observation i
wi = weight for observation i
Computing a grade point average is a good example of
the use of a weighted mean.
Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

56/64

The Weighted Mean and Working with


Grouped Data

small-corne

Weighted mean
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

57/64

The Weighted Mean and Working with


Grouped Data

small-corne

Grouped Data
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Grouped data Data available in class intervals as


summarized by a frequency distribution.
weighted mean formula can be used to obtain
approximations of the mean, variance, and standard
deviation for grouped data.

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

58/64

The Weighted Mean and Working with


Grouped Data

small-corne

Grouped Data
Contents

Sample variance for grouped data

Measures of Location
Measures of Variability
Measures of Variability

Measures of Distribution
Shape, Relative Location,
and Detection of Outliers

x=

Exploratory Data Analysis


Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

fi Mi
n

where
Mi = the midpoint for class i
fi = the frequency for class i
n = the sample size

Sample variance for grouped data


2

s =
Ahmad Fadly Nurullah bin Rasedee

fi (Mi x)2
n1
Chapter 3

December 17, 2015

59/64

The Weighted Mean and Working with


Grouped Data

small-corne

Grouped Data
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

60/64

The Weighted Mean and Working with


Grouped Data

small-corne

Grouped Data
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

61/64

The Weighted Mean and Working with


Grouped Data

small-corne

Grouped Data
Contents
Measures of Location
Measures of Variability
Measures of Variability

Population variance for grouped data

Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis

Measures of Association
Between Two Variables

The Weighted Mean and


Working with Grouped
Data

fi Mi
N

Population variance for grouped data


2

Ahmad Fadly Nurullah bin Rasedee

fi (Mi )2
N

Chapter 3

December 17, 2015

62/64

The Weighted Mean and Working with


Grouped Data

small-corne

Grouped Data
Contents
Measures of Location
Measures of Variability
Measures of Variability
Measures of Distribution
Shape, Relative Location,
and Detection of Outliers
Exploratory Data Analysis
Measures of Association
Between Two Variables
The Weighted Mean and
Working with Grouped
Data

Ahmad Fadly Nurullah bin Rasedee

Chapter 3

December 17, 2015

63/64

Thank You

You might also like