0% found this document useful (0 votes)
172 views14 pages

ch4 Standard Deviation

The standard deviation measures how spread out the values are from the average value. It is a measure of variation or dispersion. The range tells us the spread between the lowest and highest values. The variance is the average of the squared deviations from the mean. The standard deviation is the square root of the variance.

Uploaded by

Akbar Suhendi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
172 views14 pages

ch4 Standard Deviation

The standard deviation measures how spread out the values are from the average value. It is a measure of variation or dispersion. The range tells us the spread between the lowest and highest values. The variance is the average of the squared deviations from the mean. The standard deviation is the square root of the variance.

Uploaded by

Akbar Suhendi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 14

Stat 2411 Statistical Methods

Chapter 4. Measure of Variation


4.1 The Range
• Difference between the largest and smallest values
3, 4, 6, 2, 1, 9

1, 2, 3, 4, 6, 9

Range=9-1=8
4.2 Variance and Standard
Deviation
For a population with values
x1, x2, … …, xn
The center is the population mean


 x i

n are
The deviations from the mean

x1   , x2   , , xn  
Consider the population – Diameters of all ball bearings
produced by machine: x1, x2, … …, xn

Let  = population mean n = population size


Then
n
x
 in
i 1

 2  population variance
( xi   )2
n
  mean of (x -  )  
2 2
n
i 1
Average squared
deviation from mean   population standard deviation
  2
Many calculators have a function for  x
Sample variance
• For a sample of size n, the sample variance
is
n
1
s 
2

n  1 i 1
( xi  x ) 2

2
• Why divide by n -1? This makes s an
unbiased estimator of  2
. Unbiased means
on the average correct.
Suppose we have a large population of ball bearings with diameters =1cm
and
  0.02   0.0004
2

2
Sample x s
1 0.98 0.00032
2 1.03 0.00031
3 1.01 0.00045
4 1.02 0.00052
. . .
. . .
∞ ------ --------
Mean 1.00 0.0004

( xi   )2
n
If we knew we would find ˆ   2

Fact i 1
n
min  ( xi  m)   ( xi  x )
2 2

So
 (xi  x )   ( xi   ) and
2 2  (x i  x)2 would be too small for .
n
Dividing by n-1 makes s2 come out right ()on average.
Sample Standard Deviation
Variance:
 ( x  x ) 2
s2  i
n 1

Standard Deviation:

( xi  x )2
s
n 1

The standard deviation (s) measures spread (or


variation) by looking at how far observations are from
the mean.
Example
On an exam I might ask you to write a numerical
expression for s for the data for the sample.

x1  8 x2  9 x3  4
894
x 7
3
(8  7 ) 2
 (9  7 ) 2
 ( 4  7 ) 2
14
s 
2
 7
3 1 2
s  Sample standard deviation
 7  2.65

(8  7)2  (9  7)2  (4  7)2


s
2
Choosing Measures of Center and Spread

 Use the mean & standard deviation for “bell-


shaped” distributions, where data are
symmetric and the average score is typical,
i.e. no outliers.
 Use the five number summary (Min, Q1,
Median, Q3, Max) for skewed data where
very large or small observations make the
mean less representative and to highlight the
range of outliers.
4.3 Application of the Standard Deviation
• Chebyshev’s Theorem – skip
• For bell – shaped histograms (or approximately
normal distributed, we will talk more about this
later)



Approx. 68% of the obs. are between 


Approx. 95% of the obs. are between 
Approx. 99.7% of the obs. are between 
The same is true for s and x
Standardizing Observations – z-scores
If we measure in units of size , about the mean , we
can transform our data to standard units: # of standard
deviations from average.
This is called standardizing.
So if x is an observation from a data set that has mean
 and standard deviation the standardized value of x
is
x
z

A standardized value is often called a z-score.
Example
In the US, the systolic blood pressure of men aged 20
has mean 120 and standard deviation 10.

1) We can expect 95% of our observations fall within

2) The systolic bp of a 20-yr old man is 130. Find the z-score for
his bp:
Exercise 1: The Standard Deviation (s)
26 systolic blood pressure

108 134 100 108 112 112 112 122 116


116 120 108 108 96 114 108 128
114 112 124 90 102 106 124 130 116
X = 113.08 mm Hg
Exercise 2: z-score

In the US, the systolic blood pressure of


men aged 20 has mean 120 and standard
deviation 10.
Q1. what proportion of the bps have a value
outside the range 110 to 130?

Q2. What is the z-score of a blood pressure


value of 100?

You might also like