0% found this document useful (0 votes)

306 views16 pages

Variance and Standard Deviation

Uploaded by

mehtab sana

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

306 views16 pages

Variance and Standard Deviation

Uploaded by

mehtab sana

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 16

Statistics Canada

www.statcan.gc.ca
Skip to content | Skip to institutional links

 Français
 Home
 Contact Us
 Help
 Search
 canada.gc.ca
Home > Publications > 12-004-X > Main page > Measures of spread >

Publications

 Statistics: Power from Data!

o 12-004-X
o Main page
o Glossary
o Bibliography

 Measures of spread
o Welcome page
o Range and quartiles
o Variance and standard deviation
o Five-number summaries
o Constructing box and whisker plots
o Exercises
o Answers

Variance and standard deviation

Archived Content

Information identified as archived is provided for reference, research or recordkeeping

purposes. It is not subject to the Government of Canada Web Standards and has not been
altered or updated since it was archived. Please contact us to request a format other than
those available.

 Properties of standard deviation

 Discrete variables
 Example 1 – Standard deviation
 Frequency table (discrete variables)
 Example 2 – Standard deviation calculated using a frequency table
 Example 3 – Standard deviation using grouped variables (continuous or discrete)
 Example 4 – Standard deviation
 Example 5 – Standard deviation

Unlike range and quartiles, the variance combines all the values in a data set to produce a
measure of spread. The variance (symbolized by S2) and standard deviation (the square root of
the variance, symbolized by S) are the most commonly used measures of spread.

We know that variance is a measure of how spread out a data set is. It is calculated as the
average squared deviation of each number from the mean of a data set. For example, for the
numbers 1, 2, and 3 the mean is 2 and the variance is 0.667.

[(1 - 2)2 + (2 - 2)2 + (3 - 2)2] ÷ 3 = 0.667

[squaring deviation from the mean] ÷ number of observations = variance

Variance (S2) = average squared deviation of values from mean

Calculating variance involves squaring deviations, so it does not have the same unit of
measurement as the original observations. For example, lengths measured in metres (m) have a
variance measured in metres squared (m2).

Taking the square root of the variance gives us the units used in the original scale and this is the
standard deviation.

Standard deviation (S) = square root of the variance

Standard deviation is the measure of spread most commonly used in statistical practice when the
mean is used to calculate central tendency. Thus, it measures spread around the mean. Because
of its close links with the mean, standard deviation can be greatly affected if the mean gives a
poor measure of central tendency.

Standard deviation is also influenced by outliers one value could contribute largely to the results
of the standard deviation. In that sense, the standard deviation is a good indicator of the
presence of outliers. This makes standard deviation a very useful measure of spread for
symmetrical distributions with no outliers.

Standard deviation is also useful when comparing the spread of two separate data sets that have
approximately the same mean. The data set with the smaller standard deviation has a narrower
spread of measurements around the mean and therefore usually has comparatively fewer high or
low values. An item selected at random from a data set whose standard deviation is low has a
better chance of being close to the mean than an item from a data set whose standard deviation
is higher.

Generally, the more widely spread the values are, the larger the standard deviation is. For
example, imagine that we have to separate two different sets of exam results from a class of
30 students the first exam has marks ranging from 31% to 98%, the other ranges from 82% to
93%. Given these ranges, the standard deviation would be larger for the results of the first
exam.

Standard deviation might be difficult to interpret in terms of how big it has to be in order to
consider the data widely spread. The size of the mean value of the data set depends on the size
of the standard deviation. When you are measuring something that is in the millions, having
measures that are "close" to the mean value does not have the same meaning as when you are
measuring the weight of two individuals. For example, a measure of two large companies with a
difference of $10,000 in annual revenues is considered pretty close, while the measure of two
individuals with a weight difference of 30 kilograms is considered far apart. This is why, in most
situations, it is useful to assess the size of the standard deviation relative to the mean of the
data set.

Although standard deviation is less susceptible to extreme values than the range, standard
deviation is still more sensitive than the semi-quartile range. If the possibility of high values
(outliers) presents itself, then the standard deviation should be supplemented by the semi-
quartile range.

Properties of standard deviation

When using standard deviation keep in mind the following properties.

 Standard deviation is only used to measure spread or dispersion around the mean of a
data set.
 Standard deviation is never negative.
 Standard deviation is sensitive to outliers. A single outlier can raise the standard
deviation and in turn, distort the picture of spread.
 For data with approximately the same mean, the greater the spread, the greater the
standard deviation.
 If all values of a data set are the same, the standard deviation is zero (because each
value is equal to the mean).

When analysing normally distributed data, standard deviation can be used in conjunction with
the mean in order to calculate data intervals.

If = mean, S = standard deviation and x = a value in the data set, then

 about 68% of the data lie in the interval: - S < x < + S.

 about 95% of the data lie in the interval: - 2S < x < + 2S.
 about 99% of the data lie in the interval: - 3S < x < + 3S.

Discrete variables

The variance for a discrete variable made up of n observations is defined as:

The standard deviation for a discrete variable made up of n observations is the positive square
root of the variance and is defined as:

Use this step-by-step approach to find the standard deviation for a discrete variable.

1. Calculate the mean.

2. Subtract the mean from each observation.
3. Square each of the resulting observations.
4. Add these squared results together.
5. Divide this total by the number of observations (variance, S2).
6. Use the positive square root (standard deviation, S).

Example 1 – Standard deviation

A hen lays eight eggs. Each egg was weighed and recorded as follows:

60 g, 56 g, 61 g, 68 g, 51 g, 53 g, 69 g, 54 g.

a. First, calculate the mean:

b. Now, find the standard deviation.

Table 1. Weight of eggs, in

grams

Weight (x) (x - ) (x - )2

60 1 1

56 -3 9

61 2 4

68 9 81

51 -8 64

53 -6 36

69 10 100

54 -5 25

472 320
c.
Using the information from the above table, we can see that

In order to calculate the standard deviation, we must use the following formula:

Frequency table (discrete variables)

The formulas for variance and standard deviation change slightly if observations are grouped into
a frequency table. Squared deviations are multiplied by each frequency's value, and then the
total of these results is calculated.

In a frequency table, the variance for a discrete variable is defined as

The standard deviation for a discrete variable is defined as

Example 2 – Standard deviation calculated using a frequency table

Thirty farmers were asked how many farm workers they hire during a typical harvest season.
Their responses were:

4, 5, 6, 5, 3, 2, 8, 0, 4, 6, 7, 8, 4, 5, 7, 9, 8, 6, 7, 5, 5, 4, 2, 1, 9, 3, 3, 4, 6, 4

Table 2. Thirty farmers were asked how many farm workers they
hire during a typical harvest season. Their responses were:

Workers (x) Tally Frequency (f) (xf) (x - ) (x - )2 (x - )2f

0 1 0 -5 25 25
1 1 1 -4 16 16

2 2 4 -3 9 18

3 3 9 -2 4 12

4 6 24 -1 1 6

5 5 25 0 0 0

6 4 24 1 1 4

7 3 21 2 4 12

8 3 24 3 9 27

9 2 18 4 16 32

30 150 152

To calculate the mean:

To calculate the standard deviation:

Example 3 – Standard deviation using grouped variables (continuous or
discrete)

220 students were asked the number of hours per week they spent watching television. With this
information, calculate the mean and standard deviation of hours spent watching television by the
220 students.

Table 3. Number of hours per

week spent watching
television

Hours Number of students

10 to 14 2

15 to 19 12

20 to 24 23

25 to 29 60

30 to 34 77

35 to 39 38

40 to 44 8

a. First, using the number of students as the frequency, find the midpoint of time
intervals.
b. Now calculate the mean using the midpoint (x) and the frequency (f).

Note: In this example, you are using a continuous variable that has been rounded to the nearest
integer. The group of 10 to 14 is actually 9.5 to 14.499 (as the 9.5 would be rounded up to 10
and the 14.499 would be rounded down to 14). The interval has a length of 5 but the midpoint
is 12 (9.5 + 2.5 = 12).

6,560 = (2 X 12 + 12 X 17 + 23 X 22 + 60 X 27 + 77 X 32 + 38 X 37 + 8 X 42)
Then, calculate the numbers for the xf, (x - ), (x - )2 and (x - )2f formulas.

Add them to the frequency table below.

Table 4. Number of hours spent watching television

Hours Midpoint (x) Frequency (f) xf (x - ) (x - )2 (x - )2f

10 to 14 12 2 24 -17.82 317.6 635.2

15 to 19 17 12 204 -12.82 164.4 1,972.8

20 to 24 22 23 506 -7.82 61.2 1,407.6

25 to 29 27 60 1,620 -2.82 8.0 480.0

30 to 34 32 77 2,464 2.18 4.8 369.6

35 to 39 37 38 1,406 7.18 51.6 1,960.8

40 to 44 42 8 336 12.18 148.4 1,187.2

220 6,560 8,013.2

Example 4 – Standard deviation

Use the information found in the table above to find the standard deviation.

Note: During calculations, when a variable is grouped by class intervals, the midpoint of the
interval is used in place of every other value in the interval. Thus, the spread of observations
within each interval is ignored. This makes the standard deviation always less than the true
value. It should, therefore, be regarded as an approximation.
Example 5 – Standard deviation

Assuming the frequency distribution is approximately normal, calculate the interval within which
95% of the previous example's observations would be expected to occur.

= 29.82, s = 6.03

Calculate the interval using the following formula: - 2s < x < + 2s

29.82 - (2 X 6.03) < x < 29.82 + (2 X 6.03)

29.82 - 12.06 < x < 29.82 + 12.06

17.76 < x < 41.88

This means that there is about a 95% certainty that a student will spend between 18 hours and
42 hours per week watching television.

Date Modified: 2017-10-23

Top of Page

Important Notices

Advanced
Select Language ▼

We may use Cookies

FacebookTwitterPinterestLinkedIneMail a Friend

Standard Deviation and Variance

Deviation just means how far from the normal
Standard Deviation
The Standard Deviation is a measure of how spread out numbers are.

Its symbol is σ (the greek letter sigma)

The formula is easy: it is the square root of the Variance. So now you ask,

"What is the Variance?"

Variance
The Variance is defined as:

The average of the squared differences from the Mean.

To calculate the variance follow these steps:

 Work out the Mean (the simple average of the numbers)

 Then for each number: subtract the Mean and square the result
(the squared difference).
 Then work out the average of those squared differences. (Why Square?)

Example
You and your friends have just measured the heights of your dogs (in
millimeters):
The heights (at the shoulders) are: 600mm, 470mm, 170mm, 430mm and
300mm.

Find out the Mean, the Variance, and the Standard Deviation.

Your first step is to find the Mean:

Answer:

Mea
= 600 + 470 + 170 + 430 + 3005
n
= 19705
= 394

so the mean (average) height is 394 mm. Let's plot this on the chart:

Now we calculate each dog's difference from the Mean:

To calculate the Variance, take each difference, square it, and then average the
result:
Variance
σ2 = 2062 + 762 + (−224)2 + 362 + (−94)25
= 42436 + 5776 + 50176 + 1296 + 88365
= 1085205
= 21704

So the Variance is 21,704

And the Standard Deviation is just the square root of Variance, so:

Standard Deviation
σ = √21704
= 147.32...
= 147 (to the nearest mm)

And the good thing about the Standard Deviation is that it is useful. Now we can
show which heights are within one Standard Deviation (147mm) of the Mean:

So, using the Standard Deviation we have a "standard" way of knowing what is
normal, and what is extra large or extra small.

Rottweilers are tall dogs. And Dachshunds are a bit short, right?

Using

We can expect about 68% of values to be within plus-or-minus 1 standard

deviation.

Read Standard Normal Distribution to learn more.

Also try the Standard Deviation Calculator.

But ... there is a small change

with Sample Data
Our example has been for a Population (the 5 dogs are the only dogs we are
interested in).

But if the data is a Sample (a selection taken from a bigger Population), then
the calculation changes!

When you have "N" data values that are:

 The Population: divide by N when calculating Variance (like we did)

 A Sample: divide by N-1 when calculating Variance

All other calculations stay the same, including how we calculated the mean.

Example: if our 5 dogs are just a sample of a bigger population of dogs, we

divide by 4 instead of 5 like this:

Sample Variance = 108,520 / 4 = 27,130

Sample Standard Deviation = √27,130 = 165 (to the nearest mm)

Think of it as a "correction" when your data is only a sample.

Formulas
Here are the two formulas, explained at Standard Deviation Formulas if you
want to know more:

The "Population Standard Deviation":

The "Sample Standard Deviation":

Looks complicated, but the important change is to

divide by N-1 (instead of N) when calculating a Sample Variance.

*Footnote: Why square the differences?

If we just add up the differences from the mean ... the negatives cancel the
positives:

4 + 4 − 4 − 44 = 0

So that won't work. How about we use absolute values?

|4| + |4| + |−4| + |−4|4 = 4 + 4 + 4 +

44 = 4

That looks good (and is the Mean Deviation), but what about this case:
|7| + |1| + |−6| + |−2|4 = 7 + 1 + 6 +

24 = 4

Oh No! It also gives a value of 4, Even though the differences are more spread
out.
So let us try squaring each difference (and taking the square root at the end):

√(42 + 42 + 42 + 424) = √(644) = 4

√(7 + 1 + 6 +
2 2 2

2 4) = √(904) = 4.74...
2

That is nice! The Standard Deviation is bigger when the differences are more
spread out ... just what we want.
In fact this method is a similar idea to distance between points, just applied in a
different way.
And it is easier to use algebra on squares and square roots than absolute
values, which makes the standard deviation easy to use in other areas of
mathematics.
Return to Top

Question 1 Question 2 Question 3 Question 4 Question 5 Question 6 Q
uestion 7 Question 8 Question 9 Question 10
Standard Deviation FormulasStandard Deviation CalculatorStandard Normal
DistributionAccuracy and PrecisionMeanProbability and Statistics

MEASURES OF Dispersion
No ratings yet
MEASURES OF Dispersion
28 pages
MSM 111 - Binomial Expansions PP
No ratings yet
MSM 111 - Binomial Expansions PP
46 pages
Applications of Ir Spectros
No ratings yet
Applications of Ir Spectros
18 pages
Lesson 1 (Obtaining Data)
100% (1)
Lesson 1 (Obtaining Data)
7 pages
8.3 Measures of Dispersion: Standard Deviation
No ratings yet
8.3 Measures of Dispersion: Standard Deviation
19 pages
Higher-Order Derivatives
100% (1)
Higher-Order Derivatives
3 pages
2-Module 1 - Finite Fields and Number Theory-05-01-2024
No ratings yet
2-Module 1 - Finite Fields and Number Theory-05-01-2024
82 pages
Mathematics Part 1 Math Board Exam
No ratings yet
Mathematics Part 1 Math Board Exam
14 pages
Engr - Jessa Mae A. Gomez: Instructor
No ratings yet
Engr - Jessa Mae A. Gomez: Instructor
15 pages
2.3 Relation and Function
No ratings yet
2.3 Relation and Function
41 pages
Problem Solving Mathematics - Lesson 1
No ratings yet
Problem Solving Mathematics - Lesson 1
27 pages
Variance and Standard Deviation
100% (3)
Variance and Standard Deviation
15 pages
Chapter 3 Probability
No ratings yet
Chapter 3 Probability
82 pages
College Algebra
No ratings yet
College Algebra
30 pages
Calculus 2
No ratings yet
Calculus 2
2 pages
Chapter 3: Sequences and Series: 3.2 Binomial Expansion
No ratings yet
Chapter 3: Sequences and Series: 3.2 Binomial Expansion
30 pages
Es 10 PPT For Video Lecture 6
No ratings yet
Es 10 PPT For Video Lecture 6
23 pages
Unit 1: Measures of Central Tendency: Module 6: Descriptive Statistical Measures
No ratings yet
Unit 1: Measures of Central Tendency: Module 6: Descriptive Statistical Measures
10 pages
Statistics and Probability
No ratings yet
Statistics and Probability
4 pages
Binomial Expansions
No ratings yet
Binomial Expansions
10 pages
Depedpang 1
No ratings yet
Depedpang 1
127 pages
Part A - Multiple Choice: Practice Test - Trigonometry
No ratings yet
Part A - Multiple Choice: Practice Test - Trigonometry
6 pages
Business Mathematics (OBE)
No ratings yet
Business Mathematics (OBE)
10 pages
Abstract Algebra-Syllabus
No ratings yet
Abstract Algebra-Syllabus
8 pages
Mathm109-Calculus II - Module 3
No ratings yet
Mathm109-Calculus II - Module 3
20 pages
Correlational Study
No ratings yet
Correlational Study
12 pages
K0292001022011402301 Matrix
No ratings yet
K0292001022011402301 Matrix
34 pages
5 Linear Transformation of Matrices
100% (1)
5 Linear Transformation of Matrices
53 pages
Measures of The Spread of The Data (Ch2Sec7)
No ratings yet
Measures of The Spread of The Data (Ch2Sec7)
24 pages
Module 3 - Presentation of Data
No ratings yet
Module 3 - Presentation of Data
25 pages
Detailed Learning Module: Normal Distribution
No ratings yet
Detailed Learning Module: Normal Distribution
12 pages
3 - Binary Operations
No ratings yet
3 - Binary Operations
2 pages
Activity 3 - Mathematics Language and Symbols
No ratings yet
Activity 3 - Mathematics Language and Symbols
5 pages
Review in Trigo
No ratings yet
Review in Trigo
5 pages
ECE 024 Lab Activity 2 - Forms of Complex Numbers
No ratings yet
ECE 024 Lab Activity 2 - Forms of Complex Numbers
9 pages
St. Dominic Savio College College of Arts, Sciences, and Educaton SY 2018 - 2019 Curriculum Instructional Guide (Cig)
No ratings yet
St. Dominic Savio College College of Arts, Sciences, and Educaton SY 2018 - 2019 Curriculum Instructional Guide (Cig)
9 pages
Mathm109-Calculus II - Module 5
No ratings yet
Mathm109-Calculus II - Module 5
13 pages
TMIGLesson 1
No ratings yet
TMIGLesson 1
5 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
23 pages
Lesson 3. Division and Multiplication of Rational Expressions 1
No ratings yet
Lesson 3. Division and Multiplication of Rational Expressions 1
5 pages
Lectures in Math 111: Linear Algebra Lecture 1. The Inverse of A Matrix
No ratings yet
Lectures in Math 111: Linear Algebra Lecture 1. The Inverse of A Matrix
51 pages
TRIG FUNCTIONS Lesson Solving Right Triangles
No ratings yet
TRIG FUNCTIONS Lesson Solving Right Triangles
52 pages
The Central Limit Theorem
No ratings yet
The Central Limit Theorem
8 pages
3.3 Function Notation
No ratings yet
3.3 Function Notation
13 pages
Mathematical Induction PDF
No ratings yet
Mathematical Induction PDF
6 pages
Module 1 Week 1 (Functions and Relations)
No ratings yet
Module 1 Week 1 (Functions and Relations)
4 pages
Measures of Position: Percentile of Grouped Data
No ratings yet
Measures of Position: Percentile of Grouped Data
10 pages
Math 131 - Action Research in Mathematics Education
No ratings yet
Math 131 - Action Research in Mathematics Education
9 pages
Mathm109-Calculus II - Module 6
No ratings yet
Mathm109-Calculus II - Module 6
5 pages
3.3 The Inverse of A Matrix
No ratings yet
3.3 The Inverse of A Matrix
30 pages
Trig Exam 2 Review F07
No ratings yet
Trig Exam 2 Review F07
6 pages
Eigenvector and Eigenvalue
No ratings yet
Eigenvector and Eigenvalue
6 pages
Republic of The Philippines Salvacion, Daraga, Albay A.Y. 2020 - 2021
No ratings yet
Republic of The Philippines Salvacion, Daraga, Albay A.Y. 2020 - 2021
3 pages
Variance
No ratings yet
Variance
6 pages
Notes On Mathematical Expectation
No ratings yet
Notes On Mathematical Expectation
6 pages
Adjoint and Inverse Matrices Linear Equations
No ratings yet
Adjoint and Inverse Matrices Linear Equations
9 pages
33 Simplified Radical Form
No ratings yet
33 Simplified Radical Form
11 pages
SPS 2320 Theory of Estimation Year 3 Semester II
100% (1)
SPS 2320 Theory of Estimation Year 3 Semester II
2 pages
Business Plan Ladaran (Autosaved)
No ratings yet
Business Plan Ladaran (Autosaved)
12 pages
Statistics - Linear Regression - Correlation Worksheet PDF
No ratings yet
Statistics - Linear Regression - Correlation Worksheet PDF
2 pages
Introduction To Probability and Statistics (IPS) : Endterm
No ratings yet
Introduction To Probability and Statistics (IPS) : Endterm
16 pages
Day 8 Mixture Problems
No ratings yet
Day 8 Mixture Problems
7 pages
Box-and-Whisker Plot: 1. The Box Plots Show The Times For 15 Boys and 15 Girls To Run 100 M
No ratings yet
Box-and-Whisker Plot: 1. The Box Plots Show The Times For 15 Boys and 15 Girls To Run 100 M
4 pages
Standard Deviation
No ratings yet
Standard Deviation
9 pages
Course Title: Business Statistics Course Code: Qam 103 Credit Unit: 03 Course Level: Ug
No ratings yet
Course Title: Business Statistics Course Code: Qam 103 Credit Unit: 03 Course Level: Ug
4 pages
Unit 3
No ratings yet
Unit 3
20 pages
3-Representation of Data
No ratings yet
3-Representation of Data
12 pages
Today Final Test Dec 2023 - 27-12
No ratings yet
Today Final Test Dec 2023 - 27-12
12 pages
Data Kelompok 4 Statistik
No ratings yet
Data Kelompok 4 Statistik
5 pages
Selina Solutions Concise Maths Class 10 Chapter 24
No ratings yet
Selina Solutions Concise Maths Class 10 Chapter 24
22 pages
Geometric Increase Method
No ratings yet
Geometric Increase Method
11 pages
Radioactive Tracer: Language Watch Edit
No ratings yet
Radioactive Tracer: Language Watch Edit
6 pages
Normal Distribution PDF
No ratings yet
Normal Distribution PDF
25 pages
PH (Titration) Curves: The Equivalence Point of A Titration
No ratings yet
PH (Titration) Curves: The Equivalence Point of A Titration
5 pages
Moore
100% (1)
Moore
13 pages
Measures of Spread or Dispersion (Mean, Variance, and Standard Deviation) Topic Contents
No ratings yet
Measures of Spread or Dispersion (Mean, Variance, and Standard Deviation) Topic Contents
9 pages
Confidence Intervals For Variances and Standard Deviations
No ratings yet
Confidence Intervals For Variances and Standard Deviations
2 pages
STAT - 101 - TUTORIAL - 4 Solutions
No ratings yet
STAT - 101 - TUTORIAL - 4 Solutions
10 pages
Lind 19e Chap003 PPT Accessible
No ratings yet
Lind 19e Chap003 PPT Accessible
46 pages
What Is A PET Scan?: Purpose PET Scan vs. Other Tests Risks Preparation Procedure Follow-Up and Results
No ratings yet
What Is A PET Scan?: Purpose PET Scan vs. Other Tests Risks Preparation Procedure Follow-Up and Results
13 pages
Radioisotope: Applications, Effects, and Occupational Protection
No ratings yet
Radioisotope: Applications, Effects, and Occupational Protection
33 pages
Aqa Ss1a W QP Jun07
No ratings yet
Aqa Ss1a W QP Jun07
4 pages
Be Electrical Engineering Semester 3 2023 May Engineering Mathematics III m3 Pattern 2019
No ratings yet
Be Electrical Engineering Semester 3 2023 May Engineering Mathematics III m3 Pattern 2019
5 pages
PGM 1
No ratings yet
PGM 1
5 pages
Stat
No ratings yet
Stat
3 pages
Assignment 1
No ratings yet
Assignment 1
4 pages
SPECT Scan: Menu Search
No ratings yet
SPECT Scan: Menu Search
8 pages
General Instrumentation: Icp-Ms
No ratings yet
General Instrumentation: Icp-Ms
6 pages
Business Statistics
No ratings yet
Business Statistics
13 pages
Alevel Ut s1 U2 Test
No ratings yet
Alevel Ut s1 U2 Test
5 pages
Testul 6
No ratings yet
Testul 6
10 pages
Ial Maths s1 Review Exercise 1 Ans
No ratings yet
Ial Maths s1 Review Exercise 1 Ans
5 pages
Assignment For Statistics
No ratings yet
Assignment For Statistics
3 pages
LA 05 - MLS 054 For SAS 11 - Version 2
No ratings yet
LA 05 - MLS 054 For SAS 11 - Version 2
4 pages
Statistics Assignment 1
No ratings yet
Statistics Assignment 1
4 pages
BBS Assignment 2 Final
No ratings yet
BBS Assignment 2 Final
5 pages
Problems 1
No ratings yet
Problems 1
2 pages

Variance and Standard Deviation

Uploaded by

Variance and Standard Deviation

Uploaded by

Statistics Canada

 Statistics: Power from Data!

Variance and standard deviation

Information identified as archived is provided for reference, research or recordkeeping

 Properties of standard deviation

[(1 - 2)2 + (2 - 2)2 + (3 - 2)2] ÷ 3 = 0.667

[squaring deviation from the mean] ÷ number of observations = variance

Variance (S2) = average squared deviation of values from mean

Standard deviation (S) = square root of the variance

Properties of standard deviation

When using standard deviation keep in mind the following properties.

 about 68% of the data lie in the interval: - S < x < + S.

The variance for a discrete variable made up of n observations is defined as:

1. Calculate the mean.

Example 1 – Standard deviation

a. First, calculate the mean:

b. Now, find the standard deviation.

Table 1. Weight of eggs, in

Frequency table (discrete variables)

In a frequency table, the variance for a discrete variable is defined as

The standard deviation for a discrete variable is defined as

Example 2 – Standard deviation calculated using a frequency table

Workers (x) Tally Frequency (f) (xf) (x - ) (x - )2 (x - )2f

To calculate the mean:

To calculate the standard deviation:

Table 3. Number of hours per

Hours Number of students

Add them to the frequency table below.

Table 4. Number of hours spent watching television

Hours Midpoint (x) Frequency (f) xf (x - ) (x - )2 (x - )2f

10 to 14 12 2 24 -17.82 317.6 635.2

15 to 19 17 12 204 -12.82 164.4 1,972.8

20 to 24 22 23 506 -7.82 61.2 1,407.6

25 to 29 27 60 1,620 -2.82 8.0 480.0

30 to 34 32 77 2,464 2.18 4.8 369.6

35 to 39 37 38 1,406 7.18 51.6 1,960.8

40 to 44 42 8 336 12.18 148.4 1,187.2

220 6,560 8,013.2

Example 4 – Standard deviation

Calculate the interval using the following formula: - 2s < x < + 2s

29.82 - (2 X 6.03) < x < 29.82 + (2 X 6.03)

29.82 - 12.06 < x < 29.82 + 12.06

17.76 < x < 41.88

Standard Deviation and Variance

Its symbol is σ (the greek letter sigma)

The formula is easy: it is the square root of the Variance. So now you ask,

The average of the squared differences from the Mean.

To calculate the variance follow these steps:

 Work out the Mean (the simple average of the numbers)

Your first step is to find the Mean:

Now we calculate each dog's difference from the Mean:

So the Variance is 21,704

Rottweilers are tall dogs. And Dachshunds are a bit short, right?

We can expect about 68% of values to be within plus-or-minus 1 standard

Read Standard Normal Distribution to learn more.

Also try the Standard Deviation Calculator.

But ... there is a small change

When you have "N" data values that are:

 The Population: divide by N when calculating Variance (like we did)

Example: if our 5 dogs are just a sample of a bigger population of dogs, we

Sample Variance = 108,520 / 4 = 27,130

Think of it as a "correction" when your data is only a sample.

The "Sample Standard Deviation":

Looks complicated, but the important change is to

*Footnote: Why square the differences?

So that won't work. How about we use absolute values?

|4| + |4| + |−4| + |−4|4 = 4 + 4 + 4 +

√(42 + 42 + 42 + 424) = √(644) = 4

You might also like