0% found this document useful (0 votes)

30 views

Lecture 4

1) Variability refers to how different or similar the values in a data set are from each other. The more dissimilar the values, the higher the variability. 2) Range is a simple measure of variability that tells the span between the highest and lowest values. However, it does not fully capture variability. 3) Variance and standard deviation are better measures that account for how far all values are from the mean by squaring the differences. This prevents values from cancelling out.

Uploaded by

addis zewd

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

30 views

Lecture 4

Uploaded by

addis zewd

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 38

Variation

• Variability: The extent numbers in a data set are

dissimilar (different) from each other.
• When all elements measured receive the same
scores (e.g., everyone in the data set is the same
age, in years), there is no variability in the data set.
• As the scores in a data set become more
dissimilar, variability increases.
Variation: Range
• The range tells us the span over which the data are
distributed, and is only a very rough measure of
variability.
• Range: The difference between the maximum and
minimum scores.
Example: The youngest student in a class is 19 and the
oldest is 46. Therefore, the age range of the class is 46
– 19 = 27 years.
X X X
5 0.00 This is an example of data
5 0.00 with NO variability
5 0.00
5 0.00
5 0.00

 X= 25 n=5 X =5
X X X
6 +1.00 This is an example of data
4 -1.00 with low variability
6 +1.00
5 0.00
4 -1.00

 X= 25 n=5 X =5
X X X
8 +3.00 This is an example of data
1 -4.00 with higher variability
9 +4.00
5 0.00
2 -3.00

 X= 25 n=5 X =5
Note:
• Let’s say we wanted to figure out the average
deviation from the mean. Normally, we would want
to sum all deviations from the mean and then divide
by n, i.e.,
 X  X 
n

• BUT: We have a problem. ( X  X ) will always add

up to zero
• However, if we square each of the deviations from
the mean, we obtain a sum that is not equal to
zero.
• This is the basis for the measures of variance and
standard deviation, the two most common
measures of variability of data.
X XX X  X 
2

8 +3.00 9.00
1 -4.00 16.00
9 +4.00 16.00
5 0.00 0.00
2 -3.00 9.00
 X = 25  X  X  = 0.00
 
2 = 50.00
XX

Note: The  X  X 2 is called the Sum of Squares.

Variance of a Population
• VARIANCE OF A POPULATION: the sum of
squared deviations from the mean divided by the
number of scores (sigma squared):

 X   
2
  2

n
Population Standard Deviation
Square root of the variance 2

 X   
2

n
Sample Variance
• The sum of squared deviations from the mean
divided by the number of degrees of freedom (an
estimate of the population variance, n-1)

s 
2  X x  2

n 1
Sample Standard Deviation
• Square root of the variance s2

s  X  x  2

n 1
Why use Standard Deviation and not
Variance!??!
• Normally, you will only calculate variance in
order to calculate standard deviation, as standard
deviation is what we typically want.

• Why? Because standard deviation expresses

variability in the same units as the data.

• Example: Standard deviation of ages in a class is

3.7 years.
Degrees of Freedom
• Degrees of Freedom: The number of
independent observations, or, the number of
observations that are free to vary.
• In our data example above, there are 5
numbers that total 25 (  X = 25, n = 5)
Degrees of Freedom
• Many combinations of numbers can total 25, but only the
first 4 can be any value.
• The 5th number cannot vary if  X = 25
• This example has 4 degrees of freedom, as four of the
five numbers are free to vary.
• Sample standard deviation usually underestimates
population standard deviation.
• Using n-1 in the denominator corrects for this and gives
us a better estimate of the population standard deviation.
Normal Distribution
• The normal distribution is a theoretical
distribution.
• “Normal” does not mean typical or average, it is a
technical term given to this mathematical
function.
• The normal distribution is unimodal and
symmetrical, and is often referred to as the Bell
Curve.
Normal Distribution

Mean
Median
Mode
Normal Distribution
• We study the normal distribution because many
naturally occurring events yield a distribution
that approximates the normal distribution.
Properties of Area Under the Normal
Distribution
• One of the properties of the Normal Distribution
is the fixed area under the curve.
• If we split the distribution in half, 50% of the
scores of the sample lie to the left of the mean (or
median, or mode), and 50% of the scores lie to
the right of the mean (or median, or mode).
• The mean, median, and mode always cut the
Normal Distribution in half, and are equal since
the Normal Distribution is unimodal and
symmetrical.
50% of 50% of
scores scores

Mean, Median, Mode

• The entire area under the normal curve can be
considered to be a proportion of 1.0000.
• Thus, half, or .5000 of the scores lie in the bottom
half (i.e., left of the mean) of the distribution, and
half, or .5000 of the scores lie in the top half (i.e.,
right of the mean).
.5000 of .5000 of
scores scores

Mean, Median, Mode

Z-scores
• Z-Scores (or standard scores) are a way of
expressing a raw score’s place in a distribution.

• Z-score formula:

X 
z

• The mean  and standard deviation  are
always notated in Greek letters.

• Z-scores only reflect the data points’ position relative to

the overall data set (so you’re now considering the data
as a population, as you’re not looking to infer to a
greater population).

• This means use the population formula for standard

deviation rather than the sample formula whenever you
calculate Z.
• A z-score is a better indicator of where your score
falls in a distribution than a raw score.
• A student could get a 75/100 on a test (75%) and
consider this to be a very high score.
• If the average of the class marks is 89 and the
(population) standard deviation is 5.2, then the z-score
for a mark of 75 would be:
 89 
X 
= = 5.2

z = (75-89)/5.2 z
z = (-14)/5.2
z = -2.69

• This means that a mark of 75% is actually 2.69
standard deviations BELOW the mean.
• The student would have done poorly on this test,
as compared to the rest of the class.
• z = 0 represents the mean score (which would be
89 in this example).
• z < 0 represents a score less than the mean (which
would be less than 89).
• z > 0 represents a score greater than the mean
(which would be greater than 89).
• A z-score expresses the position of the raw score
above or below the mean in standard deviation
sized units.
• E.g.,
z = +1.50 means that the raw score is 1 and one-half
standard deviations above the mean.
z = -2.00 means that the raw score is 2 standard
deviations below the mean.
Z-score Example
• If you write two exams, in Math and English, and
get the following scores:

Math 70% (class = 55,  = 10)
English 60% (class  = 50,  = 5)
• Which test mark represents the better performance
(relative to the class)?
• Math mark:
z = (70-55)/10
z = +1.50
• English mark:
z = (60-50)/5 X 
z = +2.00 z

Z-score Example Illustration

Mean
Z=0.00 Z=1.50 Z=2.00
The Answer
• Because: Z = +2.00 is greater than Z = +1.50, the
English class mark of 60% reflects a better
performance relative to that class than does the
Math class mark of 70%.
Z-score: Solving for X
• The z-score formula can be rearranged to solve
for X:

X   X  (z)( )  
z

• This formula is used when you know the z-score
of a data point, and want to solve for the raw
score.
Example
• E.g., if a class midterm exam has  = 65 and  = 5,
what exam mark has a z-score value of 1.25?
X = (1.25)(5) + 65
X  (z)( )   = 6.25 + 65
= 71.25

So, a person whose test is 1.25 standard deviations above the

mean obtained a score of 71.25%.
Skew Distributions
• Outliers skew distributions.
• If group has one high score,
the curve has a positive
skew (contains more low
scores)
• If a group has a low outlier,
the curve has a negative
skew (contains more high
scores)

Science Stage 5 Workbook Answers
80% (10)
Science Stage 5 Workbook Answers
15 pages
Examples Biostatistics. Final
No ratings yet
Examples Biostatistics. Final
90 pages
Measures of Dispersion and Relative Standing
No ratings yet
Measures of Dispersion and Relative Standing
11 pages
History Reporting
No ratings yet
History Reporting
61 pages
4th Chap Variability
No ratings yet
4th Chap Variability
24 pages
Univariate Statistics
No ratings yet
Univariate Statistics
7 pages
Lecture III-Measures of Dispersion
No ratings yet
Lecture III-Measures of Dispersion
33 pages
Measures of Variability Lec 7: DR - Nesrin H. Darwesh University of Duhok-College of Dentistry
No ratings yet
Measures of Variability Lec 7: DR - Nesrin H. Darwesh University of Duhok-College of Dentistry
48 pages
Univariate Statistics
No ratings yet
Univariate Statistics
4 pages
Click To Add Text Dr. Cemre Erciyes
No ratings yet
Click To Add Text Dr. Cemre Erciyes
69 pages
AP ECON 2500 Session 2
No ratings yet
AP ECON 2500 Session 2
22 pages
Ge 4 - Topic 2-Statistics
No ratings yet
Ge 4 - Topic 2-Statistics
8 pages
Lecture 3 Notes - PSYC 204
No ratings yet
Lecture 3 Notes - PSYC 204
8 pages
m2 2 Variation Z Scores
No ratings yet
m2 2 Variation Z Scores
18 pages
Psych Stat Reviewer print
No ratings yet
Psych Stat Reviewer print
2 pages
Math-7 FLDP Quarter-4 Week-7
No ratings yet
Math-7 FLDP Quarter-4 Week-7
7 pages
Measures of Variability
No ratings yet
Measures of Variability
20 pages
Measure of Dispersion
No ratings yet
Measure of Dispersion
32 pages
LESSON 4 MMW Data Management
No ratings yet
LESSON 4 MMW Data Management
104 pages
Basic Maths23su
No ratings yet
Basic Maths23su
42 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
8 pages
EFM 515 Stats Lecture Notes
No ratings yet
EFM 515 Stats Lecture Notes
104 pages
Measures of Variation and Z Scores
No ratings yet
Measures of Variation and Z Scores
20 pages
Variability: Educational Statistics EDU 5950 by Chan Yoke Bee (GS37395)
No ratings yet
Variability: Educational Statistics EDU 5950 by Chan Yoke Bee (GS37395)
26 pages
Advanced Statistics DISPERSION & NORMAL CURVE
No ratings yet
Advanced Statistics DISPERSION & NORMAL CURVE
29 pages
Statistics_Probability_Week_4(2)
No ratings yet
Statistics_Probability_Week_4(2)
16 pages
Analysis Interpretation and Use of Test Data
No ratings yet
Analysis Interpretation and Use of Test Data
50 pages
Arslan Ahmed: Time: Date
No ratings yet
Arslan Ahmed: Time: Date
62 pages
Measures of Central Tendency and Dispersion/ Variability
No ratings yet
Measures of Central Tendency and Dispersion/ Variability
35 pages
Location) .: Distribution Is The Purpose of Measure of Central
No ratings yet
Location) .: Distribution Is The Purpose of Measure of Central
13 pages
Ed216 Chapter 7
No ratings yet
Ed216 Chapter 7
31 pages
BS Lect 05
No ratings yet
BS Lect 05
35 pages
4x @6ote ) 'Btda2@m
No ratings yet
4x @6ote ) 'Btda2@m
55 pages
Statistics-in-Education6
No ratings yet
Statistics-in-Education6
460 pages
Lecture of BIOSTATISTICS 12.2022 RMDC
No ratings yet
Lecture of BIOSTATISTICS 12.2022 RMDC
85 pages
Statistics and Statistic
No ratings yet
Statistics and Statistic
11 pages
Measures of Central Tendency and Variability
No ratings yet
Measures of Central Tendency and Variability
38 pages
Business Statistics: Session 2
No ratings yet
Business Statistics: Session 2
60 pages
Lesson 6c, 7, 8-Print
No ratings yet
Lesson 6c, 7, 8-Print
5 pages
Statistics 1 Revision Sheet
No ratings yet
Statistics 1 Revision Sheet
9 pages
Measures of the Spread of the Data (Ch2Sec7)
No ratings yet
Measures of the Spread of the Data (Ch2Sec7)
24 pages
Basic Statistics Terms and Calculations
No ratings yet
Basic Statistics Terms and Calculations
4 pages
Random Variables and Probability Distribution
No ratings yet
Random Variables and Probability Distribution
26 pages
measures of dispersion updated
No ratings yet
measures of dispersion updated
38 pages
Math in The Modern World Stat Lecture
No ratings yet
Math in The Modern World Stat Lecture
3 pages
Nordis Final
No ratings yet
Nordis Final
6 pages
Chapter 4 Measures of Variability
No ratings yet
Chapter 4 Measures of Variability
26 pages
Statistics Slide Notes - Lecture 3-8
No ratings yet
Statistics Slide Notes - Lecture 3-8
104 pages
المحاضرة الثالثة
No ratings yet
المحاضرة الثالثة
16 pages
Measure of Dispersion Kurtosi, Skiwness
No ratings yet
Measure of Dispersion Kurtosi, Skiwness
22 pages
Statistics Chapter-IV
No ratings yet
Statistics Chapter-IV
59 pages
Lm#4c-Measures of Variability
No ratings yet
Lm#4c-Measures of Variability
4 pages
Measures of Variation
No ratings yet
Measures of Variation
30 pages
P102 Lesson 4
No ratings yet
P102 Lesson 4
24 pages
Surgical Safety Checklist
No ratings yet
Surgical Safety Checklist
103 pages
Biostatistics Revision Dr.nj
No ratings yet
Biostatistics Revision Dr.nj
67 pages
Variance and Standard Deviation
100% (3)
Variance and Standard Deviation
15 pages
Measure of Central Tendency and Variability
No ratings yet
Measure of Central Tendency and Variability
73 pages
GCSE Maths Revision: Cheeky Revision Shortcuts
From Everand
GCSE Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (2)
Correlation and Regression: Six Sigma Thinking, #8
From Everand
Correlation and Regression: Six Sigma Thinking, #8
Sumeet Savant
5/5 (1)
SAT Math: Master the Skills in 40 Pages
From Everand
SAT Math: Master the Skills in 40 Pages
Jennifer L Johnson
No ratings yet
Easy_Sanitary_Riser_Diagram
No ratings yet
Easy_Sanitary_Riser_Diagram
2 pages
M01 Application Soft Ware
100% (1)
M01 Application Soft Ware
67 pages
M02-Bill of Quantities
No ratings yet
M02-Bill of Quantities
72 pages
Material & Tool
No ratings yet
Material & Tool
2 pages
Chapter 1 and 2 Advanced Career and Rehabilitation Counseling
No ratings yet
Chapter 1 and 2 Advanced Career and Rehabilitation Counseling
21 pages
Unit One 1.1 Operating Three Unit Objective:-At The End of This Unit, The Students Should Be
No ratings yet
Unit One 1.1 Operating Three Unit Objective:-At The End of This Unit, The Students Should Be
99 pages
Supplemental Book: Tinsae Holistic School
No ratings yet
Supplemental Book: Tinsae Holistic School
2 pages
English Reading: Supplemental Book
No ratings yet
English Reading: Supplemental Book
2 pages
Chapter 3 - Cement Hydration and AAR in Concrete
100% (2)
Chapter 3 - Cement Hydration and AAR in Concrete
65 pages
Chapter 2 - Advanced Construction Materials
No ratings yet
Chapter 2 - Advanced Construction Materials
43 pages
Le texte intégral du discours de Barack Obama, en anglais
No ratings yet
Le texte intégral du discours de Barack Obama, en anglais
1 page
Teamworking Skills
100% (1)
Teamworking Skills
66 pages
CHILD AND DEVELOPMENT PPT 2
No ratings yet
CHILD AND DEVELOPMENT PPT 2
13 pages
Ang Mahiwagang Saklay
No ratings yet
Ang Mahiwagang Saklay
32 pages
Lesson Plan. CN. 8-10
No ratings yet
Lesson Plan. CN. 8-10
1 page
5. Peterson & Kern. Changing highbrow taste, from snob to omnivore
No ratings yet
5. Peterson & Kern. Changing highbrow taste, from snob to omnivore
9 pages
[Ebooks PDF] download Group Dynamics for Teams 5th Edition (eBook PDF) full chapters
100% (1)
[Ebooks PDF] download Group Dynamics for Teams 5th Edition (eBook PDF) full chapters
40 pages
Emotional Intelligence Scale (EIS)
No ratings yet
Emotional Intelligence Scale (EIS)
7 pages
Intrapersonal and Interpersonal Communication
No ratings yet
Intrapersonal and Interpersonal Communication
2 pages
Data Integrity Template
No ratings yet
Data Integrity Template
4 pages
SAARC
No ratings yet
SAARC
17 pages
Andreas Birk Holger Keno - : - Vrije Universiteit Brussel AI-Lab
No ratings yet
Andreas Birk Holger Keno - : - Vrije Universiteit Brussel AI-Lab
6 pages
Renewable Energy: J. Guerrero-Perez, E. de Jodar, E. Gómez-Lázaro, A. Molina-Garcia
No ratings yet
Renewable Energy: J. Guerrero-Perez, E. de Jodar, E. Gómez-Lázaro, A. Molina-Garcia
11 pages
CW CONCEPTUALIZATION
No ratings yet
CW CONCEPTUALIZATION
40 pages
Control of Sales Force
No ratings yet
Control of Sales Force
28 pages
Review Teknik Granulasi
No ratings yet
Review Teknik Granulasi
6 pages
SIM - Unit 1 PDF
No ratings yet
SIM - Unit 1 PDF
11 pages
Sistem Informasi Kredit Program (SIKP)
No ratings yet
Sistem Informasi Kredit Program (SIKP)
11 pages
Iq Test Analysis
No ratings yet
Iq Test Analysis
16 pages
Education MSC in Analytical Chemistry MS
No ratings yet
Education MSC in Analytical Chemistry MS
3 pages
Digital Project Development PDF
No ratings yet
Digital Project Development PDF
282 pages
Invitation Letter 22
No ratings yet
Invitation Letter 22
4 pages
Estimation and Hypothesis Testing of Cointegration Vectors in Gaussian Vector Auto Regressive Models
No ratings yet
Estimation and Hypothesis Testing of Cointegration Vectors in Gaussian Vector Auto Regressive Models
31 pages
Resumeladewski
No ratings yet
Resumeladewski
1 page
Bk2 English
No ratings yet
Bk2 English
118 pages
Chap08 8th
No ratings yet
Chap08 8th
63 pages
Chen Et Al 2012
No ratings yet
Chen Et Al 2012
7 pages
Resume: of MD: Saddam Hossain Personal Information
No ratings yet
Resume: of MD: Saddam Hossain Personal Information
3 pages
1.fault Related Folding
100% (1)
1.fault Related Folding
19 pages

Lecture 4

Uploaded by

Lecture 4

Uploaded by

Variation

• Variability: The extent numbers in a data set are

• BUT: We have a problem. ( X  X ) will always add

Note: The  X  X 2 is called the Sum of Squares.

• Why? Because standard deviation expresses

• Example: Standard deviation of ages in a class is

Mean, Median, Mode

Mean, Median, Mode

• Z-scores only reflect the data points’ position relative to

• This means use the population formula for standard

So, a person whose test is 1.25 standard deviations above the

You might also like