0% found this document useful (0 votes)

80 views76 pages

Unit II

The document discusses various statistical measures used to describe the dispersion or spread of data in a distribution, including range, interquartile range, quartiles, deciles, and percentiles. It provides formulas and explanations for calculating each measure, whether for an ungrouped or grouped data set. The key measures of dispersion discussed are range, interquartile range, standard deviation, and coefficient of variation.

Uploaded by

Vidhi Maheshwari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

80 views76 pages

Unit II

Uploaded by

Vidhi Maheshwari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 76

Unit II

Descriptive Statistics:
Measures of Variance
(Standard Deviation for Sample & Population),
and Measure of Skewness
For which of the following distributions is the mean a true
representative of the data as a whole? Why?
Dispersion

Dispersion is the spread of the data in a distribution, that is, the

extent to which the observations are scattered
Dispersion:
Dispersion is the spread of the data in a distribution, that is, the extent to
which the observations are scattered. Notice that curve ….. in Figure has a
wider spread, or dispersion, than curve …..
• A company has 25 salespeople in the field, and the median annual sales figure for
these people is $1.2 million.

• Are the salespeople being successful as a group or not?

• The median provides information about the sales of the person in the middle, but
what about the other salespeople?

• Are all of them selling $1.2 million annually, or do the sales figures vary widely,
with one person selling $5 million annually and another selling only $150,000
annually?

• .
RANGES: USEFUL MEASURES OF DISPERSION
• The range is the difference between the largest value of a data set and the smallest
value of a set

• An advantage of the range is its ease of computation.

• One important use of the range is in quality assurance, where the range is used to
construct control charts.

• A disadvantage of the range is that, because it is computed with the values that are on
the extremes of the data, it is affected by extreme values, and its application as a
measure of variability is limited.
Interquartile Range

• Another measure of variability is the interquartile range.

• The interquartile range is the range of values between the first and third quartile.

• Essentially, it is the range of the middle 50% of the data and is determined by computing
the value of Q3 - Q1.

• The interquartile range is especially useful in situations where data users are more
interested in values toward the middle and less interested in extremes.
• In describing a real estate housing
market, Realtors might use the
interquartile range as a measure of
housing prices when describing the
middle half of the market for buyers
who are interested in houses in the
midrange.

• In addition, the interquartile range is

used in the construction of box-and-
whisker plots.
Quartiles
Quartiles are the set of values which has three points dividing the data set into four identical parts

he middle part of the three quarters measures the central point of distribution and shows the data which are near
to the central point. The lower part of the quarters indicates just half information set which comes under the
median and the upper part shows the remaining half, which falls over the median. In all, the quartiles depict the
distribution or dispersion of the data set.
Ungrouped data

Q1 = [(n+1)/4]th item

Q2 = [(n+1)/2]th item

Q3 = [3(n+1)/4]th item

Grouped data

Where, Qr is the rth quartile

• l1 is the lower limit
• l2 is the upper limit
• f is the frequency
• c is the cumulative frequency of the class preceding the quartile
class.
Quartile Deviation

• Quartile deviation is defined as half of

the distance between the third and the
first quartile.

• It is also called Semi Interquartile range.

• If Q1 is the first quartile and Q3 is the

third quartile, then the formula for
deviation is given by;
Decile
• The term “decile” refers to the nine values that split the population data into ten equal
fragments such that each fragment is representative of 1/10th of the population.
• The concept of decile because it is widely used in the field of portfolio management to
assess the performance of a portfolio. The ranking helps to compare the performance of
an asset with other similar assets.
• The decile method is also used by the government to determine the income distribution or
level of income equality in a nation.
Ungrouped data

D1 = [(n+1)/10]th item

D2 = [2(n+1)/10]th item

D9 = [9(n+1)/10]th item

The rth Decile (a measure of the relative standing of an observation) for
grouped data is

Where, Dr is the rth Decile

• l1 is the lower limit
• l2 is the upper limit
• f is the frequency
• c is the cumulative frequency of the class preceding the percentile
class.
Percentile

• Percentiles tell you how a value compares to other values. The general rule is
that if value X is at the kth percentile, then X is greater than K% of the
values.

• Percentiles are a measure of the

relative standing of observation
within a data. Percentiles divide a
set of observations into 100 equal
parts, and percentile scores are
frequently used to report results
from national standardized tests
such as NAT, GAT, etc.
• Note that 50th percentile is the median by definition as half
of the values in the data are smaller than the median and
half of the values are larger than the median.

• Similarly, 25th and 75th percentiles are the lower (Q1) and

upper quartiles (Q3) respectively.

• The quartiles, deciles, and percentiles are also

called quantiles or fractiles.
Ungrouped data

P1 = [(n+1)/100]th item

P2 = [2(n+1)/100]th item

P99 = [99(n+1)/100]th item

The rth percentile (a measure of the relative standing of an observation) for
grouped data is

Where, Pr is the rth percentile

• l1 is the lower limit
• l2 is the upper limit
• f is the frequency
• c is the cumulative frequency of the class preceding the percentile
class.
Standard Deviation
• One of the most common methods of determining the risk an investment
poses is standard deviation.
• When prices move wildly, standard deviation is high, meaning an investment
will be risky.
• Low standard deviation means prices are calm, so investments come with
low risk.
Example 01 of Standard Deviation Using Investments
• Let’s say you invest in Company XYZ which has returned an average of 10% per year
for the last 10 years. We’ll compare how risky this stock is compared to Company ABC.
We’ll take a closer look at the year-by-year returns that compose that average:
XYZ's returns

SD of XYZ stock 20.68%.

ABC's returns

SD of ABC stock is a much lower 0.0129 or 1.29%

What Is a Good Standard Deviation?

• There isn’t a standard benchmark of

what is considered a “good”
standard deviation –

• it all depends on your investing

goals. For someone who wants to be
less risky with their portfolio, a high
standard deviation would be
considered “bad”, whereas someone
who desires to be more aggressive
would consider it “good”.
Advantages and Disadvantages of Standard Deviation?

Standard Deviation
Advantages Disadvantages
•Shows how much data is clustered •It doesn't give you the full range of the
around a mean value data
•It gives a more accurate idea of how the •Only used with data where an
data is distributed independent variable is plotted against
•Not as affected by extreme values the frequency of it
•Assumes a normal distribution pattern
Empirical Rule of Standard Deviation?[Three Sigma Rule or the 68-
95-99.7 ]
• The Empirical Rule states that 99.7% of data observed following a normal distribution lies
within 3 standard deviations of the mean.
• Under this rule, 68% of the data falls within one standard deviation, 95% percent within
two standard deviations, and 99.7% within three standard deviations from the mean.
Sample and Population Standard Deviation?

• The formula we use for standard deviation depends on whether the

data is being considered a population of its own, or the data is a
sample representing a larger population.

• If the data is being considered a population on its own, we divide by

the number of data points, NNN.

• If the data is a sample from a larger population, we divide by one

fewer than the number of data points in the sample, n-1n−1n, minus,
1.
Population Standard Deviation?

• σ 2 = population variance
• σ = population standard deviation
• f = frequency of each of the classes
• x = midpoint for each class
• μ = population mean
• N= size of the population
Sample Standard Deviation?

√ ∑ 𝑓 (𝑥−𝑥) 2

𝑠=
𝑛−1
Coefficient of Variation
• The coefficient of variation (CV) is a statistical measure of the dispersion of data points in a data series
around the mean.
• The coefficient of variation represents the ratio of the standard deviation to the mean, and it is a
useful statistic for comparing the degree of variation from one data series to another, even if the
means are drastically different from one another.
Problem

iv Determine the Coefficient of Quartile Deviation

v Determine the 8th Decile
vi Compute Variance
Problem
Problems
Problems
• The following table gives the amount of time (in minutes) spent on the internet each
evening by a group of 56 students. Compute five number summary for the following
frequency distribution.
Time spent on 10-12 13-15 16-18 19-21 22-24
Internet (x)

No. of 3 12 15 24 2
students (f)
Compute for the following frequency distribution

1. Coefficient of Quartile Deviation

2. 7th Decile
3. Variance
4. 71th Percentile
5. Standard Deviation
6. Variance
7. Coefficient of Variation
Practice Problems

• The following data represent the

difference in scores between the
winning and losing teams in a sample of Point Number of Bowl
Difference Games
15 college football bowl games from
1-5 8
2004-2005. 6 - 10 0
11 - 15 2
16 - 20 3
Compute for the following frequency distribution 21 - 25 1
1. Coefficient of Quartile Deviation 26 - 30 0
31 - 35 1
2. 7th Decile
3. Variance
4. 71th Percentile
5. Standard Deviation
6. Variance
7. Coefficient of Variation
Q.03 A study of the age of 100 persons grouped into intervals 20-22,22-24, 24-
26….. Revealed the mean age and standard deviation to be 32.02 and13.18
respectively. While checking, it was discovered that the observation 57 was
misread as 27. Calculate the correct mean age and SD.
Problem

Q.03 A study of the age of 100 persons grouped into intervals 20-22,22-24, 24-
26….. Revealed the mean age and standard deviation to be 32.02 and13.18
respectively. While checking, it was discovered that the observation 57 was
misread as 27. Calculate the correct mean age and SD.
Problem

Q.04 The mean of 5 observations is 15 and the variance is 9. If two more

observations having values -3 and 10 are combined with these 5
observations, what will be the new mean and variance of 7 observations.
Practice Problems

• The mean and standard deviation of 20 items are found to be 10 and 2 respectively. At the time of checking it was
found that an item 12 was wrongly entered as 8. Calculate the correct mean and standard deviation.

• Mean of 100 items is 48 and their standard deviation is 10. Find the sum of all the items and the sum of the squares
of all the items.

• A student obtained the mean and the standard deviation of 100 observations as 40 and 5.1. It was later found that
one observation was wrongly copied as 50, the correct figure being 40. Find the correct mean and the S.D

• The mean and variance of seven observations are 8 and 16 respectively. If five of these are 2, 4, 10, 12 and 14, then
find the remaining two observations.

• For a group of 100 candidates the mean and standard deviation of their marks were found to be 60 and 15
respectively. Later on it was found that the scores 45 and 72 were wrongly entered as 40 and 27. Find the correct
mean and standard deviation
Skewness
• Skewness means “Lack of Symmetry.

• When curve is not symmetrical, the values of Mean, Mode and Mean fall at different
points. The curve may shift its bulk of the bell-shape either to the right or left of the Mean
Value. These are called skewness to the left or right of the mean.
Karl Pearson’s coefficient of skewness
Problem
Calculate karl Pearson Coefficient of Skewness for a distribution
having mean=3.41, median=3.4 and standard deviation =0.70
Sk=(3(3.41-3.4))/0.70
Sk=0.03/0.70
Sk=0.043
Problem
Calculate karl Pearson Coefficient of Skewness for a distribution
having mean=75, median=80 and standard deviation =20
Sk=(3(75-80))/20
Sk=-15/20
Sk=-0.75
• Karl Pearson Coefficient of skewness of a distribution is 0.32. Its s.d. is
6.5 and the mean is 29.6.Find the mode and median of the
distribution.
Problem
Calculate the Pearson’s coefficient of skewness based on Mean and Mode
from the following information.

Wages (Rs.) : 0-10 10-20 20-30 30-40 40-50

No. of workers : 15 20 30 25 10
Problem
Calculate the Pearson’s coefficient of skewness based on Mean and Mode
from the following information.

Wages (Rs.) : 0-10 10-20 20-30 30-40 40-50

No. of workers : 15 20 30 25 10
Problem
Calculate the Pearson’s coefficient of skewness based on Mean and Mode from the following
information.

Class : 0-10 10-20 20-30 30-40 40-50 50-60 60-70 70-80

Frequency: 5 6 11 21 35 30 22 11
Practice Problem
The radio music listener market is diverse. Listener formats might include adult
contemporary, album rock, top 40, oldies, rap, country and western, classical, and
jazz. In targeting audiences, market researchers need to be concerned about the
ages of the listeners attracted to particular formats. Suppose a market researcher
surveyed a sample of 170 listeners of country music radio stations and obtained
the following age distribution.
Age Frequency
A. What are the mean and modal ages of country music 15–under 20 9
20–under 25 16
listeners? 25–under 30 27
30–under 35 44
B. What are the variance and standard deviation of the 35–under 40 42
ages of country music listeners? 40–under 45 23
45–under 50 7
C. Calculate the Pearson’s coefficient of skewness 50–under 55 2

Nature's Fury - Greg Maroney
No ratings yet
Nature's Fury - Greg Maroney
12 pages
(WWW - Asianovel.com) - How To Survive As A Villain Chapter 1 - Chapter 10
No ratings yet
(WWW - Asianovel.com) - How To Survive As A Villain Chapter 1 - Chapter 10
41 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
59 pages
Mba - I QM QB Unit 2
No ratings yet
Mba - I QM QB Unit 2
23 pages
Measures of Dispersion Tendency
No ratings yet
Measures of Dispersion Tendency
7 pages
Chapter 4 Measures of Dispersion
No ratings yet
Chapter 4 Measures of Dispersion
45 pages
Measures of Dispersion: Profgrcnair
No ratings yet
Measures of Dispersion: Profgrcnair
22 pages
Dispersion
No ratings yet
Dispersion
31 pages
R3.Descriptive Statistics
No ratings yet
R3.Descriptive Statistics
5 pages
Decriptive Part 3
No ratings yet
Decriptive Part 3
32 pages
Lecture 3
No ratings yet
Lecture 3
10 pages
DDDDDD 2
No ratings yet
DDDDDD 2
5 pages
Lecture 2b Brief Lecture Notes On Measures of Dispersion (Variability)
No ratings yet
Lecture 2b Brief Lecture Notes On Measures of Dispersion (Variability)
11 pages
Imp - MEASURES OF DISPERSION
No ratings yet
Imp - MEASURES OF DISPERSION
5 pages
Health Statistics III 2.1 Cert
No ratings yet
Health Statistics III 2.1 Cert
53 pages
Unit Five
No ratings yet
Unit Five
23 pages
BComp3 Module 5 Measures of Variability
No ratings yet
BComp3 Module 5 Measures of Variability
17 pages
Group-1 Module-1 PPT
No ratings yet
Group-1 Module-1 PPT
100 pages
9 MMW Data Management UNgrouped N Grouped FM1B
No ratings yet
9 MMW Data Management UNgrouped N Grouped FM1B
42 pages
sp5 1
No ratings yet
sp5 1
20 pages
Measures of Central Tendency
100% (15)
Measures of Central Tendency
15 pages
Lec006 - Measures of Dispersion
No ratings yet
Lec006 - Measures of Dispersion
42 pages
Measures of Dispersion Range & Quartile Deviation
No ratings yet
Measures of Dispersion Range & Quartile Deviation
5 pages
Qtymeth Dispersion
No ratings yet
Qtymeth Dispersion
8 pages
EDA W3 Obtaining-Data
No ratings yet
EDA W3 Obtaining-Data
57 pages
Variability
No ratings yet
Variability
26 pages
Dispersion
No ratings yet
Dispersion
26 pages
Topic 1 Numerical Measure
No ratings yet
Topic 1 Numerical Measure
11 pages
Chapter - 4 Dispersion
No ratings yet
Chapter - 4 Dispersion
10 pages
Introduction To Descriptive Statistics 2014
67% (3)
Introduction To Descriptive Statistics 2014
72 pages
Standard and Quartile Deviation
No ratings yet
Standard and Quartile Deviation
7 pages
Measures of Partition and Dispersion
No ratings yet
Measures of Partition and Dispersion
51 pages
Ken Black QA ch03
0% (1)
Ken Black QA ch03
61 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
40 pages
Lecture 4
No ratings yet
Lecture 4
56 pages
2 Measures of Location - Dispersion
No ratings yet
2 Measures of Location - Dispersion
61 pages
Dispersion
50% (2)
Dispersion
58 pages
Quartile & Deviation
No ratings yet
Quartile & Deviation
31 pages
Quartile & Deviation
No ratings yet
Quartile & Deviation
31 pages
Unit 3. Measures of Dispersion Revised
No ratings yet
Unit 3. Measures of Dispersion Revised
41 pages
Chapter 4 QD, SD, Empirical Rule
No ratings yet
Chapter 4 QD, SD, Empirical Rule
25 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
46 pages
Group 4 Data Management Notes
No ratings yet
Group 4 Data Management Notes
21 pages
Probability Theory
No ratings yet
Probability Theory
354 pages
Lecture 3 - Numerical Statistics
No ratings yet
Lecture 3 - Numerical Statistics
7 pages
Lecture 5 Notes
No ratings yet
Lecture 5 Notes
23 pages
2 Descriptives
No ratings yet
2 Descriptives
43 pages
Topic 1 Describing Data II
No ratings yet
Topic 1 Describing Data II
68 pages
Dispersion 1
No ratings yet
Dispersion 1
18 pages
Chapter 3 Dispersion
No ratings yet
Chapter 3 Dispersion
12 pages
Dispersion Measures
No ratings yet
Dispersion Measures
23 pages
CH 2 Lecture Notes
No ratings yet
CH 2 Lecture Notes
12 pages
Unit 4 Descriptive Statistics
No ratings yet
Unit 4 Descriptive Statistics
8 pages
4 - Dispersion & Skewness - Part 1
No ratings yet
4 - Dispersion & Skewness - Part 1
35 pages
Unit - 2: Measures of Central Tendency
No ratings yet
Unit - 2: Measures of Central Tendency
8 pages
Measures of Dispersion: Hapter
No ratings yet
Measures of Dispersion: Hapter
17 pages
Measures of Disperson
No ratings yet
Measures of Disperson
17 pages
4 - Dispersion & Skewness - Part 1
No ratings yet
4 - Dispersion & Skewness - Part 1
35 pages
Statistics 1
No ratings yet
Statistics 1
10 pages
Descriptive Statistics: Six Sigma Thinking, #3
From Everand
Descriptive Statistics: Six Sigma Thinking, #3
Sumeet Savant
No ratings yet
Certified Lean Six Sigma Green Belt (ICGB) Practice Questions And Exam Tests ICGB Exam Guidebook And Updated Questions
From Everand
Certified Lean Six Sigma Green Belt (ICGB) Practice Questions And Exam Tests ICGB Exam Guidebook And Updated Questions
Idea Link
No ratings yet
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
From Everand
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
Seaport AI Madhavan
No ratings yet
Transportation Problem Unbalanced
No ratings yet
Transportation Problem Unbalanced
12 pages
4 Sensitivity Analysis
No ratings yet
4 Sensitivity Analysis
35 pages
Project - Picking Assignment: I065 Name: Vidhi Maheshwari
No ratings yet
Project - Picking Assignment: I065 Name: Vidhi Maheshwari
7 pages
Case 5 - Pricing Under Pressure
No ratings yet
Case 5 - Pricing Under Pressure
2 pages
The Greatest Showman Million Dreams Chords
No ratings yet
The Greatest Showman Million Dreams Chords
3 pages
Charms That Soothe Classical Music and The Narrative Film by Dean Duncan PDF
No ratings yet
Charms That Soothe Classical Music and The Narrative Film by Dean Duncan PDF
222 pages
152 Prelude in A Moll A.scriabin
No ratings yet
152 Prelude in A Moll A.scriabin
2 pages
Ballet: Manuel Maria Ponce
No ratings yet
Ballet: Manuel Maria Ponce
2 pages
Q4 P.E. Modules 1-4
No ratings yet
Q4 P.E. Modules 1-4
27 pages
Adam Szabo Thesis
100% (2)
Adam Szabo Thesis
6 pages
E203 - Transverse-Waves - Frequency-of-Vibration - Worksheet 2
No ratings yet
E203 - Transverse-Waves - Frequency-of-Vibration - Worksheet 2
4 pages
Lily Duolingo Wiki Fandom
No ratings yet
Lily Duolingo Wiki Fandom
1 page
Pihkal
No ratings yet
Pihkal
20 pages
Indonesian's Dangdut Music Classification PDF
No ratings yet
Indonesian's Dangdut Music Classification PDF
5 pages
GRP Work - Little Women PPT Using Eng Lang
No ratings yet
GRP Work - Little Women PPT Using Eng Lang
16 pages
All Tik Tok Songs Combined
No ratings yet
All Tik Tok Songs Combined
2 pages
JRDigGS - Opportunities - Brochure - 2022 Final
No ratings yet
JRDigGS - Opportunities - Brochure - 2022 Final
7 pages
Score Examples - Mark Berry - The Commissioning Project - Sounds, Shapes, and Synergy - Eight Works For Triangle Soloist
No ratings yet
Score Examples - Mark Berry - The Commissioning Project - Sounds, Shapes, and Synergy - Eight Works For Triangle Soloist
21 pages
Roland R8 Owners Manual (OCR)
No ratings yet
Roland R8 Owners Manual (OCR)
239 pages
Flowkey - Mrs. Robinson (Beginner)
No ratings yet
Flowkey - Mrs. Robinson (Beginner)
1 page
Welcome To The Transcription Style Guide!
No ratings yet
Welcome To The Transcription Style Guide!
16 pages
Prince
No ratings yet
Prince
13 pages
Agnus Dei From Michael W Smith
No ratings yet
Agnus Dei From Michael W Smith
12 pages
Movie Reviewt
No ratings yet
Movie Reviewt
22 pages
PRACTICE TEST 12 AK đã chuyển đổi đã nén
33% (3)
PRACTICE TEST 12 AK đã chuyển đổi đã nén
15 pages
Single Down Strokes Here..: Nallai Allai
No ratings yet
Single Down Strokes Here..: Nallai Allai
7 pages
Romantic Flight Full
No ratings yet
Romantic Flight Full
3 pages
J. S. Bach Air For Flute and Piano
No ratings yet
J. S. Bach Air For Flute and Piano
3 pages
RRB NTPC Static GK Course Schedule
No ratings yet
RRB NTPC Static GK Course Schedule
1 page
Ocarina of Time-Clarinet - II
No ratings yet
Ocarina of Time-Clarinet - II
2 pages
Applications Violin 1st Call
No ratings yet
Applications Violin 1st Call
29 pages
What Is It: Learning Activity Sheets-First Quarter
No ratings yet
What Is It: Learning Activity Sheets-First Quarter
2 pages

Unit II

Uploaded by

Unit II

Uploaded by

Unit II

Dispersion is the spread of the data in a distribution, that is, the

• Are the salespeople being successful as a group or not?

• An advantage of the range is its ease of computation.

• Another measure of variability is the interquartile range.

• In addition, the interquartile range is

Q1 = [(n+1)/4]th item

Q2 = [(n+1)/2]th item

Q3 = [3(n+1)/4]th item

Where, Qr is the rth quartile

• Quartile deviation is defined as half of

• It is also called Semi Interquartile range.

• If Q1 is the first quartile and Q3 is the

D1 = [(n+1)/10]th item

D2 = [2(n+1)/10]th item

D9 = [9(n+1)/10]th item

Where, Dr is the rth Decile

• Percentiles are a measure of the

• Similarly, 25th and 75th percentiles are the lower (Q1) and

• The quartiles, deciles, and percentiles are also

P1 = [(n+1)/100]th item

P2 = [2(n+1)/100]th item

P99 = [99(n+1)/100]th item

Where, Pr is the rth percentile

SD of XYZ stock 20.68%.

SD of ABC stock is a much lower 0.0129 or 1.29%

• There isn’t a standard benchmark of

• it all depends on your investing

• The formula we use for standard deviation depends on whether the

• If the data is being considered a population on its own, we divide by

• If the data is a sample from a larger population, we divide by one

iv Determine the Coefficient of Quartile Deviation

1. Coefficient of Quartile Deviation

• The following data represent the

Q.04 The mean of 5 observations is 15 and the variance is 9. If two more

Wages (Rs.) : 0-10 10-20 20-30 30-40 40-50

Wages (Rs.) : 0-10 10-20 20-30 30-40 40-50

Class : 0-10 10-20 20-30 30-40 40-50 50-60 60-70 70-80

You might also like