0% found this document useful (0 votes)
12 views61 pages

B. Data Management

The document outlines a lesson plan on measures of dispersion and relative position in data management for BEED students. It covers key concepts such as range, variance, standard deviation, quartiles, and z-scores, along with examples and calculations to illustrate these concepts. The lesson emphasizes the importance of understanding data variability and distribution for real-world applications.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
12 views61 pages

B. Data Management

The document outlines a lesson plan on measures of dispersion and relative position in data management for BEED students. It covers key concepts such as range, variance, standard deviation, quartiles, and z-scores, along with examples and calculations to illustrate these concepts. The lesson emphasizes the importance of understanding data variability and distribution for real-world applications.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 61

Data Management

Prayer

Mathematics in the Modern World


Data Management

Attendance

Mathematics in the Modern World


The class will be divided into
01 two groups.

Each group will be given a set


02 of clues that they need to
accomplish in order to
find the mystery word/phrase.

Mystery, 03
After 10 minutes, they will
write their answer on a

Mystery bondpaper and the teacher


will ask follow up questions
related to the phrase given.
Motivational Activity
Data Management

Measures of
Dispersion and
Measures of Relative
position
Prepared by: Wendy Lyn H. Edradan

Mathematics in the Modern World


Learning Objectives
At the end of the lesson, the BEED students should be able to:

Define key concepts such as range, variance, standard deviation,


quantiles, and z-score

Explain how measure of dispersion and relative position help describe


the spread and ranking of data in a data set

Calculate the standard deviation and quartile of a given data set

Compare and contrast different data sets using measures of dispersion


and relative position to determine data variability and distribution
Measures of
dispersion
Dispersion is the difference
between the actual value and
average value. It is the extent to
which a distribution is stretched
or squeezed. It represents the
size of distribution of values that
are expected for a specific
variable
Variability
Variability describes how far apart
data points lie from each other and
from the center of a distribution.
Range
The range is the difference of the highest value
and the lowest value in the data set. There are two
advantages of the range: (i) it is easy to compute
and (ii) it is easy to understand. On the other
hand, it also has two disadvantages, it can be
distorted by a single extreme value (or outlier)
and only two values are used in the calculation.
Example
The daily rates of a sample of eight employees at GMS Inc.
are P550, P420, P560, P500, P700, P670, P860, P480. Find
the range.
Solution:
Step 1: Determine the highest value and lowest value in the
data set.
Highest Value (HV) = P860 Lowest Value (LV)-P420
Step 2: Solve for the range.
Range= Highest Value (HV)-Lowest Value (LV)= P860-P420
=P440
The range in daily
rate salary is P440.
Variance and Standard
deviation
The range is the difference of the highest value and the
lowest value in the data set. There are two advantages
of the range: (i) it is easy to compute and (ii) it is easy
to understand. On the other hand, it also has two
disadvantages, it can be distorted by a single extreme
value (or outlier) and only two values are used in the
calculation.
Standard Deviation Variance
Standard deviation is the Variance is a statistical
average amount of variability in measurement of the spread
your data set. It tells you, on between numbers in a data set. It
measures how far each number in
average, how far each value
the set is from the mean (average),
lies from the mean. It is
and thus from every other number
calculated as the square root of
in the set. It is a mathematical
variance. Standard Deviation is expectation of the average squared
often depicted by this symbol: deviations from the mean. Variance
σ. is often depicted by this symbol:
σ^2.
Sample Variance and Sample Standard
Deviation for Ungouped Data
Example
The daily rates of a sample of eight
employees at GMS Inc. are P550, P420,
P560, P500, P700, 670, P860, P480.
Find the variance and standard
deviation.
Step 1
Compute the mean of the data set
Step 2
Subtract the mean from each of the value in the
data set.
Step 3 _
Square the 𝑥 − 𝑥 then get the sum.
Step 4
Solve for variance and the standard deviation. We can also
obtain the standard deviation by simply extracting the
square root of the variance.

Variance

Standard Deviation
Hence, the variance is
P20421.43 and the
standard deviation is
P142.90
An alternative
solution can be done
using the other
formulas
Step 1
Get the sum of the data set
Step 2
Square the values of the data set and get the sum
Step 3
Solve for the values of the variance and standard
deviation
Thus, the variance is
P20421.43 and the
standard deviation is
P142.90
Population Variance and population
Standard Deviation for Ungouped Data
Example
The monthly incomes of the five
research directors of Recoletos schools
are: P55,000, P59,500, P62,500,
P57,000, and P61,000. Find the
variance and standard deviation.
Step 1
Compute the mean of the data set.
Step 2
Subtract the population mean from each of
the value in the data set.
Step 3
Get the square of 𝑥 − 𝜇 then get the sum
Step 4
Solve for the population variance and
population standard deviation

Variance

Standard Deviation
Hence, the population
variance is 730,000 and
the population standard
deviation is 2,701.85.
Measures of
relative Position
Measures of relative position, like
percentiles, quartiles, and z-
scores, describe where a
particular data point stands in
relation to the entire dataset,
helping us understand its position
within the distribution.
Quartiles
Example
Find first, second, and third quartiles of
the ages of 9 middle-management
employees of a certain company. The
ages are 53, 45, 59, 48, 54, 46, 51, 58,
and 55.
Step 1
Arrange the data in order.

45, 46, 48, 51, 53, 54, 55, 58, 59


Step 2
Select the first, second, and third quartiles value
using formula
Step 3
Identify the first, second, and third quartiles in the
data set.
Step 3
Since the 2.5th falls between 46 and 48; and 7.5th
falls between 55 and 58, we can determine the first
and third quartiles of the data set by getting the
average of the two values.
Therefore,
𝑸𝟏=47, 𝑸𝟐=53, and
𝑸𝟑=56.5
z-score
To know the position of one observation
relative to others in a set of data we apply
z-score. It measures the distance between
an observation and the mean, measured in
units of standard deviation.
Formulas on computing z score for a
data value x in a sample and population

Population Sample
Example
The monthly expenditures of a large group of
households are normally distributed with a
mean of P48,700 and a standard deviation of
P10,400. What is the zvalue of monthly
expenditures of P59,400 and P38,300?
Solution
Let μ=48,700 σ =10,400

Using the formula of z to determine z- values for the two x values


(P59,400 and P38,300) are computed as follow:
The z of 1.00 indicates that a
monthly expenditure of
P59,400 for households is one
standard deviation above the
mean, and a z of -1.00 shows
that a P38,300 monthly
expenditures is one standard
deviation below the mean.
Note that both household
monthly expenditures
(P59,400 and P38,300) are
the same distance (P10,400)
from the mean.
Example
A normal curve has a mean of 650 and a
standard deviation of 40. An analyst
is interested in value of 575 and wants to find
its equivalent z-score.
Solution
_
Given: 𝑥 = 650 s = 40 x = 575

Substitute the given values into the z-score of -1.875


The value 575
has a z-score
of -1.875.
Box-and-Whisker Plot
John Wilder Tukey (1915-2000) introduced the boxplot in the
1970's. A boxplot (or boxand-whisker plot) is graph of a data
set obtained by drawing a horizontal line from the minimum
data to first quartile (Q1), drawing a horizontal line to third
quartile (Q) to the maximum data value, and drawing a box
whose vertical line passes through Qi and Qs with a vertical line
inside the box passing through the median or second quartile
(Q2).
The boxplot will give the
following information:
1. If the median is near the center of the box, the distribution is
approximately symmetric.
2. If the median falls to the right of the center of the box, the distribution
is negatively skewed.
3. If the median falls to the left of the center of the box, the distribution is
positively skewed.
4. If the lines are about the same length, the distribution is approximately
symmetric.
5. If the left line is larger than the right line, the distribution is negatively
skewed.
6. If the right line is larger than the left line, the distribution is positively
skewed.
Box-and-Whisker Plot
Example
Construct a boxplot for the data set of the
ages of 9 middle-management employees of
a certain company. The ages are 53, 45, 59,
48, 54, 46, 51, 58, and 55. What can we say
about the distribution of the data set?
Step 1
Determine the 𝑄1, Median, and 𝑄3 of the
given data set.

Recall that 𝑄1= 47, Median= 53, and 𝑄3=


56.5
Step 2
Locate the lowest value, 𝑄1,
the median, 𝑄3 and the
highest value on the scale.
Step 3
Draw a box around 𝑄1 and 𝑄3 draw a vertical line through
the median, and connect the 𝑄1 upper and lower values,
as shown in the figure:
The value of the data is
negatively-skewed, since
the median falls to the
right of the box.
Why is it important to
understand measures of
dispersion and relative
position when analyzing
real-world data, such as
income distribution or
student performance? How
can this knowledge help
promote fairness and equity
Let's think! in society?
In making decisions (e.g.,
choosing a school,
evaluating product quality,
or assessing risks in
investments), how can
understanding variability
and relative position guide
you in making better, more
informed choices? Provide
Let's think! an example.
Evaluation
On a one whole sheet of yellow paper,
calculate the range, variance, standard
deviation, quartiles, and z-score of the 3rd
data for the given dataset (12, 18, 25, 30, 35,
40, 50). Show your complete solution.
Assignment
Find the standard deviation, quartiles, and z-score
of the following data sets . Show your solution.

Two classes took a math test. Their scores are as


follows:
Class A Scores: 50, 55, 60, 65, 70, 75, 80, 85
Class B Scores: 40, 50, 55, 60, 75, 90, 95, 100
Assignment
Answer the following questions:
1. Compare and contrast the standard deviation of
Class A and Class B. Which class has more
variability in scores?
2. Explain how the quartiles and z-scores can help
in determining which students performed above or
below average.
Data Management

Thank You
MMW

You might also like