STATISTICS MODULE 2-Updated
STATISTICS MODULE 2-Updated
MODULE
FE C. MONTECALVO
Professor VI
GRADUATE SCHOOL
MAED, MSGC, MAST, MBA, & MPM
2020
Module Description:
This module deals with Descriptive Statistics. Descriptive statistics will teach
you the basic concepts used to describe data. They are used to describe the basic
features of the data in a study. They provide simple summaries about the sample and
the measures. Together with simple graphics analysis, they form the basis of virtually
every quantitative analysis of data.
Module Guide:
Module Outcomes:
Module Requirements:
At the end of this module, the students shall submit the following:
1. Assignments
2. Quizzes
Assessments:
Key Terms:
mean
median
mode
variability
range
Inter-quartile range
Quartile deviation
Mean absolute deviation
Variance
Standard deviation
SUMMARY STATISTICS
numerical measures that are used to describe certain characteristics of the
data
Measures of Location
Measures of Dispersion
Sample Mean:
x̄ = ————————————————
Size of the sample (n)
Example:
43 51 53 55 57 58 58 59 61 61
61 62 63 64 65 65 66 66 67 68
68 69 69 69 69 70 70 70 71 71
72 73 73 74 74 75 76 76 77 78
79 79 81 82 82 85 87 89 91 96
43 + 51 + …+ 91 + 96 3498
µ = —————————— = ——— = 69.96
50 50
Suppose that a sample of seven students from this college yielded the
following observations:
70 + 82 + 77 + 96 + 55 + 85 + 64
x̄ = ——————————————— = 75.57
7
Suppose another sample of students of the same size was taken and
resulted to the following scores:
58 + 72 + 77 + 89 + 63 + 85 +64
x̄ = ——————————————— = 70.714
7
The Median
a value that divides an ordered set of data (array) into two equal parts
and its commonly denoted by Md
a. The following are the total receipts of 7 companies (in million pesos):
The median is
11+16
Md = —————— = 13.5
2
It is a positional measure.
The Mode
the value in the data set that occurs with the highest frequency
Example:
Section 1: 83 64 98 66 83 87 83 93
86 80 93 83 75
Section 2: 87 76 96 77 94 92 88 85
66 89
Section 3: 68 84 79 79 84 75 80
Determine the mode for each set in the context of this problem.
[Course Code]: [Course Title] Page 7 of 21
Section Mode
1 83
2 does not exist
3 84 and 79
It does not always exist; if it does, it may not be unique. If a data set
has two modes, we call it bimodal, if there are three modes, we call
it trimodal and so on.
MEASURES OF LOCATION
numbers below which a specified amount or percentage of data must lie and
are oftentimes used to find the position of specific piece of data in relation to
the entire set of data
Percentiles
Values that divide an ordered set of data into 100 equal parts
Values that divide an ordered set of the data into 10 equal parts
Quartiles
values that divide an ordered set of the data into 4 equal parts
the ith quartile (i=1,2,3), denoted by Q i, is a value which 25i% of the data
must lie
Example:
The data from 50 measurements of the traffic noise level at an intersection are
already ordered from smallest to largest in the table given below. Locate the
quartiles.
Suggested Readings:
https://uomustansiriyah.edu.iq/media/lectures/5/5_2018_12_10!
09_06_45_PM.pdf
https://www.slideshare.net/chuckrymaunes5/measures-of-position-for-
ungrouped-data-quartiles-deciles-percentiles-130064276
58 30 75 77 94 97 80 35
74 58 70 99 60 63 71 16
Find: 1. Mean
2. Median
3. Mode
4. Q1
5. D5
6. P75
References/Sources:
Batacan,M.C.A., et. al., (2007). Statistics for Filipino Students, 2 nd ed. Manila
Phils: National Books Store
Learning Plan
Lesson No: 2
MEASURES OF DISPERSION
Meaning of Variability:
Need of Variability:
With the help of measures of validity we can compare the original data
expressed in different units.
These are:
A. Range
So, Range = 10 – 2 = 8
2.) What is the range of the dataset with 10 numbers: 99, 45, 23, 67, 45, 91,
82, 78, 62, 51?
Range = 99 – 23 = 76
It is based upon the interval containing the middle fifty percent of cases in
a given distribution. One quarter means 1/4th of something, when a scale
is divided in to four equal parts. “The quartile deviation or Q is the one-
half the scale distance between the 75t and 25th percentiles in a
frequency distribution.”
Symbolically:
Solution:
First, we need to arrange data in ascending order to find Q3 and Q1 and avoid
any duplicates.
Q1 = ¼ (n + 1)
= ¼ (9 + 1)
=¼ (10)
Q3= ¾ (n + 1)
= ¾ (9 + 1)
=¾ (10)
Q3= 7.5 Term
Now, to find the quartiles, we use the logic that the first quartile lies halfway
between the lowest value and the median; and the third quartile lies halfway
between the median and the largest value.
Using the values for Q1 and Q3, now we can calculate the Quartile Deviation as
follows –
= 22–13.75
2
= 8.25
2
= 4.125
Mean = 3 + 6 + 6 + 7 + 8 + 11 + 15 + 16 / 8 = 72/ 8 = 9
Distance from 9
X
xi – |xi – |
3 3 – 9 = -6 6
6 6 – 9 = -3 3
6 6 – 9 - -3 3
7 7 – 9 = -2 2
8 8 – 9 = -1 1
11 11 – 9 = 2 2
15 15 – 9 = 6 6
16 16 – 9 = 7 7
Σ |xi – | = 30
Step 3. Find the mean of those distances:
= 30/8 = 3.75
Variability can also be defined in terms of how close the scores in the
distribution are to the middle of the distribution. Using the mean as the
measure of the middle of the distribution, the variance is defined as the
average squared difference of the scores from the mean.
where s2 is the estimate of the variance and M or is the sample mean. Note
that M is the mean of a sample taken from a population with a mean of μ.
Since, in practice, the variance is usually computed in a sample, this formula
is most often used. The simulation "estimating variance" illustrates the bias in
the formula with N in the denominator.
Distance
X xi – from 9 (X - )2
|xi – |
3 3 – 9 = -6 6 36
6 6 – 9 = -3 3 9
6 6 – 9 - -3 3 9
= 148/8 = 18.5
= 148/7 = 21.14
Standard Deviation
The standard deviation is simply the square root of the variance. The symbol
for the population standard deviation is σ; the symbol for an estimate
computed in a sample is s.
where,
where,
148
S=
√ 8
= √ 18.5 = 4.30
148
σ=
√ 7
= √ 21.14 = 4.60
Let’s Read:
Assignment
58 30 75 77 94 97 80 35
74 58 70 99 60 63 71 16
Find: 1. Range
2. Quartile Deviation
3. Mean Absolute Deviation
4. Variance (Sample and pop. Variance)
5. Standard Deviation (Sample and pop SD)
References:
References/Sources: