Module 3 - Statistics
Module 3 - Statistics
Statistics
1. It is the difference between the lowest and the highest value in the
distribution.
a. 21 b. 24 c. 14 d. 7
a. 2 b. 2.5 c. 3 d. 3.5
C. On a 20-item quiz, the mean score is 12 and the standard deviation is 4. Find
the score that is
x 100 - 110 111 - 121 122 - 132 133 - 143 144 - 154 155 – 165
f 2 1 5 12 9 1
2
What you will do
Lesson 1
The Range
The three measures of central tendencies that you have learned in the
previous module do not give an adequate description of the data. We need to
know how the observations spread out from the average or mean. It is quite
possible to have two sets of observations with the same mean or median that
differs in the variability of their measurements about the mean.
Sample A Sample B
0.97 1.06
1.00 1.01
0.94 0.88
1.03 0.91
1.11 1.14
Both samples have the same mean, 1.00 liters. It is quite obvious that company
A packed apple juice with a more uniform content than company B. We say that
the variability or the dispersion of the observations from the mean is less for
sample A than for sample B. Therefore, in buying apple juice, we would feel more
confident that the tetra pack we select will be closer to the advertised mean if we
buy from company A.
Statistics other than the mean may provide additional information from
the same data. This statistics are the measures of dispersion.
3
Measures of dispersion or variability refer to the spread of the values
about the mean. These are important quantities used by statisticians in
evaluation. Smaller dispersion of scores arising from the comparison often
indicates more consistency and more reliability.
The Range
R = H - L
The main disadvantage of the range is that it does not consider every
measure in the data.
Examples:
1. The IQs of 5 members of a family are 108, 112, 127, 118 and 113. Find
the range.
2. The range of each of the set of scores of the three students is as follows:
H = 98 L = 92 R = 98 - 92 = 6
Student A
H = 97 L = 90 R = 97 - 90 = 7
Student B
H = 97 L = 90 R = 97 - 90 = 7
Student C
Observe that two students are “ tie.” This indicates that the range is not a
reliable measure of dispersion. It is a poor measure of dispersion, particularly if
the size of the sample or population is large. It considers only the extreme values
and tells us nothing about the distribution of numbers in between.
4
3. Consider the following two sets of data, both with a range of 12:
Set A Set B
3 3
4 7
5 7
6 7
8 8
9 8
10 8
12 9
15 15
In set A the mean and median are both 8, but the numbers vary over the
entire interval from 3 to 15. In set B the mean and median are also 8, but most of
the values are closer to the center of the data. Although the range fails to
measure the dispersion between the upper and lower observations, it does have
some useful applications. In industry the range for measurements on items
coming off an assembly line might be specified in advance. As long as all
measurements fall within the specified range, the process is said to be in control.
1. It makes use of very little information: that is, it ignores all but two
items only.
4. It cannot identify the differences between two sets of data with the
same extreme values, example, the two sets of data are
2, 2, 2, 2, 2, 2, 2, 2, 18
5
Range of a Frequency Distribution
Example:
Scores Frequency
46 - 50 1
41 - 45 10
36 - 40 10
31 - 35 16
26 - 30 9
21 - 25 4
3. ( 12, 12, 13, 13, 13, 13, 13, 15, 19, 20, 20 )
6
6. ( 12, 13, 14, 15, 16, 17, 18 )
8. ( 12, 12, 13, 13, 13, 13, 13, 15, 19, 20, 20 )
Pete Ricky
82 88
98 94
86 89
80 87
100 92
94 90
2.
x 100–110 111–121 122 - 132 133-143 144-154 155–165
f 2 1 5 12 9 1
C.1. The reaction times for a random sample of 9 subjects to a stimulant were
recorded as 2.5, 3.6, 3.1, 4.3, 2.9, 2.3, 2.6, 4.1 and 3.4 seconds. Calculate
range.
2. If the range of the set of scores is 29 and the lowest score is 18, what is the
highest score?
3. If the range of the set of scores is 14, and the highest score is 31, what is
the lowest score?
7
Lesson 2
Examples:
Solution:
Student A:
97 + 92 + 96 + 95 + 90
x = = 94
5
97 - 94 = 3
92 - 94 = -2
96 - 94 = 2
95 - 94 = 1
90 - 94 = -4
8
Step 3. Square each deviation.
( 3 )2 = 9
( -2 )2 = 4
( 2 )2 = 4
( 1 )2 = 1
( -4)2 = 16
9 + 4 + 4 + 1 + 16
= 6.8
5
Step 5. Get the square root of the mean of the squared deviations.
SD = 6.8 = 2.6
Student B:
94 + 94 + 92 + 94 + 96
x = = 94
5
94 - 94 = 0
94 - 94 = 0
92 - 94 = -2
94 - 94 = 0
96 - 94 = 2
9
Step 3. Square each deviation.
( 0)2 = 0
( 0 )2 = 0
( 2 )2 = 4
( 0 )2 = 0
( 2 )2 = 4
4+4
= 1.6
5
Step 5. Get the square root of the mean of the squared deviations.
SD = 1.6 = 1.3
Student C:
95 + 94 + 93 + 96 + 92
x = = 94
5
95 - 94 = 1
94 - 94 = 0
93 - 94 = -1
96 - 94 = 2
92 - 94 = -2
10
Step 3. Square each deviation.
( 1)2 = 1
( 0 )2 = 0
( -1 )2 = 1
( 2 )2 = 4
( -2 )2 = 4
1+ 0 +1+ 4 + 4
= 2
5
Step 5. Get the square root of the mean of the squared deviations.
SD = 2 = 1.4
∑ (x − x )
2
SD =
n
x = individual score
x = mean
n = number of scores
11
This can also be illustrated by plotting the scores on the number line.
x = 94
2.6 2.6
90 91 92 93 94 95 96 97
x = 94
1.3 1.3
90 91 92 93 94 95 96 97
x = 94
1.4 1.4
90 91 92 93 94 95 96 97
5 + 10 + 16 + 19 + 24 + 26 + 29 + 30 + 39
x = = 22
9
12
x x- x ( x - x )2
5 -17 289
10 -12 144
16 -6 36
19 -3 9
24 2 4
26 4 16
29 7 49
30 8 64
39 17 289
∑ (x − x )
2
SD =
9
900
=
9
= 100
SD = 10
The standard deviation is 10. This means that most of the scores are
within 10 units from the mean or the values are scattered widely about the mean.
∑ (x − x )
2
f
SD =
n −1
13
2. Get the midpoint ( x ) of each class interval in column 2.
8. Multiply frequency ( f ) and ( x - x )2. Find the sum of each product to get
∑ (x − x ) f .
2
∑ (x − x )
2
f
SD =
n −1
Number of Frequency
Mistakes
18 – 20 2
15 – 17 5
12 – 14 6
9 – 11 10
6–8 15
3–5 8
0–2 4
Total 50
14
Solution:
x =
∑ fx =
438
= 8.76
n 50
1 - 8.76 = -7.76
15
7. Square the deviation of each interval to get ( x - x )2.
( -7.76 )2 = 60.22
8. Multiply frequency ( f ) and ( x - x )2. Find the sum of each product to get
∑ (x − x ) f .
2
( 60.22 ) 4 = 240.88
Number of f X Fx (x- x ) ( x - x )2 f ( x - x )2
Mistakes
18 – 20 2 19 38 10.24 104.86 209.72
16
∑ (x − x )
2
f
9. Calculate the standard deviation using the formula SD =
n −1
∑ (x − x )
2
f
SD =
n −1
=
(1062.56)
50 − 1
1062.56
=
49
= 21.68
SD = 4.66
Scores IV – Emerald
41 – 45 1
36 – 40 5
31 – 35 10
26 – 30 12
21 – 25 10
16 – 20 5
11 – 15 3
6 – 10 3
1–5 1
17
Solution:
(∑ f )[∑ ( fX )] − [∑ ( fX )]
2 2
SD =
∑ f (∑ f − 1)
where: SD = standard deviation
f = frequency
x = midpoint
Scores F x Fx x2 fx2
41 - 45 1 43 43 1849 1849
16 - 20 5 18 90 324 1620
11 - 15 3 13 39 169 507
6 - 10 3 8 24 64 192
1 - 5 1 3 3 9 9
∑f = 50 ∑ fx = 1,285 ∑ fx 2
= 36, 985
(∑ f )[∑ ( fX )] − [∑ ( fX )]
2 2
SD =
∑ f (∑ f − 1)
=
(50)(36,985) − (1,285)2
50(50 − 1)
1,819,250 − 1,651,225
=
50(49)
18
168,025
=
2,450
= 68.58
SD = 8.28
3. ( 12, 12, 13, 13, 13, 13, 13, 15, 19, 20, 20 )
C. The IQ of 100 pupils at a certain high school are given in the following table.
IQ Frequency
55 - 64 1
65 - 74 3
75 - 84 7
85 - 94 20
95 - 104 32
105 - 114 25
115 - 124 10
125 - 154 1
135 - 144 1
19
Lesson 3
34% 34%
13.5% 13.5%
2.5% 2.5%
x - 3s x - 2s x- s x x +s x + 2s x - 3s
When two groups are comparer, the group having a smaller standard
deviation is less varied.
20
Examples:
a. How many students had score within the range one standard
deviation from the mean?
b. How many students had scores within the range one standard
deviation from the mean?
c. if the school will be very selective and will admit only students who
got scores one standard deviation above the mean, how many
students will be taken in?
Answer:
a. The range of score with one standard deviation from the mean
21
Student C is 1.4. Based on the result we can say that Student B has more
consistent scores.
Set A 10, 11, 9, 12, 11, 10, 12, 11, 11, 9, 10, 12, 9, 11, 10, 12
Set B 8, 10, 11, 9, 11, 12, 10, 15, 8, 9, 10, 13, 14, 10, 9, 11
Set A:
x = 10.63 R=3
Set B:
x = 10.63 R=7
Set A tells us that this group of students whose scores are very near each
other have almost the same abilities and therefore more teachable and would
progress at the same rate.
Set B consists of very slow and very fast learners. They are more difficult
to manage because different mental abilities.
1. The scores received by Ann and Apple on a ten math quizzes are as
follows:
Ann: 4, 5, 3, 2, 2, 5, 5, 3, 5, 0
Apple : 5, 4, 4, 3, 3, 1, 4, 0, 5, 5
22
2. The mean IQ score for 2,000 students is 110 with a standard deviation of
12. The scores follow approximately a normal curve
Third Fourth
Year Year
Mean 80 81.4
Standard 2.5 5.5
Deviation
4. Two brands of air conditioning units were compared as to their life span in
years and the following data was obtained.
Brand Brand
A B
Mean 9.2 11.5
23
Let’s summarize
The range is the simplest measure of variability. It is the difference between
the largest and smallest measurement.
R = H - L
Where, R = Range,
H = Highest measure,
L = Lowest Measure
∑ (x − x )
2
SD =
n
x = individual score
x = mean
n = number of scores
∑ (x − x ) (∑ f )[∑ ( fX )] − [∑ ( fX )]
2 2 2
f
SD = SD =
n −1 ∑ f (∑ f − 1)
34% 34%
13.5% 13.5%
2.5% 2.5%
x - 3s x - 2s x- s x x +s x + 2s x - 3s
24
What have you learned
Answer the following completely:
2. The minimum distance a batter has to hit the ball down the center of the field
to get a home run in 8 different stadiums is 410, 420, 406, 400, 440, 421, 402
and 425 ft. Compute the standard deviation.
4. In the entrance examination to an elite school, 9854 took the test. The mean
of the applicants was 76.5 with a standard deviation of 8.5. Assume that the
results approximated a normal curve.
a. How many of the students scored within one standard deviation from the
mean?
b. If only those applicants who got scores of at least one standard deviation
above the mean were accepted and the score of Albert was 87, was he
accepted? Why?
d. If the applicants who got scores 2 standard deviations above the mean
were the students given scholarships, about how many students were
granted scholarships?
25
Answer Key
How much do you know
A. 1. d 2. c 3. a 4. d 5. d
C. 1. 17 2. 20 3. 18 4. yes
D. 1. The mean will increase by 5 but the standard deviation will remain the
same.
2. The mean will decrease by 1 but the standard deviation will remain the
same.
Lesson 1
A 1. 6 2. 8 3. 8 4. 14 5. 15
6. 6 7. 8 8. 8 9. 14 10. 15
Lesson 2.
5. Mean = 30 SD = 5
26
Lesson 3
1. Ann Apple
c. both of them were consistent with the results of their test, which means
that they are equally good.
b. The 4th year group because the standard deviation is higher than the 3rd
year group.
1. a. 4 b. 17 c. 14
c. Both Students A and B were consistent with their scores in the test.
4. a. 6701 students
27