Module 4 Content
Module 4 Content
Chapter IV
Analysis and Interpretation of Assessment Results
Overview
This chapter focuses on the Analysis and interpretation of assessment results. It
presents discussion on Descriptive Statistics and Correlation.
Discussion
Fields of Statistics
B. Descriptive Statistics
-the first step in data analysis is to describe, or summarize the data using
descriptive statistics.
The measure of central tendency is the point about which scores tend to
cluster, a sort of average in the series. It is the center of concentration of scores in
any set of data. It is a single number which represents the general level of
performance of a group.
Mean – known also as arithmetic mean is defined as the sum of the values in the
data group divided by the number of values.
Formula: χ = ∑χ
N
= 60
6
=10 units
Computation of Weighted Mean (WM) in case the researcher is using 5,4,3,2,1 scales
(weights).
Formula: χ = ∑fM
N
χ = ∑ (45)(5)+(30)(4)+(20)(3)+(25)(2)+(15)(1)
135
= 3.48 – this means that the pupils are undecided whether they
prefer science than other subjects.
Note: follow same process to compute the mean of the second and third
statements.
Median (Mdn) – the middle score in a distribution or the middle most score of
the distribution. Unlike the mean, the median is not sensitive to extreme scores; hence,
it is very appropriate when there are values which are relatively large or relatively small
compared to most of the scores.
To calculate the median, just arrange the scores from smallest to largest (or from
largest to smallest) and then count up to the middle score
The median would be the fourth score as there are seven scores. The median of
the scores is 16.
The median would be half way between the third and fourth scores as there are
six scores, (19+23)/2 = 21, so the median is 21.
Sometimes two figures tie for “most popular”, as with the ages
17,17,17,18,18,18,19,19,20,20,21. The modes are 17 and 18. This is referred as
bimodal.
The Modal class is 46-50 because it is the interval class with highest
frequency. The lower limit of the modal class is 46 and the upper limit is 50.
The formula in finding the mode of the grouped data:
Mode= lb(mo)+ D1 i
D1 + D2
lb(mo)- lower boundary of the modal class 46-0.5 = 45.5
D1- the highest frequency minus the frequency of the next lower
Class 10-2 = 8
D2- the highest frequency minus the frequency of the next upper
Class 10-7 = 3
i- the class width is the difference between two consecutive
lower limits 46-41 = 5
Mode= lb(mo)+ D1 i
D1 + D2
= 45.5 + 8 5
7
8+3
Mode = 49.14
The measures of variability we will consider are the range, the variance,
and the standard deviation.
A. RANGE
The range (R) is defined as the difference between the highest value (HV)
and the lowest value (LV) in the data.
R = HV-LV
Ex: James took 7 math tests in one marking period. What is the range of his
test scores?
89, 77, 95, 65, 90, 94, 70
R= 95-65= 30
B. VARIANCE
Variance also indicates a relationship between the mean of a distribution and the
data points.
Sample Variance:
s2= n∑ X2 – (∑ X)2
n(n-1)
8
s2= n∑ X2 – (∑ X)2
n(n-1)
= 10 (2,528) – (152)2
10(9)
= 24.18
Population Variance:
σ2 = ∑ (X – µ)2
N
Where: σ2= (sigma squared), Population variance
(X – µ)2 = square of the differences
∑ (X – µ)2 = sum of squared differences
N = number of cases
σ2 = ∑ (X – µ)2
N
=4
7
= 0.57
9
C. STANDARD DEVIATION
If the scores are grouped closely around the mean of a distribution, the
standard deviation of that distribution will be small; but if the scores are spread
out from the mean, the standard deviation will take on a large value. Conversely,
if the standard deviation computed from a set of data is small, the values are
concentrated near the mean; and if the standard deviation is large, the values are
dispersed widely about the mean.
Standard Deviation (SD) is the square root of the variance. This
calculation is useful because it allows for the same flexibility as variance
regarding further calculations and yet also expresses variation in the same units
as the original measurements.
The standard deviation is equal to the principal square root of the
variance.
s= n∑ X2 – (∑ X)2
n(n-1)
Ex: Calculate the sample standard deviation of the following data.
8, 9, 10, 12, 17, 18, 18, 19, 20, 21
X X2
8 64
9 81
10 100
12 144
17 289
18 324
18 324
19 361
20 400
21 441
∑ X = 152 ∑ X2= 2,528
n=10
s= n∑ X2 – (∑ X)2
n(n-1)
= 10 (2,528) – (152)2
10(9)
= 24.18 = 4.92
10
For a population:
σ = ∑ (X – µ)2
N
Ex.: Calculate the Standard deviation for 7 scores.
X X–µ (X – µ)2
5 1 1
3 -1 1
4 0 0
4 0 0
3 -1 1
4 0 0
5 1 1
Mean (µ)= ∑ (X – µ)2 = 4
28/7 = 4
σ= ∑ (X – µ)2
N
= 4
7
= 0.57 = 0.755
3. Two sections have the same mean but the standard deviation of
Section 2 is higher than Section 1. Which of the two sections is more
homogenous?
a. Section 1 c. Both A and B
b. Section 2 d. None of these
C. CORRELATION
Correlation is a measure of relationship between two variables. Coefficient
of correlation determines validity, reliability and objectivity of an examination
prepared. It also indicates the amount of agreement or disagreement between
group of scores, measurements, or individuals. Correlation ranges in value from
+1.00 through 0.00 up to -1.00.
Correlation tests are used to determine how strongly the scores of two
variables are associated or correlated, for instance, whether a correlation exists
between Accounting students’ pre-board examination scores and their scores in
the CPA board examination.
Examples:
Interpretation of ranges:
0.00 No correlation
+0.01-+0.20 Negligible correlation
+0.21-+0.40 Low correlation
+0.41-+0.70 Moderate correlation
+0.71-+0.90 High correlation
+0.91-+0.99 Very high correlation
+1.00 Perfect correlation
Formula:
r= N∑XY - ∑X∑Y
[N∑X2 – (∑X)2][N∑Y2-(∑Y)2]
∑X= sum of X
∑Y = sum of Y
∑X2 = sum of the squares of X
∑Y2 = sum of the squares of Y
Student X Y XY X2 Y2
1 21 27 567 441 729
2 22 28 616 484 784
3 28 27 756 784 729
4 27 10 270 729 100
5 48 30 1440 2304 900
6 22 21 462 484 441
7 27 27 729 729 729
8 6 21 126 36 441
9 11 21 231 121 441
10 12 28 336 144 784
11 16 30 480 256 900
12 28 21 588 784 441
Total 268 291 6601 7296 7419
r= 12(6601)-(268)(291)
[12(7296)-(268)2][12(7419)-(291)2
rho = 1 - 6 ∑D2
N(N2 -1)
where: 1 is constant ∑D2 = sum of D2
6 is constant N = sum of pairs of scores or measures
13
Student X Y Rx Ry D D2
1 21 27 8 6 2 4
2 22 28 6.5 3.5 3 9
3 28 27 2.5 6 -3.5 12.25
4 27 10 4.5 12 -7.5 56.25
5 48 30 1 1.5 -0.5 0.25
6 22 21 6.5 9.5 -3 9
7 27 27 4.5 6 -1.5 2.25
8 6 21 12 9.5 2.5 6.25
9 11 21 11 9.5 1.5 2.25
10 12 28 10 3.5 6.5 42.25
11 16 30 9 1.5 7.5 56.25
12 28 21 2.5 9.5 -7 49
∑D2= 249
rho= 1 - 6(249)
12(122 -1)
rho = 0.13. This means that there is negligible correlation between the
scores of 12 students in English and Science.
D. Distributions of Data
Normal Distribution:
Mean = Median = Mode
Negatively skewed distribution:
Mode > Median > Mean
Positively skewed distribution
Mean > Median > Mode
Stanine scores – standard scores that tell the location of a raw score in a specific
segment in a normal distribution which is divided into 9 segments, numbered from a
low of 1 through 1 high of 9.
Stanine is calculated by first giving each score a ranking from lowest score to highest
score. The scores are organized in the following groups:
Example: There are 100 scores that have been ranked in ascending order as:
5 5 6 8 10 11 12 14 15 15
16 17 17 19 20 20 22 24 24 25
25 25 27 28 28 28 29 29 30 30
30 31 31 32 32 33 33 33 34 34
35 35 35 35 36 36 37 37 37 37
37 38 38 38 38 38 39 39 39 39
40 40 41 41 41 42 42 42 42 42
43 43 43 43 44 44 44 45 45 45
45 46 46 46 46 46 46 46 46 47
47 47 47 47 48 48 49 49 49 50
5-8 has a stanine score of 1. These students have a below average performance.
10-16 has a stanine score of 2. These students have a below average performance.
17-27 has a stanine score of 3. These students have a below average performance.
28-34 has a stanine score of 4. These students have an average performance.
35-39 has a stanine score of 5. These students have an average performance.
40-44 has a stanine score of 6. These students have an average performance.
45-46 has a stanine score of 7. These students have an above average performance.
47-48 has a stanine score of 8. These students have an above average performance.
49-50 has a stanine score of 9. These students have an above average performance.
Percentile Rank – reflect the percentage of students in the norm group surpassed at each
raw score in the distribution.
15
Sample questions:
7. When the distribution is skewed to the right, what kind of test was
administered?
a. Difficult b. Easy c. Moderately difficult d. Partly easy-partly
difficult
11. Precy performed better than 65% of the total number of examinees in the
District Achievement Test. What is her percentile rank?
a. P35 b. P65 c. P66 d. P75
KUDER-RICHARDSON FORMULA 20
FORMULA:
KR-20 = (k/(k-1))*(1-∑pjqj/var)
var – variance of scores for all individuals who took the test
The value for KR-20 ranges from 0 to 1, with higher values indicating
higher reliability.
Between 50 to 80 - moderate
Example:
The results of the test are listed below, with 1 indicating a correct answer
and 0 indicating an incorrect answer.
Student Q1 Q2 Q3 Q4 Q5 Q6 Q7 Total
Correct
1 0 1 1 0 1 1 1 5
2 1 1 1 1 0 0 0 4
3 1 1 1 1 0 1 1 6
4 1 1 0 0 1 1 0 4
5 0 1 1 1 1 0 1 5
6 1 0 1 0 1 1 0 4
7 1 1 0 0 0 0 0 2
8 1 1 0 1 0 1 0 4
9 0 0 1 1 0 0 0 2
10 1 1 1 0 1 0 1 5
p 0.7 0.8 0.7 0.5 0.5 0.5 0.4
q 0.3 0.2 0.3 0.5 0.5 0.5 0.6
pq 0.21 0.16 0.21 0.25 0.25 0.25 0.24
K=7 variance = 1.6556 (Note: Use the formula for sample variance)
∑pq=1.57
= 0.0603
The KR-20 value turns out to be 0.0603. Since the value is extremely low. This
indicates that the test has low reliability. This means the questions may need to
be re-written or rephrased in such a way that the reliability of the test can be
increased.
18
References
Disclaimer
This module is prepared for instructional purposes only based on our course syllabus.
The teacher who prepared this does not claim ownership of this module but patterned
the ideas from different authors.