Statistics in Research 1
Statistics in Research 1
3. Estimate the sample size to be used in the study about the teaching competencies of
elementary school teachers in Metro, Vigan, given the total no. of teachers in the 5
districts. Prepare a table which will also show the number of respondents to be
taken per district.
N = 1,010
N 1,010
n= 2
= 2
=529
1+ Ne 1+1,010 ( 0.03 )
n 529
P= = =0.5238
N 1,010
(∑ )
5
a. xi = (x1+x2+x3+x4+x5)²
l=1
= (7 + 5 + 9 + 8 + 5 )²
= 43²
= 1, 849
5
b . ∑ xi ² = (x1)²+(x2)²+(x3)²+(x4)²+(x5)²
l=1
2 2 2 2
¿ 7 +5 +9 + 8 + 5²
¿ 49+ 25+81+64+ 25
¿ 244
4
c . ∑ xi yi = (x2)(y2)+(x3)(y3)+(x4)(y4)
l =2
= (5)(3)+(9)(8)+(8)(4)
= 15 + 72 + 32
= 119
5
d . ∑ ( yi , xi) = (y1)(x1)+(y2)(x2)+(y3)(x3)+(y4)(x4)+(y5)(x5)
l=1
¿ ( 2 ) ( 7 ) + ( 3 ) ( 5 ) + ( 9 )( 8 )+ ( 4 )( 8 )+ (12 )( 5 )
¿ 14+ 15+72+ 32+ 60
¿ 193
= 5 (7+5+9+8) +(2+3+8+4)
= 5 (29 + 17)
= 5 (46)
= 230
4
b . ∑ c (xiyi ²) = 5 (7)(2)² + (5)(3)² + (9)(8)² + (8)(4)²
l=1
7
5
6 1 0 6
2 0 5 2 2
3 6 0 7 2
4 3 2 3 8 2
6 6 2 3 0 7 2
0 6 0 5 4 5 2
6 5 2 0 6 3 2
7 0 5 3 2 6 0
x
3 4 5 6 7 8 9
b. Construct a frequency distribution selecting an appropriate class size.
Cumulative
Class Intervals Frequency (f) Frequency
(cf)
90-99 4 50 c. Construct a
80-89 8 46 histogram, a
70-79 8 38 frequency
60-69 7 30 polygon and a
50-59 5 23 cumulative
40-49 10 18
30-39 8 8
Σf = 50
polygon frequency graph for these data using the frequency distribution constructed from
letter b.
a. Pie diagram
250,000
200,000
150,000
100,000
50,000
0
No Grade Elementary High School College Academic Not stated
Completed degree
holder
2000 1995
MODULE 2 -MEASURES OF CENTRAL TENDENCY,
DISPERSION, AND POSITION
Self – Test
Give what is asked for in the following problems:
1. The data given below are the weights (lbs) of 15 pupils of Mr. Cruz.
35 38 39 38 39
45 41 47 38 43
40 42 45 50 48
∑ xi= 35 + 45 + 40 + 38 + 41 + 42 + 39 + 47 + 45 + 38 + 38 + 50 + 39
+N 43 + 48 / 15
= 628/15
Mean = 41.87
35, 38, 38, 38, 39, 39, 40, 41, 42, 43, 45,45, 47, 48, 50
Mode
35, 38, 38, 38, 39, 39, 40, 41, 42, 43, 45,45, 47, 48, 50
The mode is 38
Solution:
Σfd (i)
X = assumed mean +
N
73+33(5)
= = 75.35
70
Median
Cumulative Frequency
Class Score Frequency (f)
(cf)
96-100 1 70
91-95 5 69
86-90 10 64
81-85 4 54
76-80 15 50
71-75 14 35
66-70 8 21
61-65 7 13
56-60 2 6
51-55 3 4
46-50 1 1
N = 70
Solution:
( N /2)−cfb( i)
Md = II +
fm
(70 /2)−21(5)
= 70.5 + = 71.5
70
Mode
Consider the table above
Solution:
d2i −1 (5)
Mo = II + = 70.5 + = 69.5
d 1+ d 2 6+(−1)
Standard Deviation:
S . D .=
√ Σ(Xᵢ− X)²
N−1
=
√ (28.9−0)²
10−1
=
9√
28.9 ¿ 1.79
Variance:
Σ ( Xᵢ−X )²
s² =
N −1
(28.9−0)² 2
= =( 1.79 ) =3.21
10−9
S . D .=
√ Σ(Xᵢ− X)²
N−1
=
√ (74.6−0)²
10−1
=
√74.6
9
=2.88
3. As the head of a social service agency, you believe that your staff of 20 social
workers is very much overworked compared to 10 years ago. Th case loads for each
worker are reported below for each of the two years in question.
1990 2000
52 55 42 82
50 49 75 50
57 50 69 52
49 52 65 50
45 59 58 55
65 60 64 65
60 65 69 60
55 68 60 60
42 60 50 60
50 42 60 60
1990 2000
¿ 54.25 ¿
A.D.= Σ∨ Xᵢ− X∨ N ¿ = = 2.71 A.D. = Σ∨ Xᵢ− X∨ N ¿ =
20
60.3
=3.015
20
Σ¿ Xᵢ = 1,085/20 = 54.25 Σ¿ Xᵢ=1,206=60.3
Based on the mean or average deviation calculated, the workload had increased by
10.85%.
b. Compare the standard deviations for the two time periods along with the changes in the
mean. What happened? Why?
1990 2000
S . D .=
√ Σ(Xᵢ− X)²
N−1
S . D .=
√ Σ(Xᵢ− X)²
N−1
¿
√ (54.25−0)²
10−1
=
√
54.25 = 2.46
9 √ (60.3−0)²
10−1
=
√
60.3 = 2.59
9
During the next 10 years, from 1990 to 2000 the workers have increased their workload
and have more varied cases. We can assume that due to increasing population that the
cases increase. As such it can be considered as a factor.
Cumulative
Class Range Frequency Cumulative
Frequency %
93-97 2 55 100.00
88-92 4 53 96.36
83-87 5 49 89.09
78-82 3 44 80
73-77 10 41 74.54
68-72 12 31 56.36
63-67 9 29 52.72
58-62 8 20 36.36
53-57 7 12 21.81
48-52 5 5 9.09
Pn = .50(55) = 27.5
ll = 62.5
F = 20
Class width = 5
( ₚN −F )ᵢ
Pₙ = II +
f
( 27.5−20 ) (5)
= 62.5 +
10
= 62.5 + 3.75
P₅₀ = 66.25
b. Q₁, Q₃ AND QD
Qₙ=II +
[ ( 4)
n
)
N −F ᵢ
f
Where:
Qₙ =
ll = 57.5
f =8
F =12
i=5
[ 13.75−12 ) 5
Q ₁=57.5+
8
[ 13.75−12 ) 5
= 57.5+
8
= 57.5 + 1.093
Q₁ = 58.59
Q₃
¿ II +
[() )
3
n
4
N−F ᵢ
f
[ 41.25−41 ) 5
= 77.5+
3
= 77.92
77.92−58.59
QD =
2
= 9.67
c. D₃ = P₃₀
( ₚN −F )ᵢ
P₃₀ = ll +
f
( 16.5−12 ) (5)
= 57.5 +
10
= 57.5 + 2.25
D₃ = 59.75
MODULE 3 – THE NORMAL DISTRIBUTION
1. Below z= 2.05
The area under the unit normal curve for z=2.05 is approximately 0.9798.
2. Above z=2.52
The area under the unit normal curve for z = 2.52 is approximately
0.9948.
3. Above z= -1.44
The area under the unit normal curve for z = -1.44 is 0.0749.
4. Below z= 1.23
Since we want the area under the curve for a z= 1.23, we subtract the area
to the left of -1.2 from 1 to get the area to the right of -1.2, which is 1 –
0.8849 = 0.1151. Therefore, the area under the unit normal curve for z =
1.23 is approximately 0.1151.
The area under the unit normal curve between z = -1.75 and z = 2.52 is
approximately 0.9546
Using a standard normal distribution table, we find the area to the left of
z=-1.51 is 0.0655 and the area to the left of z=-2.12 is 0.0179. To find the
area between z=-1.51 and z=-2.12, we can subtract the smaller area from
the larger area: 0.0655 – 0.0179 = 0.0476Therefore, the area under the
standard normal curve between z=-1.51 and z=-2.12 is approximately
0.0476.
The area under the unit normal curve between z = 1.25 and z = 0.45 is
approximately 0.2208.
EXERCISE
b. Below 500
e. Above 400
f. Exactly 400