Workbook
WORK BOOK
(ENGINEERING DATA ANALYSIS)
Submitted by:
Reyes Lorwel C.
BSEE
Submitted to:
ENGR. FELIX S. LICAS
Professor
MODULE 1
"INTRODUCTION TO STATISTICS"
Problem No. 1
Direction: Refer to the previous questions regarding the heights of family members.
Instructions:
Select 3 families in your barangay.
Using a tape measure or a meter stick, measure the height of each member
of the family in centimeters. Round off to the nearest centimeter.
Treat each family as one group. List all the raw data
and present it in the best form you can.
Family No.1
Respondent Measurement
Mother 167cm
Father 170cm
Child no.1 155cm
Child no.2 146cm
Child no.3 168cm
Family No.2
Respondent Measurement
Mother 152cm
Father 165cm
Child no.1 171cm
Child no.2 155cm
Family No.3
Respondent Measurement
Mother 155cm
Father 158cm
Child no.1 145cm
Child no.2 163cm
Questions:
Problem No. 2
Instruction: Survey the families on your street or in a specific area and record how many
members each family has.
No. Family’s Name No. of Family Members
1 Tan 5
2 Cu 8
3 Siervo 7
4 Morillo 3
5 Fernandez 6
6 Sosing 5
7 Magluyoan 4
8 Gumarao 4
9 Cabili 2
10 Mapao 4
Questions:
1. What do these numbers represent?
These numbers represent the number of family members in each household surveyed in a
specific area of Victoria.
2. Is this information precise?
Yes, this information is precise. The number of family members is a small natural
number obtained by direct counting, so it can be recorded exactly.
Problem No.3.
Instruction:
Problem No. 4
Instruction:
Problem No. 5
Direction: Conduct a survey in your barangay among people aged 18-22 on how many hours
they spend playing online games.
"DATA COLLECTION"
Problem No. 2
Instruction: Make a survey from 15 students if they are learning from online class/modular-
type learning.
Yes No
1 /
2 /
3 /
4 /
5 /
6 /
7 /
8 /
9 /
10 /
11 /
12 /
13 /
14 /
15 /
Total: Yes = 2, No = 13
Based on the table, most of the students surveyed (13 of 15) say they are not learning
from this blended type of learning.
Problem No. 3.
What is the best method to collect data about the voter’s information?
Problem No. 4.
What is the kind of method used in the scenario: Consider someone on the busy street of a
New York neighborhood asking random people that pass by how many pets they have, then
taking this data and using it to decide if there should be more pet food stores in that area.
Problem No 5.
A company wants to have the feedbacks of their clients to be gathered in order for their
services to be improved. What is the best way to gather data?
12 21 30 19 12 20 24 16 15 20
9 27 23 20 8 28 21 25 7 29
16 26 22 22 17 30 17 5 19 18
Score tally Frequency
1-5 / 1
6-10 /// 3
11-15 /// 3
16-20 /////-///// 10
21-25 /////-// 7
26-30 /////-/ 6
Question:
1. Based on the group frequency data, what score has the most frequency?
Score from 16 to 20 has the most frequency, with a frequency of 10.
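The tally above can be double-checked with a short script. This is only a sketch: the function name `grouped_frequency` and its parameters are illustrative choices, and it assumes integer scores and classes of equal width starting at 1.

```python
from collections import Counter

def grouped_frequency(values, start, width, n_classes):
    """Return (low, high, frequency) for each equal-width class."""
    # Map every value to the index of the class it falls in.
    counts = Counter((v - start) // width for v in values)
    table = []
    for k in range(n_classes):
        low = start + k * width
        high = low + width - 1
        table.append((low, high, counts.get(k, 0)))
    return table

scores = [12, 21, 30, 19, 12, 20, 24, 16, 15, 20,
          9, 27, 23, 20, 8, 28, 21, 25, 7, 29,
          16, 26, 22, 22, 17, 30, 17, 5, 19, 18]

score_table = grouped_frequency(scores, start=1, width=5, n_classes=6)
for low, high, freq in score_table:
    print(f"{low}-{high}: {freq}")
```

The class with the largest count is the one reported in the answer (16-20).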
Problem No. 2
Identify the population and sample in this setting: A factory overseer selects 40 threaded
rods at random from those produced that week at the factory, then she tests their tensile strength.
Answer: The population is the threaded rods produced at the factory that week; the sample is
the 40 threaded rods selected.
Problem No 3
Identify the population and sample in this setting: A researcher conducted an experiment on
a randomly selected group of 50 positive Covid patients.
Answer: The population is all the positive Covid patients; the sample is the 50 patients
selected.
Problem No. 4
Instruction: Construct a pie chart for the following. Show computations of per cent and angle
distribution using the given data on preferred strand of Grade 10 when going to Senior High.
Strand frequency
STEM 14
HUMSS 13
ABM 10
GAS 8
Others 5
[Pie chart: preferred strand of Grade 10 students, with slices for STEM, HUMSS, ABM, GAS and Others]
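The percent and angle of each slice follow from percent = f / N × 100 and angle = f / N × 360°. A minimal sketch of that computation is given below; the helper name `pie_shares` is hypothetical, and the frequencies are the ones listed in the table.

```python
strands = {"STEM": 14, "HUMSS": 13, "ABM": 10, "GAS": 8, "Others": 5}

def pie_shares(freqs):
    """Return {name: (percent, angle_in_degrees)} for a pie chart."""
    total = sum(freqs.values())
    return {name: (100 * f / total, 360 * f / total)
            for name, f in freqs.items()}

strand_shares = pie_shares(strands)
for name, (percent, angle) in strand_shares.items():
    print(f"{name}: {percent:.0f}% , {angle:.1f} degrees")
```

The percents should add up to 100 and the angles to 360, which is a quick check on the arithmetic.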
Problem No. 5
Instruction: Make a survey from 15 students if they are learning from online class/modular-
type learning. Present it with bar graph
Yes No
1 /
2 /
3 /
4 /
5 /
6 /
7 /
8 /
9 /
10 /
11 /
12 /
13 /
14 /
15 /
Total: Yes = 2, No = 13
[Bar graph: number of Yes and No responses]
Problem No. 1
Problem No. 2
Problem No. 3
The painting is 14 inches wide and 12 inches long. What type of data?
Quantitative data
Problem No. 4
Notes from classroom observations. What type of data?
Qualitative data
Problem No. 5
Feedback from a teacher about a student's progress. What type of data?
Qualitative data
Problem No.1
Construct a pie chart for the following. Show computations of percent using the given data
on a family budget.
Item Budget Percent
Food 9000 30
Rent 7500 25
Kids 6000 20
Leisure 1500 5
Savings 3500 12
Gasoline 2500 8
Total=30000 Total=100%
o The pie chart shows that food takes the largest share of the budget and leisure
the smallest.
Problem No. 2
Make a group frequency table on the ages of participants to an event on “Laugh out loud” for
our city. Use an interval 3
16 37 20 21 45 20 22 19 18
43 37 34 35 21 19 38 24 18
31 29 32 27 18 22 23 21 19
37
Ages of the 28 Participants in the “Laugh out loud” Event
Age Tally Frequency
15-17 I 1
18-20 IIIII-III 8
21-23 IIIII-I 6
24-26 I 1
27-29 II 2
30-32 II 2
33-35 II 2
36-38 IIII 4
39-41 - 0
42-44 I 1
45-47 I 1
N=28
The table shows that the 18-20 age group had the most participants in the “Laugh out
loud” event, while the 39-41 age group had no participants.
Problem No. 3
The final grades in Engineering Data Analysis of 80 students at UEP are recorded in
the accompanying table. Construct a relative frequency distribution table.
68 84 75 82 68 90 62 88 76 93
73 79 88 73 60 93 71 59 85 75
61 65 75 87 74 62 95 78 63 72
66 78 82 75 94 77 69 74 68 60
96 78 89 61 75 95 60 79 83 71
79 62 67 97 78 85 76 65 71 75
65 80 73 57 88 78 62 76 53 74
86 67 73 81 72 63 76 75 85 77
PROBLEM No. 4
29 79 75 66 63 58 50 85 81 72
54 42 80 74 68 67 59 48 86 80
60 56 44 78 72 69 64 60 52 88
67 61 55 47 82 71 66 64 62 90
73 65 62 53 46 83 76 70 68 92
Problem No.5
Forbes magazine published data on the best small firms in 2012. These were firms
which have been publicly traded for at least a year, have a stock price of at least $5 per
share, and have reported annual revenue between $5 million and $1 billion. Complete
the frequency distribution of ages of the chief executive officers for the first 60 ranked
firms.
Age Frequency
40-44 3
45-49 11
50-54 13
55-59 16
60-64 10
65-69 6
70-74 1
Answer:
Age Frequency Midpoint Relative frequency Cumulative relative frequency
40-44 3 42 0.05 0.05
45-49 11 47 0.18 0.23
50-54 13 52 0.22 0.45
55-59 16 57 0.27 0.72
60-64 10 62 0.17 0.88
65-69 6 67 0.10 0.98
70-74 1 72 0.02 1.00
Questions:
a. What is the frequency for CEO ages between 54 and 65?
The frequency for CEO ages between 54 and 65 is 16 + 10 = 26, the total of the 55-59
and 60-64 classes.
c. What is the cumulative relative frequency for CEOs younger than 55?
The cumulative relative frequency for CEOs younger than 55 is 0.45.
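The relative and cumulative relative frequencies can be recomputed directly from the class frequencies. The sketch below assumes the seven frequencies from the table; the function name `relative_frequencies` is illustrative. Cumulative values are taken from the running total of the counts, not from the rounded relative frequencies, which avoids accumulated rounding error.

```python
from itertools import accumulate

ceo_freqs = [3, 11, 13, 16, 10, 6, 1]

def relative_frequencies(freqs):
    """Return (relative, cumulative relative) frequency lists."""
    total = sum(freqs)
    rel = [f / total for f in freqs]
    # Running totals of the raw counts, divided by N.
    cum = [c / total for c in accumulate(freqs)]
    return rel, cum

ceo_rel, ceo_cum = relative_frequencies(ceo_freqs)
for r, c in zip(ceo_rel, ceo_cum):
    print(f"{r:.2f}  {c:.2f}")
```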
MODULE 3
Problem No. 1
3 5 6 6 7 9
There are two middle values, so the median is their average:
$Md = \frac{6 + 6}{2} = 6$
Thus, the median is 6.
We can observe that the measurement 6 appeared twice. Therefore, the mode is 6.
Problem No.2
The observations below are the body temperatures in degrees Celsius of five patients
who have fever in ward B of Hospital E. Find the mean body temperature of the
patients.
Patient Temperature (°C)
Dianne 40.5
Lily 38.9
Antonio 41
Catherine 38.8
Will 39.6
Answer:
$\bar{X} = \frac{\sum X}{n} = \frac{198.8}{5}$
$\bar{X} = 39.76$ °C
Problem No.3
Find the mean of the following scores:
22 25 22 20 23 24 23 21 20 20
22 22 20 28 29 30
Answer:
$\bar{X} = \frac{\sum X}{n} = \frac{22+25+22+20+23+24+23+21+20+20+22+22+20+28+29+30}{16} = \frac{371}{16} = 23.188$
$\bar{X} = 23.188$
Problem No 4
The intelligence quotients of 10 boys are recorded. Find the mean, median and mode.
There are 10 data, so the median is the average of the fifth and sixth items in
ascending order:
$Md = \frac{98 + 100}{2} = 99$
We can observe that 88 has appeared two times. Thus, the mode is 88.
Problem No. 5
Alex timed 21 people in the sprint race, to the nearest second: 59, 65, 61, 62, 53, 55,
60, 70, 64, 56, 58, 58, 62, 62, 68, 65, 56, 59, 68, 61, and 67. Find the mean, median and
mode.
53, 55, 56, 56, 58, 58, 59, 59, 60, 61, 61, 62, 62, 62, 64, 65, 65, 67, 68, 68, 70
$\bar{X} = \frac{1289}{21} = 61.38$
There are 21 data so the median is the 11th item in ascending order which is 61.
We can observe that 62 has appeared three times. Thus, the mode is 62.
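As a cross-check on the hand computation, Python's standard `statistics` module gives the same three measures. This is a sketch for verification only; the variable names are illustrative.

```python
import statistics

sprint_times = [59, 65, 61, 62, 53, 55, 60, 70, 64, 56, 58,
                58, 62, 62, 68, 65, 56, 59, 68, 61, 67]

sprint_mean = statistics.mean(sprint_times)      # sum of values / n
sprint_median = statistics.median(sprint_times)  # middle item of the sorted data
sprint_mode = statistics.mode(sprint_times)      # most frequent value

print(round(sprint_mean, 2), sprint_median, sprint_mode)
```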
LESSON 2: MEASURES OF CENTRAL TENDENCY FOR GROUPED DATA
Problem No. 1
Alex timed 21 people in the sprint race, to the nearest second: 59, 65, 61, 62, 53, 55, 60,
70, 64, 56, 58, 58, 62, 62, 68, 65, 56, 59, 68, 61, and 67. Construct a grouped frequency table.
Find the mean, median and mode.
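$Md = 60.5 + 0.9375$
$= 61.4375$ seconds
For the mode:
$Mo = L_{Mo} + \left(\frac{\Delta_1}{\Delta_1 + \Delta_2}\right) i$
$\Delta_1 = 8 - 7 = 1$
$\Delta_2 = 8 - 4 = 4$
$Mo = 60.5 + \left(\frac{1}{1 + 4}\right) 5$
$= 60.5 + 1 = 61.5$ seconds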
Problem No. 2
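A farmer grew fifty baby carrots using special soil. He digs them up, measures their
lengths (to the nearest mm), and groups the results:
$Md = L_{Md} + \left(\frac{\frac{N}{2} - Cf_b}{f_{Md}}\right) i$
$= 169.5 + \left(\frac{\frac{50}{2} - 21}{9}\right) 5$
$= 169.5 + 2.2222$
$= 171.7$ mm
$\Delta_1 = 11 - 9 = 2$
$\Delta_2 = 11 - 6 = 5$
The modal class ($f = 11$) comes immediately after the median class ($f = 9$), so its lower boundary is $169.5 + 5 = 174.5$.
$Mo = 174.5 + \left(\frac{2}{2 + 5}\right) 5$
$= 174.5 + 1.43 = 175.9$ mm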
Problem No. 3
The ages of the 112 people who live on a tropical island are grouped below.
Determine the mean, median and mode.
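$Md = 20 + \left(\frac{\frac{112}{2} - 41}{23}\right) 10$
$= 20 + 6.52 = 26.5$
$\Delta_1 = 23 - 21 = 2$
$\Delta_2 = 23 - 16 = 7$
$Mo = 20 + \left(\frac{2}{2 + 7}\right) 10$
$= 20 + 2.22 = 22.2$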
Problem No. 4
What is the value of mean, median and mode for the data in the following frequency
distribution below?
23
¿ 4.5+
2
15( )
2
¿ 5.42
$\Delta_1 = 15 - 5 = 10$
$\Delta_2 = 15 - 10 = 5$
$Mo = 4.5 + \left(\frac{10}{10 + 5}\right) 2$
$= 4.5 + \left(\frac{2}{3}\right) 2$
$Mo = 5.83$
Problem No. 5
Determine the mean, median and mode for the grouped data given below.
$\Delta_1 = 33 - 15 = 18$
$\Delta_2 = 33 - 24 = 9$
$Mo = 60.5 + \left(\frac{18}{18 + 9}\right) 6$
$= 60.5 + \left(\frac{2}{3}\right) 6$
$Mo = 64.5$
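The grouped-data formulas used in this lesson can be written as two small functions, which makes it easy to recheck each substitution. This is a sketch under the lesson's own definitions: `lower` is the lower class boundary, `cf_before` the cumulative frequency before the class, `d1` and `d2` the differences Δ1 and Δ2, and `width` the class size i. The function names are illustrative.

```python
def grouped_median(lower, n, cf_before, f, width):
    """Md = L + ((N/2 - Cf_b) / f) * i"""
    return lower + (n / 2 - cf_before) / f * width

def grouped_mode(lower, d1, d2, width):
    """Mo = L + (d1 / (d1 + d2)) * i"""
    return lower + d1 / (d1 + d2) * width

# Problem No. 2 (carrots): L = 169.5, N = 50, Cf_b = 21, f = 9, i = 5
print(round(grouped_median(169.5, 50, 21, 9, 5), 2))

# Problem No. 5: L = 60.5, d1 = 18, d2 = 9, i = 6
print(round(grouped_mode(60.5, 18, 9, 6), 2))
```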
LESSON 3: FRACTILES FOR UNGROUPED DATA
Problem No. 1
The following data lists the number of calories in 30 manufacturers of vanilla flavored
ice cream bars. Solve for the 2nd quartile, 6th decile and 80th percentile.
111 132 151 182 197 209 255 295 337 377
126 147 179 185 200 234 286 310 353 377
131 151 180 190 201 255 294 319 365 439
$Q_2$: $\frac{2N}{4} = \frac{2(30)}{4} = 15$th item, which is 201.
$D_6$: $\frac{6N}{10} = \frac{6(30)}{10} = 18$th item, which is 255.
$P_{80}$: $\frac{80N}{100} = \frac{80(30)}{100} = 24$th item, which is 319.
Problem No. 2
Calculate the Q3, D7 and P28 for the following test scores of 10 students.
10 22 24 27 32 36 40 41 50 90
$Q_3$: $\frac{3N}{4} = \frac{3(10)}{4} = 7.5$th item; rounding up to the 8th item gives 41.
$D_7$: $\frac{7N}{10} = \frac{7(10)}{10} = 7$th item, which is 40.
For the 30th percentile:
$P_{30}$: $\frac{30N}{100} = \frac{30(10)}{100} = 3$rd item, which is 24.
Problem No. 3
Determine the 2nd quartile, 8th decile, and 25th percentile of the data given below.
22 30 36 41 53
23 33 36 42 54
24 33 37 49 54
28 35 38 53 56
For the 2nd quartile:
$Q_2$: $\frac{2N}{4} = \frac{2(20)}{4} = 10$th item, which is 36.
$D_8$: $\frac{8N}{10} = \frac{8(20)}{10} = 16$th item, which is 53.
$P_{25}$: $\frac{25N}{100} = \frac{25(20)}{100} = 5$th item, which is 30.
Problem No.4
$Q_2$: $\frac{2N}{4} = \frac{2(10)}{4} = 5$th item, which is 85.
$P_{45}$: $\frac{45N}{100} = \frac{45(10)}{100} = 4.5$th item; rounding up to the 5th item gives 85.
Problem No. 5
87 90 95 96 97 98 98 99
100 100 100 100 100 101 101 102
102 102 103 104 105 107 110
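$Q_1$: $\frac{N}{4} = \frac{23}{4} = 5.75$th item; rounding up to the 6th item gives 98.
$D_5$: $\frac{5N}{10} = \frac{5(23)}{10} = 11.5$th item; rounding up to the 12th item gives 100.
$P_{70}$: $\frac{70N}{100} = \frac{70(23)}{100} = 16.1$th item; rounding up to the 17th item gives 102.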
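The position rule used in these answers (compute kN/parts and, when the result is not a whole number, move up to the next item) can be sketched as below. The function name `fractile` is illustrative, and the rounding-up convention is an assumption read off the worked answers; other textbooks interpolate between neighbouring items instead.

```python
import math

def fractile(data, k, parts):
    """k-th fractile (parts = 4, 10 or 100) using the item at position kN/parts."""
    ordered = sorted(data)
    position = k * len(ordered) / parts
    index = max(math.ceil(position), 1)  # round fractional positions up
    return ordered[index - 1]            # positions are 1-based

calories = [111, 132, 151, 182, 197, 209, 255, 295, 337, 377,
            126, 147, 179, 185, 200, 234, 286, 310, 353, 377,
            131, 151, 180, 190, 201, 255, 294, 319, 365, 439]

print(fractile(calories, 2, 4))     # 2nd quartile
print(fractile(calories, 6, 10))    # 6th decile
print(fractile(calories, 80, 100))  # 80th percentile
```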
Problem No. 1
Calculate the 1st quartile, 8th decile and 65th percentile of the Engineering Data
Analysis test score of 50 students.
Position of $Q_1$: $\frac{N}{4} = \frac{50}{4} = 12.5$th item
Thus, the 1st quartile class is 75 – 79 since it is where the 12.5th item is found.
$Q_1 = L_{Q_1} + \left(\frac{\frac{N}{4} - Cf_b}{f_{Q_1}}\right) i$
$= 74.5 + \left(\frac{\frac{50}{4} - 3}{16}\right) 5$
$= 74.5 + 2.97 = 77.47$
Thus, the 8th decile class is 85 – 89 since it is where the 40th item is found.
$D_8 = L_{D_8} + \left(\frac{\frac{8N}{10} - Cf_b}{f_{D_8}}\right) i$
$= 84.5 + \left(\frac{\frac{8(50)}{10} - 33}{10}\right) 5$
$= 84.5 + 3.5 = 88$
$P_{65} = L_{P_{65}} + \left(\frac{\frac{65N}{100} - Cf_b}{f_{P_{65}}}\right) i$
$P_{65} = 79.5 + \left(\frac{\frac{65(50)}{100} - 19}{14}\right) 5$
$= 79.5 + 4.82 = 84.32$
Problem No. 2
Calculate the 1st quartile, 3rd decile and 50th percentile of the Differential test score of
50 students.
Position of $Q_1$: $\frac{N}{4} = \frac{50}{4} = 12.5$th item
Thus, the 1st quartile class is 30 – 34 since it is where the 12.5th item is found.
$Q_1 = L_{Q_1} + \left(\frac{\frac{N}{4} - Cf_b}{f_{Q_1}}\right) i$
$= 29.5 + \left(\frac{\frac{50}{4} - 8}{9}\right) 5$
$= 29.5 + 2.5 = 32$
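Thus, the 3rd decile class is 30 – 34 since it is where the 15th item is found. The cumulative frequency before this class is 8, the same value used for $Q_1$.
$D_3 = L_{D_3} + \left(\frac{\frac{3N}{10} - Cf_b}{f_{D_3}}\right) i$
$= 29.5 + \left(\frac{\frac{3(50)}{10} - 8}{9}\right) 5$
$= 29.5 + 3.89 = 33.39$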
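$L_{P_{50}} = 34.5$, $f_{P_{50}} = 10$, $Cf_b = 17$, $i = 5$, $N = 50$
$P_{50} = L_{P_{50}} + \left(\frac{\frac{50N}{100} - Cf_b}{f_{P_{50}}}\right) i$
$P_{50} = 34.5 + \left(\frac{\frac{50(50)}{100} - 17}{10}\right) 5$
$= 34.5 + 4 = 38.5$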
Problem No. 3
In a work study investigation, the time taken by 20 men in a firm to do a particular job
were tabulated below. Determine the 2nd quartile, 7th decile, and 30th Percentile.
Time taken 8 – 10 11 – 13 14 – 16 17 – 19 20 – 22 23 – 25
Frequencies 2 4 6 4 3 1
Position of $Q_2$: $\frac{2N}{4} = \frac{2(20)}{4} = 10$th item
Thus, the 2nd quartile class is 14 – 16 since it is where the 10th item is found.
$Q_2 = L_{Q_2} + \left(\frac{\frac{2N}{4} - Cf_b}{f_{Q_2}}\right) i$
$= 13.5 + \left(\frac{\frac{2(20)}{4} - 6}{6}\right) 3$
$= 13.5 + 2 = 15.5$
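Position of $D_6$: $\frac{6N}{10} = \frac{6(20)}{10} = 12$th item
Thus, the 6th decile class is 14 – 16 since it is where the 12th item is found.
$D_6 = L_{D_6} + \left(\frac{\frac{6N}{10} - Cf_b}{f_{D_6}}\right) i$
$= 13.5 + \left(\frac{\frac{6(20)}{10} - 6}{6}\right) 3$
$= 13.5 + \left(\frac{6}{6}\right) 3 = 16.5$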
For the 30th percentile:
Position of $P_{30}$: $\frac{30N}{100} = \frac{30(20)}{100} = 6$th item
Thus, the 30th percentile class is 11 – 13 since it is where the 6th item is found.
$L_{P_{30}} = 10.5$, $f_{P_{30}} = 4$, $Cf_b = 2$, $i = 3$, $N = 20$
$P_{30} = L_{P_{30}} + \left(\frac{\frac{30N}{100} - Cf_b}{f_{P_{30}}}\right) i$
$P_{30} = 10.5 + \left(\frac{\frac{30(20)}{100} - 2}{4}\right) 3$
$= 10.5 + 3 = 13.5$
Problem No. 4
Calculate the 2nd quartile, 7th decile and 45th percentile of the mathematics test score of
50 students.
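Position of $Q_2$: $\frac{2N}{4} = \frac{2(50)}{4} = 25$th item
Thus, the 2nd quartile class is 31-35 since it is where the 25th item is found.
$L_{Q_2} = 30.5$, $N = 50$, $f_{Q_2} = 9$, $i = 5$, $Cf_b = 18$
$Q_2 = L_{Q_2} + \left(\frac{\frac{2N}{4} - Cf_b}{f_{Q_2}}\right) i$
$= 30.5 + \left(\frac{\frac{2(50)}{4} - 18}{9}\right) 5$
$= 30.5 + \left(\frac{7}{9}\right) 5$
$= 30.5 + \frac{35}{9}$
$= 34.39$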
Thus, the 7th decile class is 36 - 40 since it is where the 35th item is found.
$D_7 = L_{D_7} + \left(\frac{\frac{7N}{10} - Cf_b}{f_{D_7}}\right) i$
$= 35.5 + \left(\frac{\frac{7(50)}{10} - 27}{11}\right) 5$
$= 35.5 + \frac{40}{11}$
$= 39.14$
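$L_{P_{33}} = 25.5$, $f_{P_{33}} = 12$, $Cf_b = 6$, $i = 5$, $N = 50$
$P_{33} = L_{P_{33}} + \left(\frac{\frac{33N}{100} - Cf_b}{f_{P_{33}}}\right) i$
$P_{33} = 25.5 + \left(\frac{\frac{33(50)}{100} - 6}{12}\right) 5$
$= 25.5 + \left(\frac{7}{8}\right) 5$
$= 29.88$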
Problem No. 5
The airborne speeds in kilometers per hour of 26 planes are shown below. Find the 1st
quartile, 6th decile and 95th percentile.
Position of $Q_1$: $\frac{N}{4} = \frac{26}{4} = 6.5$th item
Thus, the 1st quartile class is 406 – 425 since it is where the 6.5th item is found.
$Q_1 = 405.5 + \left(\frac{1}{6}\right) 20$
$= 405.5 + \frac{10}{3}$
$= 408.83$
Thus, the 6th decile class is 486 – 505 since it is where the 15.6th item is found.
$D_6 = L_{D_6} + \left(\frac{\frac{6N}{10} - Cf_b}{f_{D_6}}\right) i$
$= 485.5 + \left(\frac{\frac{6(26)}{10} - 14}{3}\right) 20$
$= 485.5 + \left(\frac{8}{15}\right) 20$
$= 485.5 + \frac{32}{3}$
$= 496.17$
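$L_{P_{95}} = 525.5$, $f_{P_{95}} = 5$, $Cf_b = 21$, $i = 20$, $N = 26$
$P_{95} = L_{P_{95}} + \left(\frac{\frac{95N}{100} - Cf_b}{f_{P_{95}}}\right) i$
$P_{95} = 525.5 + \left(\frac{\frac{95(26)}{100} - 21}{5}\right) 20$
$= 525.5 + \left(\frac{37}{50}\right) 20 = 525.5 + \frac{74}{5} = 540.3$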
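The grouped fractile formula, F = L + ((kN/parts − Cf_b) / f) · i, can be applied mechanically once the frequency table is known. The sketch below uses the work-study table from Problem No. 3 (classes 8–10 to 23–25). The function name `grouped_fractile` is illustrative, and it assumes integer class limits, so each lower boundary is the lower limit minus 0.5.

```python
def grouped_fractile(lower_limits, freqs, width, k, parts):
    """Fractile of grouped data: L + ((kN/parts - Cf_b) / f) * i."""
    n = sum(freqs)
    target = k * n / parts          # position of the required item
    cf_before = 0                   # cumulative frequency before the class
    for low, f in zip(lower_limits, freqs):
        if f > 0 and cf_before + f >= target:
            boundary = low - 0.5    # assumes integer class limits
            return boundary + (target - cf_before) / f * width
        cf_before += f
    raise ValueError("fractile position lies outside the table")

time_limits = [8, 11, 14, 17, 20, 23]
time_freqs = [2, 4, 6, 4, 3, 1]

print(grouped_fractile(time_limits, time_freqs, 3, 2, 4))     # Q2
print(grouped_fractile(time_limits, time_freqs, 3, 30, 100))  # P30
```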
Module 4
"Measures of Variation"
LESSON 1: RANGE
Problem No.1
Given the measurements 20, 26, 40, 39, 25, 36, 21, 34, 33, and 37. Find the range.
R = Highest observation − Lowest observation
Answer: R = 40 − 20 = 20
Problem No. 2
A group of adventurers went to the mountain range in Sierra Madre, Philippines to enjoy the
great view and to explore the mountain in the area. The ages of the adventurers are 34, 30, 27,
50, 45, 35, 38, 47, 52, 31, 38, and 40. What is the range of their ages?
Answer: R = 52 − 27 = 25
Problem No. 3
Answer:
Problem No. 4
You take 7 statistics tests over the course of a semester. You score 94, 88, 73, 84, 91,
87, and 79. What is the range of your scores?
Range = 94 – 73 = 21
The range is 21 which is the difference between the highest and lowest observation.
Problem No. 5
The range is 675 which is the difference between the highest and lowest observation.
Problem No. 1
Calculate the mean deviation about the mean for the following data.
Problem No.2
In a pancake eating competition, the number of pancakes eaten by seven contestants in
an hour is as follows:
12, 18, 21, 26, 17, 20, 18
Answer:
Score (x) $|x - \bar{x}|$
12 6.86
18 0.86
21 2.14
26 7.14
17 1.86
20 1.14
18 0.86
$\sum x = 132$, $\sum |x - \bar{x}| = 20.86$
$\bar{X} = \frac{\sum X}{n} = \frac{132}{7} = 18.86$
Mean deviation: $MAD = \frac{\sum |x - \bar{x}|}{n} = \frac{20.86}{7} = 2.98$
- The mean deviation of the number of pancakes eaten by the seven contestants is 2.98.
Problem No. 3
Find the mean deviation, standard deviation and variance, coefficient of variation,
skewness and kurtosis of the sample observations 2, 5, 7, 9 and 12.
$X$  $X - \bar{X}$  $|X - \bar{X}|$  $(X - \bar{X})^2$  $(X - \bar{X})^3$  $(X - \bar{X})^4$
2 2 – 7 = -5 5 25 -125 625
5 5 – 7 = -2 2 4 -8 16
7 7 – 7 = 0 0 0 0 0
9 9 – 7 = 2 2 4 8 16
12 12 – 7 = 5 5 25 125 625
$\sum |X - \bar{X}| = 14$, $\sum (X - \bar{X})^2 = 58$, $\sum (X - \bar{X})^3 = 0$, $\sum (X - \bar{X})^4 = 1282$
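For Mean Deviation:
$MD = \frac{\sum |X - \bar{X}|}{n} = \frac{14}{5} = 2.8$
For Standard Deviation:
$S = \sqrt{\frac{\sum (X - \bar{X})^2}{n - 1}} = \sqrt{\frac{58}{5 - 1}} = 3.81$
For Variance:
$S^2 = \frac{\sum (X - \bar{X})^2}{n - 1} = \frac{58}{5 - 1} = \frac{58}{4} = 14.5$
For Coefficient of Variation:
$V = \frac{S}{\bar{X}} (100) = \frac{3.81}{7} (100) = 54.43\%$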
Problem No. 4
Given the values 4, 4, 6, 7 and 9. Compute the mean deviation, standard deviation and
variance, coefficient of variation, skewness and kurtosis.
$X$  $X - \bar{X}$  $|X - \bar{X}|$  $(X - \bar{X})^2$  $(X - \bar{X})^3$  $(X - \bar{X})^4$
4 4 – 6 = -2 2 4 -8 16
4 4 – 6 = -2 2 4 -8 16
6 6 – 6 = 0 0 0 0 0
7 7 – 6 = 1 1 1 1 1
9 9 – 6 = 3 3 9 27 81
$\sum |X - \bar{X}| = 8$, $\sum (X - \bar{X})^2 = 18$, $\sum (X - \bar{X})^3 = 12$, $\sum (X - \bar{X})^4 = 114$
For Mean Deviation:
$MD = \frac{\sum |X - \bar{X}|}{n} = \frac{8}{5} = 1.6$
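For Standard Deviation:
$S = \sqrt{\frac{\sum (X - \bar{X})^2}{n - 1}} = \sqrt{\frac{18}{5 - 1}} = 2.12$
For Variance:
$S^2 = \frac{\sum (X - \bar{X})^2}{n - 1} = \frac{18}{5 - 1} = \frac{18}{4} = 4.5$
For Coefficient of Variation:
$V = \frac{S}{\bar{X}} (100) = \frac{2.12}{6} (100) = 35.33\%$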
Problem No. 5
Compute the average deviation for the age at which men in a Chataqua bowling club
scored their first game over 175. Solve also for the standard deviation, variance, coefficient
of variation, skewness and kurtosis.
$X$  $X - \bar{X}$  $|X - \bar{X}|$  $(X - \bar{X})^2$  $(X - \bar{X})^3$  $(X - \bar{X})^4$
29 29 – 51 = -22 22 484 -10648 234256
36 36 – 51 = -15 15 225 -3375 50625
42 42 – 51 = -9 9 81 -729 6561
48 48 – 51 = -3 3 9 -27 81
49 49 – 51 = -2 2 4 -8 16
56 56 – 51 = 5 5 25 125 625
59 59 – 51 = 8 8 64 512 4096
62 62 – 51 = 11 11 121 1331 14641
64 64 – 51 = 13 13 169 2197 28561
65 65 – 51 = 14 14 196 2744 38416
$\sum |X - \bar{X}| = 102$, $\sum (X - \bar{X})^2 = 1378$, $\sum (X - \bar{X})^3 = -7878$, $\sum (X - \bar{X})^4 = 377878$
For Mean Deviation:
$MD = \frac{\sum |X - \bar{X}|}{n} = \frac{102}{10} = 10.2$
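For Standard Deviation:
$S = \sqrt{\frac{\sum (X - \bar{X})^2}{n - 1}} = \sqrt{\frac{1378}{10 - 1}} = 12.37$
For Variance:
$S^2 = \frac{\sum (X - \bar{X})^2}{n - 1} = \frac{1378}{10 - 1} = \frac{1378}{9} = 153.11$
For coefficient of variation:
$V = \frac{S}{\bar{X}} (100) = \frac{12.37}{51} (100) = 24.25\%$
For the skewness:
$Skewness = \frac{\sum (X - \bar{X})^3}{(n - 1)(S^3)} = \frac{-7878}{(10 - 1)(12.37)^3} \approx -0.46$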
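The column sums and the measures built from them are easy to mistype, so a short script is a useful check. This is a sketch: the function name `dispersion` is illustrative, the skewness follows the formula written above, Σ(X − X̄)³ / ((n − 1)S³), and the kurtosis line assumes the analogous form Σ(X − X̄)⁴ / ((n − 1)S⁴), which the workbook does not state explicitly.

```python
import math

def dispersion(data):
    """Mean deviation, sample variance, SD, CV (%), skewness and kurtosis."""
    n = len(data)
    mean = sum(data) / n
    dev = [x - mean for x in data]
    md = sum(abs(d) for d in dev) / n
    var = sum(d ** 2 for d in dev) / (n - 1)
    sd = math.sqrt(var)
    return {
        "mean": mean,
        "md": md,
        "variance": var,
        "sd": sd,
        "cv": 100 * sd / mean,
        "skewness": sum(d ** 3 for d in dev) / ((n - 1) * sd ** 3),
        # Assumed form, by analogy with the skewness formula above.
        "kurtosis": sum(d ** 4 for d in dev) / ((n - 1) * sd ** 4),
    }

bowling_ages = [29, 36, 42, 48, 49, 56, 59, 62, 64, 65]
bowling_stats = dispersion(bowling_ages)
for name, value in bowling_stats.items():
    print(f"{name}: {value:.2f}")
```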
Problem No. 1
Get the quartile deviation of the observation below in ascending order: 70, 76, 80, 83, 85,
85, 95, 96, 100, 110
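Position of $Q_1$: $\frac{N + 1}{4} = \frac{11}{4} = 2.75$th item
$Q_1 = 76 + 0.75(80 - 76) = 76 + 3$
$= 79$
Position of $Q_3$: $\frac{3(N + 1)}{4} = \frac{3(11)}{4} = 8.25$th item
$Q_3 = 96 + 0.25(100 - 96) = 96 + 1$
$= 97$
$Q = \frac{Q_3 - Q_1}{2} = \frac{97 - 79}{2} = \frac{18}{2} = 9$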
Problem No. 2
The age at which men in a Chataqua bowling club scored their first game over 175 are
recorded below in order. Get the quartile deviation
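Position of $Q_1$: $\frac{N + 1}{4} = \frac{11}{4} = 2.75$th item
$Q_1 = 36 + 4.5$
$= 40.5$
Position of $Q_3$: $\frac{3(N + 1)}{4} = \frac{3(11)}{4} = 8.25$th item
$Q_3 = 62 + 0.5$
$= 62.5$
$Q = \frac{Q_3 - Q_1}{2} = \frac{62.5 - 40.5}{2} = \frac{22}{2} = 11$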
Problem No. 3
Get the quartile deviation of the sample observations 85, 96, 76, 108, 85, 80, 100, 85,
70, 95, 106, 70, 99, 79, 88.
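In order: 70 70 76 79 80 85 85 85 88 95 96 99 100 106 108
Position of $Q_1$: $\frac{N + 1}{4} = \frac{16}{4} = 4$th item, so $Q_1 = 79$
Position of $Q_3$: $\frac{3(N + 1)}{4} = \frac{3(16)}{4} = 12$th item, so $Q_3 = 99$
$Q = \frac{Q_3 - Q_1}{2} = \frac{99 - 79}{2} = \frac{20}{2} = 10$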
Problem No. 4
Harry Ltd. is a textile manufacturer. They want to know how spread out their production
is. Use the quartile deviation formula to help the management find the dispersion with the
data collected for the last 10 days per employee.
140, 145, 150, 155, 156, 169, 175, 177, 188, 190
Position of $Q_1$: $\frac{N + 1}{4} = \frac{11}{4} = 2.75$th item
$Q_1 = 145 + 0.75(150 - 145) = 145 + 3.75$
$= 148.75$
Position of $Q_3$: $\frac{3(N + 1)}{4} = \frac{3(11)}{4} = 8.25$th item
$Q_3 = 177 + 0.25(188 - 177) = 177 + 2.75$
$= 179.75$
$Q = \frac{Q_3 - Q_1}{2} = \frac{179.75 - 148.75}{2} = \frac{31}{2} = 15.5$
Problem No. 5
Solve the quartile deviation of the sample observations 10, 3, 13, 11, 15, 5, 4, 2, 3, 2.
In order: 2 2 3 3 4 5 10 11 13 15
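Position of $Q_1$: $\frac{N + 1}{4} = \frac{11}{4} = 2.75$th item
$Q_1 = 2 + 0.75(3 - 2)$
$= 2 + 0.75$
$= 2.75$
Position of $Q_3$: $\frac{3(N + 1)}{4} = \frac{3(11)}{4} = 8.25$th item
$Q_3 = 11 + 0.25(13 - 11) = 11 + 0.5$
$= 11.5$
$Q = \frac{Q_3 - Q_1}{2} = \frac{11.5 - 2.75}{2} = \frac{8.75}{2} = 4.375$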
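The quartile deviation steps in this lesson (locate the (N + 1)/4 and 3(N + 1)/4 positions, interpolate between neighbouring items, then halve the difference) can be sketched as follows. The helper names `value_at` and `quartile_deviation` are illustrative, and the sketch assumes the data set is large enough that both positions fall inside the list.

```python
def value_at(ordered, position):
    """Value at a 1-based, possibly fractional, position with linear interpolation."""
    whole = int(position)
    frac = position - whole
    lower = ordered[whole - 1]
    if frac == 0:
        return lower
    upper = ordered[whole]
    return lower + frac * (upper - lower)

def quartile_deviation(data):
    """Return (Q1, Q3, Q) using the (N + 1)/4 position rule."""
    ordered = sorted(data)
    n = len(ordered)
    q1 = value_at(ordered, (n + 1) / 4)
    q3 = value_at(ordered, 3 * (n + 1) / 4)
    return q1, q3, (q3 - q1) / 2

print(quartile_deviation([10, 3, 13, 11, 15, 5, 4, 2, 3, 2]))
print(quartile_deviation([70, 76, 80, 83, 85, 85, 95, 96, 100, 110]))
```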
Lesson 4: Coefficient of Variation
Problem No. 1
The following table gives the values of mean and variance of heights and weights of the 10th
standard students of a school.
Answer:
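Convert the variance $\sigma^2$ to the standard deviation $\sigma$:
$\sigma = \sqrt{72.25} = 8.5$
$CV = \frac{8.5}{155} \times 100\% = 5.48\%$ (height)
$\sigma = \sqrt{28.09} = 5.3$
$CV = \frac{5.3}{46.50} \times 100\% = 11.40\%$ (weight)
- The weight of the students is more varied than the height, with a coefficient of
variation of 11.40%.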
Problem No.2
Given the mean and standard deviation of the consumption of number of banana and
apple on one family in a week.
Apple Banana
MEAN 4.29 4.29
STANDARD DEVIATION 1.01 2.81
Answer:
Apple: $CV = \frac{1.01}{4.29} \times 100\% = 23.54\%$
Banana: $CV = \frac{2.81}{4.29} \times 100\% = 65.5\%$
- This shows that the consumption of bananas is more varied than that of apples.
Problem No.3
The standard deviation and mean of a data are 6.5 and 12.5 respectively. Find the coefficient
of variation.
Answer:
$CV = \frac{6.5}{12.5} \times 100\%$
$CV = 0.52 \times 100\%$
$CV = 52\%$
Problem No.4-5
$CV = \frac{5.86}{11.48} \times 100\%$
$CV = 0.5105 \times 100\%$
$CV = 51.05\%$
Problem No.1
o $PR = P_{90} - P_{10}$
Problem No.2
Calculate the percentile range of the runs scored by a batsman in last 20 matches:
34 39 63 64 67 70 75 76 81 82 84 85 86 88 89 90 90 96 96 100
Answer:
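Position of $P_{90}$: $\frac{90N}{100} = \frac{90(20)}{100} = 18$th item, so $P_{90} = 96$
Position of $P_{10}$: $\frac{10N}{100} = \frac{10(20)}{100} = 2$nd item, so $P_{10} = 39$
$PR = P_{90} - P_{10} = 96 - 39$
$PR = 57$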
Problem No. 3
Calculate the percentile range of the scores of students in a Post-test examination. The scores
are follows:
77 56 89 90 76 72 65 92 83 84 71 94 64
64 80 86 74 90 64 88
Answer:
56 64 64 64 65 71 72 74 76 77 80 83 84 86 88
89 90 90 92 94
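Position of $P_{90}$: $\frac{90N}{100} = \frac{90(20)}{100} = 18$th item, so $P_{90} = 90$
Position of $P_{10}$: $\frac{10N}{100} = \frac{10(20)}{100} = 2$nd item, so $P_{10} = 64$
$PR = P_{90} - P_{10} = 90 - 64$
$PR = 26$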
Problem No. 4
Calculate the percentile range of the weight of students in section A. The following data are
weights:
56 49 70 63 58 62 67 51 69 72 64
67 61 54 59
Answer:
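49 51 54 56 58 59 61 62 63 64
67 67 69 70 72
Position of $P_{90}$: $\frac{90N}{100} = \frac{90(15)}{100} = 13.5$th item; rounding up to the 14th item gives $P_{90} = 70$
Position of $P_{10}$: $\frac{10N}{100} = \frac{10(15)}{100} = 1.5$th item; rounding up to the 2nd item gives $P_{10} = 51$
$PR = P_{90} - P_{10} = 70 - 51$
$PR = 19$
-The percentile range of the weights is 19.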
Problem No. 5
Calculate the percentile range of frequency distribution table:
Class Interval Frequency Midpoint (x) fx Cumulative Frequency (cf)
54-57 3 55.5 166.5 3
58-61 2 59.5 119 5
62-65 11 63.5 698.5 16
66-69 12 67.5 810 28
70-73 9 71.5 643.5 37
74-77 8 75.5 604 45
78-81 4 79.5 318 49
82-85 1 83.5 83.5 50
Answer:
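Position of $P_{90}$: $\frac{90N}{100} = \frac{90(50)}{100} = 45$th item; $L_{P_{90}} = 77.5$, $cf_b = 45$, $i = 4$, $f_{P_{90}} = 4$
$P_{90} = L_{P_{90}} + \left[\frac{\frac{90N}{100} - cf_b}{f}\right] \times i$
$= 77.5 + \left[\frac{45 - 45}{4}\right] \times 4 = 77.5$
Position of $P_{10}$: $\frac{10N}{100} = \frac{10(50)}{100} = 5$th item; $L_{P_{10}} = 61.5$, $cf_b = 5$, $i = 4$, $f_{P_{10}} = 11$
$P_{10} = L_{P_{10}} + \left[\frac{\frac{10N}{100} - cf_b}{f}\right] \times i$
$= 61.5 + \left[\frac{5 - 5}{11}\right] \times 4 = 61.5$
$PR = P_{90} - P_{10} = 77.5 - 61.5$
$PR = 16$
-The percentile range of the given frequency distribution is 16.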
MODULE 5
PROBABILITIES
Lesson 1: Sample space and event
Problem No.1
There are 6! permutations of the 6 letters of the word ”square.” In how many of them is r the
second letter?
Solution
Let r be the second letter. Then there are 5 ways to fill the first spot, 4 ways to fill the third, 3
to fill the fourth, and so on. There are 5! = 120 such permutations.
Problem No.2
Five different books are on a shelf. In how many different ways could you arrange them?
Solution
The five books can be arranged in 5·4·3·2·1 = 5! = 120 ways
Problem No. 3
Two coins are tossed, find the probability that two heads are obtained.
Problem No.4
Two dice are rolled, find the probability that the sum is equal to 1.
A die is rolled and a coin is tossed, find the probability that the die shows an odd number
and the coin shows a head.
Let H be the head and T be the tail of the coin. The sample space S of this experiment
is as follows:
S = { (1,H),(2,H),(3,H),(4,H),(5,H),(6,H),(1,T),(2,T),(3,T),(4,T),(5,T),(6,T)}
Let E be the event "the die shows an odd number and the coin shows a head". Event E
may be described as follows:
E={(1,H),(3,H),(5,H)}
The probability P(E) is given by
P(E) = n(E) / n(S) = 3 / 12 = 1 / 4
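The same count can be reproduced by listing the sample space and filtering the event. The sketch below is only an illustration of P(E) = n(E) / n(S); the variable names are arbitrary.

```python
from fractions import Fraction
from itertools import product

# Every (die face, coin side) pair: the sample space S.
sample_space = list(product(range(1, 7), "HT"))

# Event E: the die shows an odd number and the coin shows a head.
event = [(die, coin) for die, coin in sample_space
         if die % 2 == 1 and coin == "H"]

p_event = Fraction(len(event), len(sample_space))
print(len(event), len(sample_space), p_event)
```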
Problem No. 3.
QUESTION: A bag contains fifteen balls distinguishable only by their colours; ten are blue
and five are red. I reach into the bag with both hands and pull out two balls (one with each
hand) and record their colours.
(a) What is the random phenomenon? (b) What is the sample space? (c) Express the event
that the ball in my left hand is red as a subset of the sample space.
SOLUTION:
(a) The random phenomenon is (or rather the phenomena are) the colours of the two balls. (b)
The sample space is the set of all possible colours for the two balls, which is {(B,B),(B,R),
(R,B),(R,R)}. (c) The event is the subset {(R,B),(R,R)}.
Problem No. 4.
QUESTION: M&M sweets are of varying colours and the different colours occur in different
proportions. The table below gives the probability that a randomly chosen M&M has each
colour, but the value for tan candies is missing.
Colour Brown Red Yellow Green Orange Tan
Probability 0.3 0.2 0.2 0.1 0.1 ?
(a) What value must the missing probability be? (b) You draw an M&M at random from a
packet. What is the probability of each of the following events? i. You get a brown one or a
red one. ii. You don’t get a yellow one. iii. You don’t get either an orange one or a tan one.
iv. You get one that is brown or red or yellow or green or orange or tan.
SOLUTION:
(a) The probabilities must sum to 1.0. Therefore, the answer is 1−0.3−0.2−0.2−0.1−0.1 =
1−0.9 = 0.1.
(b) Simply add and subtract the appropriate probabilities.
i. 0.3+0.2 = 0.5, since it can’t be brown and red simultaneously (the events are incompatible).
ii. 1−P(yellow) = 1−0.2 = 0.8.
iii. 1−P(orange or tan) = 1−P(orange)−P(tan) = 1−0.1−0.1 = 0.8 (since orange and tan
are incompatible events).
iv. This must happen; the probability is 1.0.
Problem No.5.
QUESTION: You consult Joe the bookie as to the form in the 2.30 at Ayr. He tells you that,
of 16 runners, the favourite has probability 0.3 of winning, two other horses each have
probability 0.20 of winning, and the remainder each have probability 0.05 of winning,
excepting Desert Pansy, which has a worse than no chance of winning. What do you think of
Joe’s advice?
SOLUTION: Assume that the sample space consists of a win for each of the 16 different
horses. Joe’s probabilities for these sum to 1.3 (rather than unity), so Joe is incoherent, albeit
profitable! Additionally, even “Dobbin” has a non-negative probability of winning.
LESSON 3. PROBABILITY DISTRIBUTION
Problem No. 1
A fair coin is tossed twice. Let X be the number of heads that are observed.
X 0 1 2
P(X) 0.25 0.50 0.25
Find the probability that at least one head is observed.
P(X ≥ 1) = P (1) + P (2) = 0.50 + 0.25 = 0.75
Problem No. 2
A pair of fair dice is rolled. Let X denote the sum of the number of dots on the top faces.
Find P(X ≥ 9)
Problem No. 3
A service organization in a large town organizes a raffle each month. One thousand
raffle tickets are sold for $1 each. Each has an equal chance of winning. First prize is
$300, second prize is $200, and third prize is $100. Let X denote the net gain from the
purchase of one ticket.
Problem No. 4
A life insurance company will sell a $200,000 one-year term life insurance policy to an
individual in a particular risk group for a premium of $195. Find the expected value to the
company of a single policy if a person in this risk group has a 99.97% chance of surviving
one year.
Problem No. 5
Question:. In the Arizona lottery called Pick 3, a player pays $1 and then picks a three-digit
number. If those three numbers are picked in that specific order the person wins $500. What
is the expected value in this game?
Solution: To find the expected value, you need to first create the probability distribution. In
this case, the random variable x = winnings. If you pick the right numbers in the right order,
then you win $500, but you paid $1 to play, so you actually win $499. If you didn’t pick the
right numbers, you lose the $1, the x value is −$1. You also need the probability of winning
and losing. Since you are picking a three-digit number, and for each digit there are 10
numbers you can pick with each independent of the others, you can use the multiplication
rule. To win, you have to pick the right numbers in the right order. The first digit, you pick 1
number out of 10, the second digit you pick 1 number out of 10, and the third digit you pick 1
number out of 10. The probability of picking the right number in the right order is 1/10 ×
1/10 × 1/10 = 1/1000 = 0.001. The probability of losing (not winning) would be 1 − 1/1000 =
999/1000 = 0.999. Putting this information into a table will help to calculate the expected
value.
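With the two outcomes and their probabilities in hand, the expected value is the sum of each outcome times its probability. The following sketch uses exact fractions; the function name `expected_value` and the list `pick3` are illustrative.

```python
from fractions import Fraction

def expected_value(distribution):
    """E(X) = sum of x * P(x) over all outcomes."""
    return sum(x * p for x, p in distribution)

# (net winnings, probability) for one Pick 3 ticket
pick3 = [(499, Fraction(1, 1000)),   # win $500 minus the $1 paid
         (-1, Fraction(999, 1000))]  # lose the $1 paid

pick3_ev = expected_value(pick3)
print(pick3_ev, float(pick3_ev))
```

A negative expected value means that, on average, the player loses money on each ticket.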
LESSON 4. DEPENDENT AND INDEPENDENT EVENTS
Problem No. 1
A purse contains four $5 bills, five $10 bills and three $20 bills. Two bills are selected
without the first selection being replaced. Find P($5, then $5).
Solution:
There are four $5 bills.
There are a total of twelve bills.
P ($5) = 4/12
The result of the first draw affected the probability of the second draw.
P ($5, then $5) = P ($5) · P ($5 after $5) = (4/12) X (3/11) = 1/11
Problem No. 2
Solution:
Problem No. 3
Two sets of cards with a letter on each card as follows are placed into separate bags.
Sara randomly picked one card from each bag. Find the probability that she picked the letters
‘J’ and ‘R’.
Solution: Probability that she picked J and R = $\frac{1}{5} \times \frac{1}{6} = \frac{1}{30}$
Problem No. 4
When we have just got 6 heads in a row, what is the probability that the next toss is
also a head?
Problem No. 5.
A bag contains 6 red, 5 blue and 4 yellow marbles. Two marbles are drawn, but the first
marble drawn is not replaced. Find P(red, then blue).
P(red, then blue) = P(red) · P(blue after red) = 6/15 x 5/14 = 1/7
The probability of drawing a red marble and then a blue marble is 1/7
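For draws without replacement, the second probability is computed from what remains after the first draw. A small sketch of that multiplication rule is given below; the function name `p_then` and its `same_group` flag are hypothetical conveniences, not part of the lesson.

```python
from fractions import Fraction

def p_then(first, second, total, same_group):
    """P(first kind, then second kind) when drawing twice without replacement."""
    p_first = Fraction(first, total)
    # If both draws come from the same group, one item of it is already gone.
    remaining = second - 1 if same_group else second
    return p_first * Fraction(remaining, total - 1)

print(p_then(4, 4, 12, same_group=True))    # $5, then $5
print(p_then(6, 5, 15, same_group=False))   # red, then blue
```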
LESSON 5. PROBABILITY AND COMBINATORIAL ANALYSIS
Problem No. 1.
In a lottery you have to guess 6 out of 49 numbers. What is the probability that you
get all of them right? If you submit 100 guesses every week, how long on average will it take you
to win?
There are 49C6 = 13,983,816 possible outcomes of the lottery, so the probability of getting the
right solution is 1 / 49C6 = 0.000000072.
On average it will also take 13,983,816 attempts to win. If we submit 100 guesses every week
this corresponds to 139,838 weeks, which is the same as 2,689 years.
Problem No. 2.
In order to compute the probability, we need to count the total number of ways six
numbers can be drawn, and the number of ways the six numbers on the player’s ticket could
match the six numbers drawn from the machine.
Since there is no stipulation that the numbers be in any particular order, the number of
possible outcomes of the lottery drawing is
48C6 = 12,271,512.
Of these possible outcomes, only one would match all six numbers on the player’s
ticket, so the probability of winning the grand prize is (6C6)/(48C6) = 1/12,271,512 ≈ 0.0000000815
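The combination counts in Problems 1 and 2 can be confirmed with `math.comb` from the Python standard library (available from Python 3.8). The sketch below only rechecks the arithmetic; the variable names are illustrative.

```python
import math

outcomes_49 = math.comb(49, 6)   # 6 numbers chosen from 49
outcomes_48 = math.comb(48, 6)   # 6 numbers chosen from 48

p_win_49 = 1 / outcomes_49
p_win_48 = math.comb(6, 6) / outcomes_48

weeks = outcomes_49 / 100        # 100 guesses per week
years = weeks / 52

print(outcomes_49, p_win_49)
print(outcomes_48, p_win_48)
print(round(weeks), round(years))
```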
Problem No. 3.
Compute the probability of randomly drawing five cards from a deck and getting
exactly two Aces.
Problem No. 4.
Compute the probability of randomly drawing five cards from a deck and getting exactly
one Ace.
Problem No. 5.
Compute the probability of randomly drawing five cards from a deck of cards and getting
three Aces and two Kings.
"NORMAL DISTRIBUTION"
Lesson 1: Z-Score
Problem No. 1.
Convert the following scores to z-scores, where µ= 75 and 𝜎= 5
a. 75
b. 80
c. 58
Solution
a. x = 75
$z = \frac{75 - 75}{5} = \frac{0}{5} = 0$
b. x = 80
$z = \frac{80 - 75}{5} = \frac{5}{5} = 1$
c. x = 58
$z = \frac{58 - 75}{5} = \frac{-17}{5} = -3.4$
Problem No. 2.
Convert the following scores to z-scores, where µ= 100 and 𝜎= 5
a. 110
b. 101
c. 95
Solution
a. x = 110
$z = \frac{110 - 100}{5} = \frac{10}{5} = 2$
b. x = 101
$z = \frac{101 - 100}{5} = \frac{1}{5} = 0.2$
c. x = 95
$z = \frac{95 - 100}{5} = \frac{-5}{5} = -1$
Problem No.3.
Convert the following scores to z-scores, where µ= 25 and 𝜎= 2.75
a. 31
b. 16
c. 26.5
Solution
a. x = 31
$z = \frac{31 - 25}{2.75} = \frac{6}{2.75} = 2.18$
b. x = 16
$z = \frac{16 - 25}{2.75} = \frac{-9}{2.75} = -3.27$
c. x = 26.5
$z = \frac{26.5 - 25}{2.75} = \frac{1.5}{2.75} = 0.55$
Problem No. 4
Convert the following scores to z-scores, where µ= 50 and 𝜎= 5
a. 60
b. 49.5
c. 51.5
Solution
a. x = 60
$z = \frac{60 - 50}{5} = \frac{10}{5} = 2$
b. x = 49.5
$z = \frac{49.5 - 50}{5} = \frac{-0.5}{5} = -0.1$
c. x = 51.5
$z = \frac{51.5 - 50}{5} = \frac{1.5}{5} = 0.3$
Problem No. 5
The following are the scores of 10 students in Math Quiz Bee:
14 10 15 12 17 19 13 11 9 20
Convert scores to z-scores where µ= 15 and 𝜎= 4.5
X Z
9 -1.33
10 -1.11
11 -0.89
12 -0.67
13 -0.44
14 -0.22
15 0
17 0.44
19 0.89
20 1.11
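Every conversion in this lesson uses z = (x − µ) / σ, so one small function covers all five problems. The sketch below recomputes the table for Problem No. 5; the function name `z_score` is illustrative.

```python
def z_score(x, mu, sigma):
    """Standard score: how many standard deviations x lies from the mean."""
    return (x - mu) / sigma

quiz_scores = [14, 10, 15, 12, 17, 19, 13, 11, 9, 20]
quiz_z = {x: round(z_score(x, mu=15, sigma=4.5), 2)
          for x in sorted(quiz_scores)}

for x, z in quiz_z.items():
    print(x, z)
```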
Problem No. 1
Solution:
z = (40 - 30) / 4 = 2.5
P(x < 40) = P (z < 2.5) = [area to the left of 2.5] = 0.9938
Problem No. 2
A radar unit is used to measure speeds of cars on a motorway. The speeds are
normally distributed with a mean of 90 km/hr and a standard deviation of 10 km/hr. What is
the probability that a car picked at random is travelling at more than 100 km/hr?
Solution:
Let x be the random variable that represents the speed of cars. X has μ = 90 and σ =
10. We have to find the probability that x is higher than 100 or P(x > 100).
z = (100 - 90) / 10 = 1
P(x > 100) = P(z > 1) = 1 - [area to the left of 1] = 1 - 0.8413 = 0.1587
The probability that a car selected at random has a speed greater than 100 km/hr is
equal to 0.1587.
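The table lookups in this lesson can be reproduced without a z-table. The sketch below builds the normal cumulative area from the standard library's error function; `normal_cdf` is a helper name assumed here, not something from the module.

```python
from math import erf, sqrt

def normal_cdf(x: float, mu: float = 0.0, sigma: float = 1.0) -> float:
    """Area to the left of x under a normal curve with mean mu and std dev sigma."""
    z = (x - mu) / sigma
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))

# Radar problem: P(x > 100) with mu = 90, sigma = 10
p_over_100 = 1.0 - normal_cdf(100, mu=90, sigma=10)
print(round(p_over_100, 4))
```

The same helper gives the area to the left of z = 2.5 used in Problem No. 1 via `normal_cdf(40, 30, 4)`.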
Problem No. 3
For a certain type of computer, the length of time between charges of the battery is
normally distributed with a mean of 50 hours and a standard deviation of 15 hours. John
owns one of these computers and wants to know the probability that the length of time will be
between 50 and 70 hours.
Solution:
Let x be the random variable that represents the length of time. It has a mean of 50
and a standard deviation of 15.
Problem No. 4
Entry to a certain University is determined by a national test. The scores on this test are
normally distributed with a mean of 500 and a standard deviation of 100. Tom wants to be
admitted to this university and he knows that he must score better than at least 70% of the
students who took the test. Tom takes the test and scores 585. Will he be admitted to this
university?
Solution:
Let x be the random variable that represents the scores. X is normally distributed with a mean
of 500 and a standard deviation of 100. The total area under the normal curve represents the
total number of students who took the test. If we multiply the values of the areas under the
curve by 100, we obtain percentages.
Problem No. 5
a) What is the probability that the length of this component is between 4.98 and 5.02
cm?
b) What is the probability that the length of this component is between 4.96 and 5.04
cm?
Solution:
Lesson 3: Skewness
Problem No. 1
Calculate the degree of skewness of a distribution if the mean is 45, the median is 40, and
standard deviation is 5
Solution:
Sk = 3(X̄ - Md) / S
= 3(45 - 40) / 5
= 3(5) / 5
= 3; hence, the distribution is positively skewed.
Problem No. 2
Calculate the degree of skewness of a distribution if the mean is 60, the median is 50, and
standard deviation is 2
Solution:
Sk = 3(X̄ - Md) / S
= 3(60 - 50) / 2
= 3(10) / 2
= 15; hence, the distribution is positively skewed.
Problem No. 3
Calculate the degree of skewness of a distribution if the mean is 100, the median is 120, and
standard deviation is 5
Solution:
Sk = 3(X̄ - Md) / S
= 3(100 - 120) / 5
= 3(-20) / 5
= -12; hence, the distribution is negatively skewed.
Problem No. 4
Calculate the degree of skewness of a distribution if the mean is 7.5, the median is 6.3, and
standard deviation is 0.7
Solution:
Sk = 3(X̄ - Md) / S
= 3(7.5 - 6.3) / 0.7
= 3(1.2) / 0.7
= 5.14; hence, the distribution is positively skewed.
Problem No. 5
Calculate the degree of skewness of a distribution if the mean is 1050, the median is 995, and
standard deviation is 25.7
Solution:
Sk = 3(X̄ - Md) / S
= 3(1050 - 995) / 25.7
= 3(55) / 25.7
= 6.42; hence, the distribution is positively skewed.
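The skewness problems all use Sk = 3(mean - median) / standard deviation. A small Python sketch of that calculation follows; the function names are illustrative, and the "symmetric" label for Sk = 0 is an assumption added for completeness.

```python
def pearson_skewness(mean: float, median: float, std_dev: float) -> float:
    """Coefficient of skewness: 3 * (mean - median) / standard deviation."""
    return 3 * (mean - median) / std_dev

def describe(sk: float) -> str:
    """Verbal direction of the skew based on the sign of Sk."""
    if sk > 0:
        return "positively skewed"
    if sk < 0:
        return "negatively skewed"
    return "symmetric"

sk = pearson_skewness(mean=7.5, median=6.3, std_dev=0.7)
print(round(sk, 2), describe(sk))
```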
LESSON 4. KURTOSIS
Problem No. 1
Time taken 8 – 10 11 – 13 14 – 16 17 – 19 20 – 22 23 – 25
Frequencies 2 4 6 4 3 1
For the 1st quartile:
N/4 = 20/4 = 5th item
Thus, the 1st quartile class is 11 – 13 since it is where the 5th item is found.
Q1 = LQ1 + [ (N/4 - <Cfb) / fQ1 ] i
= 10.5 + [ (5 - 2) / 4 ] (3)
= 10.5 + 2.25 = 12.75
For the 3rd quartile:
3N/4 = 3(20)/4 = 15th item
Thus, the 3rd quartile class is 17 – 19 since it is where the 15th item is found.
Q3 = LQ3 + [ (3N/4 - <Cfb) / fQ3 ] i
= 16.5 + [ (15 - 12) / 4 ] (3)
= 16.5 + 2.25 = 18.75
Q = (Q3 - Q1) / 2 = (18.75 - 12.75) / 2 = 6 / 2 = 3
For the 90th percentile:
LP90 = 19.5, fP90 = 3, <Cfb = 16, i = 3, N = 20
P90 = LP90 + [ (90N/100 - <Cfb) / fP90 ] i
= 19.5 + [ (18 - 16) / 3 ] (3)
= 19.5 + 2 = 21.5
For the 10th percentile:
LP10 = 7.5, fP10 = 2, <Cfb = 0, i = 3, N = 20
P10 = LP10 + [ (10N/100 - <Cfb) / fP10 ] i
= 7.5 + [ (2 - 0) / 2 ] (3)
= 7.5 + 3 = 10.5
k = Q / (P90 - P10) = 3 / (21.5 - 10.5) = 3 / 11 = 0.27
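The grouped-data interpolation used for Q1, Q3, P10 and P90 follows one pattern: lower boundary + [(position - cumulative frequency before) / class frequency] × class width. The Python sketch below applies it to the frequency table of Problem No. 1 (classes 8 – 10 through 23 – 25). The function name and argument layout are assumptions made for illustration.

```python
def grouped_quantile(lower_boundaries, freqs, width, fraction):
    """Interpolated quantile of grouped data.

    lower_boundaries: lower class boundary of each class, in ascending order
    freqs: class frequencies, width: class size i, fraction: e.g. 0.25 for Q1
    """
    target = fraction * sum(freqs)   # position of the item, e.g. N/4
    cum_before = 0                   # <Cf of the classes before the current one
    for lower, f in zip(lower_boundaries, freqs):
        if cum_before + f >= target:
            return lower + (target - cum_before) / f * width
        cum_before += f
    raise ValueError("fraction must be between 0 and 1")

lowers = [7.5, 10.5, 13.5, 16.5, 19.5, 22.5]
freqs = [2, 4, 6, 4, 3, 1]
q1 = grouped_quantile(lowers, freqs, 3, 0.25)
q3 = grouped_quantile(lowers, freqs, 3, 0.75)
p10 = grouped_quantile(lowers, freqs, 3, 0.10)
p90 = grouped_quantile(lowers, freqs, 3, 0.90)
quartile_deviation = (q3 - q1) / 2
k = quartile_deviation / (p90 - p10)   # percentile coefficient of kurtosis
print(q1, q3, p10, p90, round(k, 4))
```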
Problem No. 2
For the 1st quartile:
N/4 = 50/4 = 12.5th item
Thus, the 1st quartile class is 30 – 34 since it is where the 12.5th item is found.
Q1 = LQ1 + [ (N/4 - <Cfb) / fQ1 ] i
= 29.5 + [ (12.5 - 8) / 9 ] (5)
= 29.5 + 2.5 = 32
For the 3rd quartile:
3N/4 = 3(50)/4 = 37.5th item
Thus, the 3rd quartile class is 40 – 44 since it is where the 37.5th item is found.
Q3 = LQ3 + [ (3N/4 - <Cfb) / fQ3 ] i
= 39.5 + [ (37.5 - 27) / 12 ] (5)
= 39.5 + 4.375 = 43.875
Q = (Q3 - Q1) / 2 = (43.875 - 32) / 2 = 5.938
For the 90th percentile:
LP90 = 44.5, fP90 = 7, <Cfb = 39, i = 5, N = 50
P90 = LP90 + [ (90N/100 - <Cfb) / fP90 ] i
= 44.5 + [ (45 - 39) / 7 ] (5)
= 44.5 + 4.29 = 48.79
For the 10th percentile:
LP10 = 24.5, fP10 = 6, <Cfb = 2, i = 5, N = 50
P10 = LP10 + [ (10N/100 - <Cfb) / fP10 ] i
= 24.5 + [ (5 - 2) / 6 ] (5)
= 24.5 + 2.5 = 27
k = Q / (P90 - P10) = 5.938 / (48.79 - 27) = 5.938 / 21.79 = 0.2725
Problem No. 3
Calculate the quartile deviation of the Engineering Data Analysis test score of 50
students.
For the 1st quartile:
N/4 = 50/4 = 12.5th item
Thus, the 1st quartile class is 75 – 79 since it is where the 12.5th item is found.
Q1 = LQ1 + [ (N/4 - <Cfb) / fQ1 ] i
= 74.5 + [ (12.5 - 3) / 16 ] (5)
= 74.5 + 2.97 = 77.47
For the 3rd quartile:
3N/4 = 3(50)/4 = 37.5th item
Thus, the 3rd quartile class is 85 – 89 since it is where the 37.5th item is found.
Q3 = LQ3 + [ (3N/4 - <Cfb) / fQ3 ] i
= 84.5 + [ (37.5 - 33) / 10 ] (5)
= 84.5 + 2.25 = 86.75
Q = (Q3 - Q1) / 2 = (86.75 - 77.47) / 2 = 4.64
Thus, the 90th percentile class is 85 – 89 since it is where the 45th item is found.
LP90 = 84.5, fP90 = 10, <Cfb = 33, i = 5, N = 50
P90 = LP90 + [ (90N/100 - <Cfb) / fP90 ] i
= 84.5 + [ (45 - 33) / 10 ] (5)
= 84.5 + 6 = 90.5
For the 10th percentile:
LP10 = 74.5, fP10 = 16, <Cfb = 3, i = 5, N = 50
P10 = LP10 + [ (10N/100 - <Cfb) / fP10 ] i
= 74.5 + [ (5 - 3) / 16 ] (5)
= 74.5 + 0.625
= 75.125
k = Q / (P90 - P10) = 4.64 / (90.5 - 75.125) = 4.64 / 15.375 = 0.3018
Problem No. 4
For the 1st quartile:
N/4 = 30/4 = 7.5th item
Thus, the 1st quartile class is 2 – 4 since it is where the 7.5th item is found (this class has f = 10 and <Cf = 10).
Q1 = LQ1 + [ (N/4 - <Cfb) / fQ1 ] i
= 1.5 + [ (7.5 - 0) / 10 ] (3)
= 1.5 + 2.25
= 3.75
For the 3rd quartile:
3N/4 = 3(30)/4 = 22.5th item
Thus, the 3rd quartile class is 8 – 10 since it is where the 22.5th item is found.
Q3 = LQ3 + [ (3N/4 - <Cfb) / fQ3 ] i
= 7.5 + [ (22.5 - 14) / 9 ] (3)
= 7.5 + 2.83
= 10.33
Q = (Q3 - Q1) / 2 = (10.33 - 3.75) / 2 = 3.29
For the 90th percentile:
LP90 = 10.5, fP90 = 7, <Cfb = 23, i = 3, N = 30
P90 = LP90 + [ (90N/100 - <Cfb) / fP90 ] i
= 10.5 + [ (27 - 23) / 7 ] (3)
= 10.5 + 1.71
= 12.21
For the 10th percentile:
LP10 = 1.5, fP10 = 10, <Cfb = 0, i = 3, N = 30
P10 = LP10 + [ (10N/100 - <Cfb) / fP10 ] i
= 1.5 + [ (3 - 0) / 10 ] (3)
= 1.5 + 0.9
= 2.4
k = Q / (P90 - P10) = 3.29 / (12.21 - 2.4) = 3.29 / 9.81 = 0.3354
Problem No. 5
A sample of college students was asked how much they spent monthly on a cellphone
plan. Solve for quartile deviation.
Class      f     Class Boundaries   <Cf
10 – 19    8     9.5 – 19.5         8
20 – 29    16    19.5 – 29.5        24
30 – 39    21    29.5 – 39.5        55
40 – 49    11    39.5 – 49.5        66
50 – 59    4     49.5 – 59.5        70
N = 70
For the 1st quartile:
N/4 = 70/4 = 17.5th item
Thus, the 1st quartile class is 20 – 29 since it is where the 17.5th item is found.
Q1 = LQ1 + [ (N/4 - <Cfb) / fQ1 ] i
= 19.5 + [ (17.5 - 8) / 16 ] (10)
= 19.5 + 5.94
= 25.44
For the 3rd quartile:
3N/4 = 3(70)/4 = 52.5th item
Thus, the 3rd quartile class is 30 – 39 since it is where the 52.5th item is found.
Q3 = LQ3 + [ (3N/4 - <Cfb) / fQ3 ] i
= 29.5 + [ (52.5 - 24) / 21 ] (10)
= 29.5 + 13.57
= 43.07
Q = (Q3 - Q1) / 2 = (43.07 - 25.44) / 2 = 8.815
For the 90th percentile:
LP90 = 39.5, fP90 = 11, <Cfb = 55, i = 10, N = 70
P90 = LP90 + [ (90N/100 - <Cfb) / fP90 ] i
= 39.5 + [ (63 - 55) / 11 ] (10)
= 39.5 + 7.27
= 46.77
For the 10th percentile:
LP10 = 9.5, fP10 = 8, <Cfb = 0, i = 10, N = 70
P10 = LP10 + [ (10N/100 - <Cfb) / fP10 ] i
= 9.5 + [ (7 - 0) / 8 ] (10)
= 9.5 + 8.75
= 18.25
k = Q / (P90 - P10) = 8.815 / (46.77 - 18.25) = 8.815 / 28.52 = 0.3091
MODULE 7
"HYPOTHESIS TESTING"
Step 4: Compute t = √15 × ($1,080 - $1,240) / $180 ≈ -3.443
Problem No. 2.
The hourly French fried potato output by the Krisp-o-Matic fry machine is advertised to be
150 pounds. For the new machine purchased by the Burger Heaven drive-in, tests were run
for 22 different one-hour periods, producing an average production of 143 pounds, with a
standard deviation of 17 pounds. At the 5% level of significance, does the Burger Heaven
management have grounds for complaints?
SOLUTION: Here are the steps for this problem.
Step 1: The hypothesis statement is H0: μ = 150 versus H1: μ ≠ 150.
Observe that μ represents the true-but-unknown mean for the new Krisp-o-Matic machine.
The comparison value 150 is the numerical claim, and we want to compare μ to 150.
It might seem that the whole problem was set up with H1: μ < 150 in mind. After all, the test
could not possibly be designed to detect a machine that was performing better than
advertised. However, in the absence of a blatant statement that the experiment was designed
with a one-sided motive, we should use the two-sided alternative. As before, we should not
let the value in the data influence the choice of H1. Also as before, you should not attempt to
second-guess the researcher’s motives. In general, we really like to stay away from one-sided
alternative hypotheses.
Step 2: Level of significance α = 0.05. The value 0.05 is requested. If the α value were left
vague or unspecified, most users would take 0.05 as the default.
Step 3: The test statistic will be t = √n (x̄ - μ0) / s. The null hypothesis will be rejected if | t | ≥
tα/2;n-1. If | t | < tα/2;n-1 then H0 will be accepted or judgment will be reserved.
At this point it would be helpful to recognize that the sample size is small; we should state
the assumption that the data are sampled from a normal population.
In using this formula, we’ll have n = 22, μ0 = 150 (the comparison value). The numbers x̄ = 143
and s = 17 will come from the sample. The value tα/2;n-1 is t0.025;21 = 2.080.
The “judgment will be reserved” phrase allows for the possibility that you might end up
accepting H0 without really believing H0. This happens frequently when the sample size is
small.
Step 4: Compute t = √22 × (143 - 150) / 17 ≈ -1.931
Step 5: Since | -1.931 | = 1.931 < 2.080, the null hypothesis is accepted. The results are not
significant. The Krisp-o-Matic would be declared not significantly different from the claim.
The phrase not significant means that the null hypothesis has been accepted. This does not
mean that we really believe H0 ; we might simply reserve judgment until we get more data.
The p-value would be reported as p > 0.05 (NS). The NS stands for not significant.
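Steps 3 to 5 can be sketched in Python as below. The critical value 2.080 is taken from the text (t0.025;21) rather than computed, and the function name `one_sample_t` is an illustrative assumption.

```python
from math import sqrt

def one_sample_t(sample_mean: float, mu0: float, s: float, n: int) -> float:
    """t = sqrt(n) * (x-bar - mu0) / s for a one-sample t test."""
    return sqrt(n) * (sample_mean - mu0) / s

t = one_sample_t(sample_mean=143, mu0=150, s=17, n=22)
critical = 2.080  # t(0.025; 21), the two-sided 5% critical value quoted in the text
reject_h0 = abs(t) >= critical
print(round(t, 3), "reject H0" if reject_h0 else "do not reject H0")
```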
Problem No. 3.
Researchers are interested in the mean age of a certain population.
A random sample of 10 individuals drawn from the population of interest has a
mean of 27.
Assuming that the population is approximately normally distributed with variance
20, can we conclude that the mean is different from 30 years? (α = 0.05)
If the p-value is 0.0340, how can we use it in making a decision?
Solution
1-Data: variable is age, n = 10, x̄ = 27, σ² = 20, α = 0.05
2-Assumptions: the population is approximately normally distributed with variance 20
3-Hypotheses:
H0: μ = 30
HA: μ ≠ 30
4-Test Statistic:
Z = (x̄ - μ0) / (σ/√n) = (27 - 30) / √(20/10) = -2.12
5-Decision Rule:
The alternative hypothesis is HA: μ ≠ 30.
Hence we reject H0 if Z > Z1-0.025 = Z0.975 or Z < -Z1-0.025 = -Z0.975.
Z0.975 = 1.96 (from table D)
6-Decision:
Since Z = -2.12 < -1.96, we reject H0 and conclude that the mean is different from 30 years.
The p-value leads to the same decision: 0.0340 < α = 0.05, so H0 is rejected.
Problem No. 4
A contractor wishes to lower heating bills by using a special type of insulation in houses. If
the average of the monthly heating bills is $78, her hypotheses about heating costs will be.
Answer:
Ho: µ ≥ $78
Ha: µ < $78
Problem No. 5
A chemist invents an additive to increase the life of an automobile battery. If the mean
lifetime of the battery is 36 months, then his hypotheses are.
Answer:
Ho: µ ≤ 36
Ha: µ > 36
Problem No. 1
The school nurse thinks the average height of 7th graders has increased. The average height of
a 7th grader five years ago was 145 cm with a standard deviation of 20 cm. She takes a random
sample of 200 students and finds that the average height of her sample is 147 cm. Are 7th graders
now taller than they were before? Conduct a single-tailed hypothesis test using a 0.05 level of
significance.
Answer:
Ho: µ ≤ 145
Ha: µ > 145
z = (x̄ - µ) / σ × √n = (147 - 145) / 20 × √200 = 1.41
The calculated value (1.41) is smaller than the tabulated value (1.645). Therefore, the null hypothesis is not
rejected.
Problem No. 2
A researcher reports that the average salary of assistant professors is more than $42,000. A
sample of 30 assistant professors has a mean salary of $43,260. At α = 0.05, test the claim
that assistant professors earn more than $42,000 a year. The standard deviation of the
population is $5230.
Answer:
Ho: µ ≤ $42,000
Ha: µ > $42,000
z = (x̄ - µ) / σ × √n = (43,260 - 42,000) / 5,230 × √30 = 1.32
The calculated value is smaller than the tabulated value. Therefore, the null hypothesis is not
rejected.
Problem No. 3
A national magazine claims that the average college student watches less television than the
general public. The national average is 29.4 hours per week, with a standard deviation of 2
hours. A sample of 30 college students has a mean of 27 hours. Is there enough evidence to
support the claim at α = 0.01?
Answer:
Ho: µ ≥ 29.4
Ha: µ < 29.4
z = (x̄ - µ) / σ × √n = (27 - 29.4) / 2 × √30 = -6.57
The absolute value of the calculated value is greater than the tabulated value. Therefore, the null hypothesis is
rejected.
Problem No. 4
The Medical Rehabilitation Education Foundation reports that the average cost of
rehabilitation for stroke victims is $24,672. To see if the average cost of rehabilitation is
different at a large hospital, a researcher selected a random sample of 35 stroke victims and
found that the average cost of their rehabilitation is $25,226. The standard deviation of the
population is $3,251. At α = 0.01, can it be concluded that the average cost at a large hospital
is different from $24,672?
Answer:
Ho: µ = $24,672
Ha: µ ≠ $24,672
z = (x̄ - µ) / σ × √n = (25,226 - 24,672) / 3,251 × √35 = 1.01
The calculated value is smaller than the tabulated value. Therefore, the null hypothesis is not
rejected.
Problem No. 5
A researcher wishes to test the claim that the average age of lifeguards in Ocean City is
greater than 24 years. She selects a sample of 36 guards and finds the mean of the sample to
be 24.7 years, with a standard deviation of 2 years. Is there evidence to support the claim at
α= 0.05?
Answer:
Ho: µ ≤ 24
Ha: µ > 24
z = (x̄ - µ) / σ × √n = (24.7 - 24) / 2 × √36 = 2.10
The calculated value is greater than the tabulated value. Therefore, the null hypothesis is
rejected.
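The z values in this set of problems all come from z = (x̄ - µ) / σ × √n. A minimal Python sketch of that calculation, with the function name `z_statistic` chosen only for illustration:

```python
from math import sqrt

def z_statistic(sample_mean: float, mu0: float, sigma: float, n: int) -> float:
    """z = (x-bar - mu0) / sigma * sqrt(n) for a one-sample z test."""
    return (sample_mean - mu0) / sigma * sqrt(n)

# Assistant professors' salaries: x-bar = 43,260, mu0 = 42,000, sigma = 5,230, n = 30
print(round(z_statistic(43260, 42000, 5230, 30), 2))
# Lifeguards' ages: x-bar = 24.7, mu0 = 24, sigma = 2, n = 36
print(round(z_statistic(24.7, 24, 2, 36), 2))
```

The computed z is then compared with the tabulated value for the stated α and the direction of Ha.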
Problem No. 1
An investigator thinks that people under the age of forty have vocabularies that are different
than those of people over sixty years of age. The investigator administers a vocabulary test to
a group of 31 younger subjects and to a group of 31 older subjects. Higher scores reflect
better performance. The mean score for younger subjects was 14.0 and the standard deviation
of younger subject's scores was 5.0. The mean score for older subjects was 20.0 and the
standard deviation of older subject's scores was 6.0. Does this experiment provide evidence
for the investigator's theory? The level of significance is 0.05.
Answer:
Ho: There is no significant difference between the vocabularies of people under the age of forty
and those of people over sixty years of age.
Ha: There is a significant difference between the vocabularies of people under the age of forty
and those of people over sixty years of age.
t = (x̄1 - x̄2) / √{ [ ((n1 - 1)s1² + (n2 - 1)s2²) / (n1 + n2 - 2) ] [ (n1 + n2) / (n1 n2) ] }
t = (14 - 20) / √{ [ ((31 - 1)25 + (31 - 1)36) / (31 + 31 - 2) ] [ (31 + 31) / (31 × 31) ] } = -4.28
df = n1 + n2 - 2 = 31 + 31 - 2 = 60
The absolute value of the calculated value is greater than the tabulated value. Therefore, the null hypothesis is rejected.
Problem No. 2
An investigator predicts that dog owners in the country spend more time walking their dogs
than do dog owners in the city. The investigator gets a sample of 21 country owners and 23
city owners. The mean number of hours per week that city owners spend walking their dogs
is 10.0. The standard deviation of hours spent walking the dog by city owners is 3.0. The
mean number of hours country owners spent walking their dogs per week was 15.0. The
standard deviation of the number of hours spent walking the dog by owners in the country
was 4.0. Do dog owners in the country spend more time walking their dogs than do dog
owners in the city? Use 0.01 level of significance.
Answer:
Ho: There is no significant difference between the time dog owners in the country and in the city
spend walking their dogs.
Ha: There is a significant difference between the time dog owners in the country and in the city
spend walking their dogs.
t = (15 - 10) / √{ [ ((21 - 1)16 + (23 - 1)9) / (21 + 23 - 2) ] [ (21 + 23) / (21 × 23) ] } = 4.72
df = n1 + n2 - 2 = 21 + 23 - 2 = 42
The calculated value is greater than the tabulated value. Therefore, the null hypothesis is
rejected.
Problem No. 3
An investigator theorizes that people who participate in a regular program of exercise will
have levels of systolic blood pressure that are significantly different from that of people who
do not participate in a regular program of exercise. To test this idea the investigator randomly
assigns 21 subjects to an exercise program for 10 weeks and 21 subjects to a non-exercise
comparison group. After ten weeks the mean systolic blood pressure of subjects in the
exercise group is 137 and the standard deviation of blood pressure values in the exercise
group is 10. After ten weeks, the mean systolic blood pressure of subjects in the non-exercise
group is 127 and the standard deviation on subjects in the non-exercise group is 9.0. Please
test the investigator's theory using an alpha level of .05.
Answer:
t = (137 - 127) / √{ [ ((21 - 1)100 + (21 - 1)81) / (21 + 21 - 2) ] [ (21 + 21) / (21 × 21) ] } = 3.41
df = n1 + n2 - 2 = 21 + 21 - 2 = 40
The calculated value is greater than the tabulated value. Therefore, the null hypothesis is rejected.
Problem No. 4
A statistics teacher wants to compare his two classes to see if they performed any differently
on tests he gave that semester: Class F had 25 students with an average score of 70, standard
deviation 15. Class H had 20 students with an average score of 74, standard deviation of 25.
The level of significance is 0.05. Did these classes perform differently on the tests?
Answer:
Ho: μclass F = μclass H
Ha: μclass F ≠ μclass H
t = (70 - 74) / √{ [ ((25 - 1)225 + (20 - 1)625) / (25 + 20 - 2) ] [ (25 + 20) / (25 × 20) ] } = -0.67
df = n1 + n2 - 2 = 25 + 20 - 2 = 43
The absolute value of the calculated value is smaller than the tabulated value. Therefore, the null hypothesis is not
rejected.
Problem No. 5
Leo grows tomatoes in two separate fields. When the tomatoes are ready to be picked, he is
curious as to whether the sizes of his tomato plants differ between the two fields. He takes a
random sample of plants from each field and measures the heights of the plants. Here is a
summary of the results: Use 0.05 as level of significance.
Field A Field B
Mean 1.3m 1.6m
Standard deviation 0.5m 0.3m
Number of plants 22 24
Answer:
Ho: μa = μb
Ha: μa ≠ μb
t = (1.3 - 1.6) / √{ [ ((22 - 1)0.25 + (24 - 1)0.09) / (22 + 24 - 2) ] [ (22 + 24) / (22 × 24) ] } = -2.49
df = n1 + n2 - 2 = 22 + 24 - 2 = 44
The absolute value of the calculated value is greater than the tabulated value. Therefore, the null hypothesis is
rejected.
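The two-sample problems above all use the pooled-variance t statistic. The sketch below implements that formula in Python from summary statistics (means, standard deviations, sample sizes); the function name `pooled_t` is an illustrative assumption.

```python
from math import sqrt

def pooled_t(mean1, s1, n1, mean2, s2, n2):
    """Pooled-variance two-sample t statistic and its degrees of freedom."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * s1**2 + (n2 - 1) * s2**2) / df
    std_error = sqrt(pooled_var * (n1 + n2) / (n1 * n2))
    return (mean1 - mean2) / std_error, df

# Vocabulary scores: younger group (14.0, 5.0, 31) vs. older group (20.0, 6.0, 31)
t, df = pooled_t(14.0, 5.0, 31, 20.0, 6.0, 31)
print(round(t, 2), df)
```

The sign of t only reflects which group is listed first; the comparison with the tabulated value uses |t| for a two-tailed test.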
Problem No. 1
The English teacher conducts a vocabulary quiz in the first meeting and the last meeting of the
class each year to assess whether the students learn something during class hours. In the first meeting the
students scored 142 overall and in the last meeting the students scored 173 on a 200-item quiz
with a sample variance of 42. Determine if the students' performance improved. The level
of significance is 0.01.
Answer:
Ho: μ1 = μ2
Ha: μ1 ≠ μ2
sd = √42 = 6.481
t = (∑d / n) / sd × √n = ((142 - 173) / 200) / 6.481 × √200 = -0.34
df = n - 1 = 200 - 1 = 199
The calculated value is smaller than the tabular value. Therefore, the null hypothesis is not
rejected.
Problem No. 2
The following are fear ratings administered to five subjects before and after exposure to “fear
of the dark therapy”:
Subject Before After
Shaggy 8 4
Scooby 9 6
Fred 4 3
Velma 2 2
Daphne 5 3
Answer:
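Ho: μ1 = μ2
Ha: μ1 ≠ μ2
Let d = Before - After:
∑d = 10, n = 5, ∑d² = 30
sd = √[ (n∑d² - (∑d)²) / (n(n - 1)) ] = √[ (5(30) - (10)²) / (5(4)) ] = √2.5 = 1.581
t = (∑d / n) / sd × √n = (10 / 5) / 1.581 × √5 = 2.83
df = n - 1 = 5 - 1 = 4
At the 0.05 level of significance (two-tailed, df = 4) the tabular value is 2.776.
The calculated value is greater than the tabular value. Therefore, the null hypothesis is
rejected.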
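For paired data the test works on the differences d = Before - After. A minimal Python sketch using the fear ratings from the table above; the function name `paired_t` is illustrative, and the standard deviation is the sample (n - 1) standard deviation of the differences.

```python
from math import sqrt

def paired_t(before, after):
    """Paired t statistic: mean difference divided by s_d / sqrt(n), plus df."""
    d = [b - a for b, a in zip(before, after)]
    n = len(d)
    mean_d = sum(d) / n
    # Sample standard deviation of the differences (n - 1 in the denominator)
    s_d = sqrt(sum((x - mean_d) ** 2 for x in d) / (n - 1))
    return mean_d / (s_d / sqrt(n)), n - 1

before = [8, 9, 4, 2, 5]   # Shaggy, Scooby, Fred, Velma, Daphne
after = [4, 6, 3, 2, 3]
t, df = paired_t(before, after)
print(round(t, 2), df)
```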
Problem No. 3
Suppose a sample of n students was given a diagnostic test before studying a particular
module and then again after completing the module. We want to find out if, in general, our
teaching leads to improvements in students’ knowledge/skills (i.e. test scores). We can use
the results from our sample of students to draw conclusions about the impact of this module
in general.
Answer:
Ho: μ1 = μ2
Ha: μ1 ≠ μ2
t = (∑d / n) / sd × √n = (-25 / 15) / 2.89 × √15 = -2.23
df = n - 1 = 15 - 1 = 14
The calculated value is smaller than the tabular value. Therefore, the null hypothesis is not
rejected.
Problem No. 4
We could have conducted the charter school study in a different way—by comparing
teachers’ satisfaction ratings before and after a school was converted to a privately operated
school. This design could be classified as a single-group pretest-posttest design. I have used
the same numbers as in the first between-subjects example given in class to illustrate a point,
but this is completely different example where we have two scores for each of 5 teachers.
Notice that in this design we only are using half the number of cases. Each teacher has two
scores.
Answer:
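Ho: μ1 = μ2
Ha: μ1 ≠ μ2
sd = √[ (n∑d² - (∑d)²) / (n(n - 1)) ] = √[ (5(60) - (-16)²) / (5(4)) ] = √2.2 = 1.48
t = (∑d / n) / sd × √n = (-16 / 5) / 1.48 × √5 = -4.83
df = n - 1 = 5 - 1 = 4
The absolute value of the calculated value is greater than the tabular value. Therefore, the null hypothesis is
rejected.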
Problem No. 5
A researcher is studying the influence of noise on one’s ability to solve statistics problems.
The researcher randomly selects n = 10 students and exposes them to a noisy condition for 10
minutes and then a quiet condition for 10 minutes. In each condition, students are given a set
of statistics problems to solve. The dependent variable is the number of mistakes made on the
statistics problems during the ten minutes. Here, the researcher is testing a non-directional
hypothesis, because she wants to know if there is any effect of noise on performance (errors).
Answer:
df =n−1=10−1=9
The calculated value is greater than the tabular value. Therefore, the null hypothesis is
rejected.
Problem No. 1
You’re testing two flu drugs A and B. Drug A works on 41 people out of a sample of 195.
Drug B works on 351 people in a sample of 605. Are the two drugs comparable? Use a
5% alpha level.
Answer:
Ho: P1 = P2
Ha: P1 ≠ P2
α = 0.05, two-tailed
z = (p1 - p2) / √(p1q1/n1 + p2q2/n2)
= (41/195 - 351/605) / √[ (41/195)(154/195)/195 + (351/605)(254/605)/605 ]
= -10.45
The absolute value of the calculated value is greater than the tabulated value. Therefore, the null
hypothesis is rejected.
Problem No. 2
Suppose the Acme Drug Company develops a new drug, designed to prevent colds. The
company states that the drug is equally effective for men and women. To test this claim, they
choose a simple random sample of 100 women and 200 men from a population of 100,000
volunteers. At the end of the study, 38% of the women caught a cold; and 51% of the men
caught a cold. Based on these findings, can we reject the company's claim that the drug is
equally effective for men and women? Use a 0.05 level of significance.
Answer:
Ho: P1 = P2
Ha: P1 ≠ P2
z = (p1 - p2) / √(p1q1/n1 + p2q2/n2)
= (38/100 - 102/200) / √[ (38/100)(31/50)/100 + (102/200)(49/100)/200 ]
= -2.17
The absolute value of the calculated value is greater than the tabulated value. Therefore, the null hypothesis is
rejected.
Problem No. 3
Two types of medication for hives are being tested to determine if there is a difference in the
proportions of adult patient reactions. Twenty out of a random sample of 200 adults given
medication A still had hives 30 minutes after taking the medication. Twelve out of
another random sample of 200 adults given medication B still had hives 30 minutes after
taking the medication. Test at a 1% level of significance.
Answer:
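Ho: PA = PB
Ha: PA ≠ PB
z = (pA - pB) / √(pAqA/nA + pBqB/nB)
= (20/200 - 12/200) / √[ (20/200)(180/200)/200 + (12/200)(188/200)/200 ]
= 1.48
The calculated value is smaller than the tabulated value. Therefore, the null hypothesis is not
rejected.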
Problem No. 4
A research study was conducted about gender differences in “sexting.” The researcher
believed that the proportion of girls involved in “sexting” is less than the proportion of boys
involved. The data collected in the spring of 2010 among a random sample of middle and
high school students in a large school district in the southern United States is summarized in
the table. Is the proportion of girls sending sexts less than the proportion of boys “sexting?”
Test at a 1% level of significance.
Males Females
Sent “sexts” 183 156
Total number surveyed 2231 2169
Answer:
Ho: PF = PM
Ha: PF < PM
α = 0.01, one-tailed
z = (pM - pF) / √(pMqM/nM + pFqF/nF)
= (183/2231 - 156/2169) / √[ (183/2231)(2048/2231)/2231 + (156/2169)(671/723)/2169 ]
= 1.26
The calculated value is smaller than the tabulated value. Therefore, the null hypothesis is not
rejected.
Problem No. 5
Researchers conducted a study of smartphone use among adults. A cell phone company
claimed that iPhone smartphones are more popular with whites (non-Hispanic) than with
African Americans. The results of the survey indicate that of the 232 African American cell
phone owners randomly sampled, 5% have an iPhone. Of the 1,343 white cell phone owners
randomly sampled, 10% own an iPhone. Test at the 5% level of significance. Is the
proportion of white iPhone owners greater than the proportion of African American iPhone
owners?
Answer:
Ho: Pw = PA
Ha: Pw > PA
α = 0.05, one-tailed
z = (pA - pw) / √(pAqA/nA + pwqw/nw)
= (11.6/232 - 134.3/1343) / √[ (11.6/232)(19/20)/232 + (134.3/1343)(9/10)/1343 ]
= -3.03
The absolute value of the calculated value is greater than the tabulated value. Therefore, the null hypothesis is
rejected.
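The two-proportion problems use z = (p1 - p2) / √(p1q1/n1 + p2q2/n2), with each sample's own proportion in the standard error. A short Python sketch of that (unpooled) formula follows; the function name `two_proportion_z` is an assumption made for illustration.

```python
from math import sqrt

def two_proportion_z(x1: float, n1: int, x2: float, n2: int) -> float:
    """z for the difference of two sample proportions, unpooled standard error."""
    p1, p2 = x1 / n1, x2 / n2
    std_error = sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
    return (p1 - p2) / std_error

# Sexting survey: 183 of 2231 males vs. 156 of 2169 females
print(round(two_proportion_z(183, 2231, 156, 2169), 2))
# iPhone survey: 5% of 232 owners vs. 10% of 1,343 owners
print(round(two_proportion_z(0.05 * 232, 232, 0.10 * 1343, 1343), 2))
```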
MODULE 8
"CORRELATION ANALYSIS"
Problem No. 1
Find the value of the correlation coefficient from the following table below and test the
hypothesis that there is no significant correlation between the age and glucose level at 5%
level of significance.
r = [ n∑XY - (∑X)(∑Y) ] / √{ [ n∑X² - (∑X)² ] [ n∑Y² - (∑Y)² ] }
= [ 6(20485) - (247)(486) ] / √{ [ 6(11409) - (247)² ] [ 6(40022) - (486)² ] }
r = 0.529809
e. df = N – 2 = 6 – 2 = 4
f. Tabular Value = ±0.811
g. The null is accepted since the computed value does not fall in the critical region. It
falls between the critical values.
h. There is no significant linear relationship between the age and glucose level. The
verbal interpretation of r shows that there is moderate correlation.
Problem No. 2
Marls obtained the scores of 5 students in algebra and trigonometry. Calculate the
Pearson correlation coefficient and test the hypothesis that there is no significant correlation
between the scores in algebra and trigonometry level at 5% level of significance.
Algebra 15 16 12 10 8
Trigonometry 18 11 10 20 17
r = [ n∑XY - (∑X)(∑Y) ] / √{ [ n∑X² - (∑X)² ] [ n∑Y² - (∑Y)² ] }
= [ 5(902) - (61)(76) ] / √{ [ 5(789) - (61)² ] [ 5(1234) - (76)² ] }
r = -0.424
e. df = N – 2 = 5 – 2 = 3
f. Tabular Value = ± 0.878
g. The null is accepted since the computed value does not fall in the critical region. It
falls between the critical values.
h. There is no significant linear relationship between the scores in trigonometry and
algebra. The verbal interpretation of r shows that there is moderate correlation.
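The raw-score formula for r can be sketched in Python as below, using the algebra and trigonometry scores from this problem. The function name `pearson_r` is illustrative, and the tabular value 0.878 is simply the one quoted in the solution for df = 3.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson r from raw sums: [nΣXY - ΣXΣY] / sqrt([nΣX² - (ΣX)²][nΣY² - (ΣY)²])."""
    n = len(xs)
    sx, sy = sum(xs), sum(ys)
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    syy = sum(y * y for y in ys)
    return (n * sxy - sx * sy) / sqrt((n * sxx - sx**2) * (n * syy - sy**2))

algebra = [15, 16, 12, 10, 8]
trigonometry = [18, 11, 10, 20, 17]
r = pearson_r(algebra, trigonometry)
tabular = 0.878  # critical value quoted in the solution for df = 3, 5% level
print(round(r, 3), "significant" if abs(r) > tabular else "not significant")
```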
Problem No. 3
Calculate the Pearson correlation coefficient of the age of husbands and wives below and
test the hypothesis that there is no significant correlation between the ages of husbands and
wives at 5% level of significance.
Husband (X) 36 72 37 36 51 50 47 50 37 41
Wife (Y) 35 67 33 35 50 46 47 42 36 41
r = [ n∑XY - (∑X)(∑Y) ] / √{ [ n∑X² - (∑X)² ] [ n∑Y² - (∑Y)² ] }
= [ 10(20737) - (457)(432) ] / √{ [ 10(22005) - (457)² ] [ 10(19594) - (432)² ] }
r = 0.974
e. df = N – 2 = 10 – 2 = 8
f. Tabular Value = ± 0.632
g. Reject the null hypothesis because the computed value, 0.974, is greater than the
tabular value, 0.632.
h. There is a significant linear relationship between the ages of the husbands and wives.
The verbal interpretation of r shows that there is a very high correlation.
Problem No. 4.
The statics and differential calculus scores of engineering students of the University of
Eastern Philippines were recorded. Test the hypothesis that there is no significant correlation
between the scores of engineering students in statics and differential calculus at 5% level of
significance.
Statics 10 12 15 14 12 10 9 11 14 10 16
Calculus 32 46 62 60 51 40 38 42 56 30 65
r = [ n∑XY - (∑X)(∑Y) ] / √{ [ n∑X² - (∑X)² ] [ n∑Y² - (∑Y)² ] }
= [ 11(6582) - (133)(522) ] / √{ [ 11(1663) - (133)² ] [ 11(26254) - (522)² ] }
r = 0.948
e. df = N – 2 = 11 – 2 = 9
f. Tabular Value = ± 0.602
g. Reject the null hypothesis because the computed value, 0.948, is greater than the
tabular value, 0.602.
h. There is a significant linear relationship between the scores of engineering students in
statics and differential calculus. The verbal interpretation of r shows that there is a
very high correlation.
Problem No. 5
Below are the data for six participants giving their number of years in college (X) and
their subsequent monthly income in thousands (Y). Calculate the Pearson correlation
coefficient and test the hypothesis that there is no significant correlation between the
participant’s number of years in college and their subsequent monthly income at 5% level of
significance.
X 0 1 3 4 4 6
Y 15 15 20 25 30 35
r = [ n∑XY - (∑X)(∑Y) ] / √{ [ n∑X² - (∑X)² ] [ n∑Y² - (∑Y)² ] }
= [ 6(505) - (18)(140) ] / √{ [ 6(78) - (18)² ] [ 6(3600) - (140)² ] }
r = 0.950
e. df = N – 2 = 6 – 2 = 4
f. Tabular Value = ± 0.811
g. Reject the null hypothesis because the computed value, 0.950, is greater than the
tabular value, 0.811.
h. There is a significant linear relationship between the participants' number of years in
college and their subsequent monthly income. The verbal interpretation of r shows
that there is a very high correlation.
Problem No. 1
There are two variables that need to be studied: weight loss and days spent exercising in one
month. You are given a data set in which individuals have been asked the number of days
they exercise for more than half an hour in one month. Predict the weight loss in 50 days.
No.   Exercise Days (X)   Weight Loss (Y)   XY    X²     Y²
1     0                   4                 0     0      16
2     4                   1                 4     16     1
3     8                   1.5               12    64     2.25
4     12                  2                 24    144    4
5     16                  4                 64    256    16
6     20                  5                 100   400    25
7     24                  2                 48    576    4
∑     84                  19.5              252   1456   68.25
a = [ (∑Y)(∑X²) - (∑X)(∑XY) ] / [ n∑X² - (∑X)² ]
b = [ n∑XY - (∑X)(∑Y) ] / [ n∑X² - (∑X)² ]
a = [ 19.5(1456) - (84)(252) ] / [ 7(1456) - (84)² ] = 2.30
b = [ 7(252) - (84)(19.5) ] / [ 7(1456) - (84)² ] = 0.040
y = a + bx
y = 2.30 + 0.040x
For x = 50 days: y = 2.30 + 0.040(50) = 4.3
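The intercept a and slope b of the regression line can be computed from the same column sums used in the table. A minimal Python sketch with the exercise data from this problem (the function name `least_squares_line` is an illustrative choice):

```python
def least_squares_line(xs, ys):
    """Return (a, b) of the least-squares line y = a + b*x from raw sums."""
    n = len(xs)
    sx, sy = sum(xs), sum(ys)
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    denom = n * sxx - sx**2
    a = (sy * sxx - sx * sxy) / denom   # intercept
    b = (n * sxy - sx * sy) / denom     # slope
    return a, b

days = [0, 4, 8, 12, 16, 20, 24]
loss = [4, 1, 1.5, 2, 4, 5, 2]
a, b = least_squares_line(days, loss)
prediction = a + b * 50                 # predicted weight loss at x = 50 days
print(round(a, 2), round(b, 3), round(prediction, 1))
```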
Problem No. 2
Price 10 12 13 12 16 15
Amount Demanded 48 38 43 45 37 43
No.   Price (X)   Amount Demanded (Y)   XY     X²     Y²
1     10          48                    480    100    2304
2     12          38                    456    144    1444
3     13          43                    559    169    1849
4     12          45                    540    144    2025
5     16          37                    592    256    1369
6     15          43                    645    225    1849
∑     78          254                   3272   1038   10840
a = [ (∑Y)(∑X²) - (∑X)(∑XY) ] / [ n∑X² - (∑X)² ]
b = [ n∑XY - (∑X)(∑Y) ] / [ n∑X² - (∑X)² ]
a = [ 254(1038) - (78)(3272) ] / [ 6(1038) - (78)² ] = 58.58
b = [ 6(3272) - (78)(254) ] / [ 6(1038) - (78)² ] = -1.25
y = a + bx
y = 58.58 - 1.25x
y = 33.58
Problem No. 3
Calculus (X) 85 82 79 85 85 90 85 84 74 81
Statics (Y) 75 76 90 78 92 90 95 85 82 82
No.   (X)   (Y)   XY      X²      Y²
1     85    75    6375    7225    5625
2     82    76    6232    6724    5776
3     79    90    7110    6241    8100
4     85    78    6630    7225    6084
5     85    92    7820    7225    8464
6     90    90    8100    8100    8100
7     85    95    8075    7225    9025
8     84    85    7140    7056    7225
9     74    82    6068    5476    6724
10    81    82    6642    6561    6724
∑     830   845   70192   69058   71847
a = [ (∑Y)(∑X²) - (∑X)(∑XY) ] / [ n∑X² - (∑X)² ]
b = [ n∑XY - (∑X)(∑Y) ] / [ n∑X² - (∑X)² ]
a = [ 845(69058) - (830)(70192) ] / [ 10(69058) - (830)² ] = 94650 / 1680 = 56.34
b = [ 10(70192) - (830)(845) ] / [ 10(69058) - (830)² ] = 570 / 1680 = 0.339
y = a + bx
y = 56.34 + 0.339x
y = 56.34 + 0.339(98)
y = 89.59
Problem No. 4
a = [ (∑Y)(∑X²) - (∑X)(∑XY) ] / [ n∑X² - (∑X)² ]
b = [ n∑XY - (∑X)(∑Y) ] / [ n∑X² - (∑X)² ]
a = 65.14 b = 0.39
y = a + bx
y = 65.14 + 0.39x
y = 76.84
Problem No. 5
Algebra 15 16 12 10 8
Trigonometry 18 11 10 20 17
a = [ (∑Y)(∑X²) - (∑X)(∑XY) ] / [ n∑X² - (∑X)² ]
b = [ n∑XY - (∑X)(∑Y) ] / [ n∑X² - (∑X)² ]
a = [ 76(789) - (61)(902) ] / [ 5(789) - (61)² ] = 22.06
b = [ 5(902) - (61)(76) ] / [ 5(789) - (61)² ] = -0.56
y = a + bx
y = 22.06 - 0.56x
y = 10.