GE 5 Module 4
GE 5 Module 4
Module
in
GE 5
Mathematics as a Tool
Contents Page
Cover Page i
Title Page ii
Table of Contents iii
Introduction to the User iv
Introduction v
Chapter 4 Mathematics as a Tool 1
Overview 1
Learning Outcomes 1
Pre-test 1
Lesson 1 Sampling Techniques
A. Learning Outcomes 2
B. Time Allotment 1
C. Discussion 1
D. Activities/Exercises 4
E. Evaluation/Post-test 5
Lesson 2 Organization and Presentation of Data
A. Learning Outcomes 5
B. Time Allotment 5
C. Discussion 5
D. Activities/Exercises 9
E. Evaluation/Post-test 11
Lesson 3 Measures of Central Tendency
A. Learning Outcomes 12
B. Time Allotment 12
C. Discussion 12
D. Activities/Exercises 13
E. Evaluation/Post-test 16
Lesson 4 Measures of Dispersion
A. Learning Outcomes 17
B. Time Allotment 18
C. Discussion 18
D. Activities/ Exercises 19
E. Evaluation/Post-test 19
Lesson 5 Hypothesis Testing
A. Learning Outcomes 19
B. Time Allotment 20
C. Discussion 20
D. Activities/Exercises 22
References 22
Privacy Notice 22
Student’s Information 22
first the sample size especially if the population is very large. Sampling is allowed to
save time and money, there are several sampling techniques that can be applied in
selecting sample without being accused as biased which will be discussed in this
chapter.
Data presented in the form of graphs and tables looked more interesting and
organized and readers may appreciate reading statistical information on this. This
Mathematics as a Tool
Overview
Module 4 covers topics on sampling techniques, frequency distribution and
graphs. In this chapter, you will learn more on how to organize and present data
using terms and concepts for us to understand statistics comprehensively. It aims to
develop your skills in organizing, interpret/analysing and presenting data in the form
of graphs and table.
Learning Outcomes
At the end of this chapter, you can:
1. differentiate a representative sample from a biased sample;
2. solve problems using slovin’s formula;
3. identify the appropriate way of collecting data;
4. organize data in a table and graphs;
5. construct the most appropriate graph for sets of data;
6. identify and use measures of central tendency and measures of dispersion
appropriate for a given data;
7. make hypothesis and use hypothesis testing in the given statistical problems; and
8. use appropriate test hypothesis in a given statement of a problem;
PRE-TEST
Instructions: To test your prior knowledge patterns, symmetry, sequence and series. Please
provide a yellow paper as your answer sheet.
Column A Column B
1. The process of obtaining sample a. pictograph
2. method by which every element of a population has an equal b. sampling
chance of being included in a sample. c. line graph
3. sampling technique where elements of a population are d. frequency
drawn based on the judgement of the researcher e. random
4. method of data collection from the entire population sampling
5. it is the number of times the datum occurs f. cumulative
6. listing of data that pairs each entry in the list with the number frequency
of times it occurs g. pie graph
7. the number of scores contained in each class h. histogram
8. texts are used in presenting statistical information i. bar graph
9. it refers to diagram or drawings which show the successive j. graph
values of a changing quantity k. table
10. it is the graph used when characteristics are opposite in nature l. textual
and are to be compared. presentation
11. a graph constructed as a sequence of touching vertical rectangles m. class interval
12. it is the graph used when we wish to show how one value changes n. complete
to another value over successive periods of time enumeration
In this lesson, you are given learning outcomes to achieve at the end of the
lesson with time allotment for you to learn time management in learning the topics
on sampling techniques. There is an activity that will assess your learning including
the post test on performance task that is guided with rubrics for scoring.
Learning Outcomes:
Time Allotment: The minimum allotted hours for you to work on this module is 3
hrs. Make sure that you have learned the process, completed the assigned task
and achieved expected learning outcomes in this module.
DISCUSSION
There are several sampling techniques you can apply in selecting samples without
being accused as biased. Generally, we have five (5) methods to choose these are
random sampling, systematic sampling, purposive sampling, quota sampling and
convenience sampling. Let us study each and learn more how to use it.
3
Example: Suppose a researcher wants to determine the average income of the
families in a barangay 5, 000 families, distributed in five purok’s.
Computing for the sample size n at a 5% margin of error
Solution: 𝑛 = = 370
b. Cluster sampling
When the geographical area where the study will be done is too big and the
target population is too large, the cluster sampling technique may be
appropriate. In this technique, the selection of sample units is not by
individual but by groups called cluster. The area will be divided into
clusters, then a desired number of clusters will be selected at random.
4
1.1.5 Convenience sampling
This technique is resorted to by researchers who need the information the
fastest was possible. This method may be fast but it is also biased because
there is a possibility that some respondents may not be included in the study.
Census or complete enumeration is a method of data collection from
the entire population.
Observation is appropriate when data can be collected without any response
from people.
Experiment is used to find out cause and effect relationship
Survey is used when we can obtain data through questionnaires which may be
distributed by hand, face-to-face or telephone interviews.
ACTIVITY 1
Directions: Complete the table using the stratified sampling technique.
Compute first for the sample size using Slovin’s formula at a 5% margin
of error for the hotel listed in the table below: (15 points)
ACTIVITY 2
Directions: Solve the following problems and write your complete solutions in
yellow paper as your answer sheet. (10 points)
1. Compute for the sample size required for each of the following
population:
a. N = 5, 263, e = 3%
b. N = 8, 000, e = 5%
2. Compute for the margin of error to be used if 800 sample units are
required from a population of 2, 400.
3. A researcher plans to get 588 sample units from a population N using a
4% margin of error. What is the value of the population?
4. A biology researcher wants to assign male and female students to an
experimental group and a control group. If he is to choose from 600
students and plans to assume a 5% margin of error, what sample size
would be appropriate?
Directions: Identify the appropriate sampling technique for the following and state how the
data will be gathered. (10 points)
1. Prof. Gonzalez selects 10 students who would represent the class in a seminar.
2. Mrs. Biason obtained the total number of students through the school registrar and was
requested by the administration to conduct a survey about the agreement or disagreement of
students regarding the reduced days of classes.
3. A researcher wants to make a study on coping mechanism of solo parents in the business
industry.
4. You consider your six year old sister as respondent which you will be observing and
interviewing in an ample time during the conduct of your research.
5. If a researcher wants 2000 respondents and he needs to do the conduct of the study within
a short timetable.
Learning Outcomes:
Time Allotment: The minimum allotted hours for you to work on this module is 6
hrs. Make sure that you have learned the process, completed the assigned task
and achieved expected learning outcomes in this module.
DISCUSSION
When texts are used in presenting statistical information data are presented in
phrase, sentence or paragraph form. However, textual presentation would not be of much
use as basis for analysis and interpretation.
Example:
There are 353 respondents were interviewed in five (5) purok in the barangay. 95 families were chosen
in purok 1, 46 from purok 2, 60 from purok 3, 71 from purok 4 and 81 from purok 5 for a total of 353
families. As to how they are selected, lottery method was applied.
The data types are frequency distribution, correlated data and time series data. Data that are
presented in a frequency distribution are called grouped data which has more than 30
numbers of observations.
A frequency distribution table is a device for organising and presenting data and frequency
is the number of times datum occurs. The arrangement by rows and column helps you to
make comparisons of the data more easily. It is a listing of data that pairs each entry in the
list with the number of times it occurs.
1. Find the range R using the formula: R = Higher score – Lowest Score
2. Compute for the number of intervals n by using the formula n = 1 + 3.3log N
where n = number of class intervals,
N = population or total number of observations
3. Compute for the class size i using the formula i = R/n
4. Using the lowest score as lower limit, add (i – 1) to it to obtain the
highest limit of the desired class interval
5. The lower limit of the second interval may be obtained by adding the
class size to the first interval. Add (i – 1) to the result to obtain the
higher limit of the second interval.
6. Repeat step 5 to obtain the third class interval and so on ad so forth.
7. When n class intervals are completed, determine the frequency for each
class interval by counting the elements.
25 28 27 30 32 25 31 26 29 6 31 20 21 32 18 50 53 60
50 54 45 40 37 25 20 27 32 24 29 30 25 24 10 12 15 28
Solution: 1. R = 60 – 6 = 54
2. n = 1+3.3 log 36 = 1+3.3(1.556) = 6.1 = 6 lowest score/lower limit
3. i = =9
4. 6 + (9-1) = 6 + 8 = 14 higher limit, so 6-14 is the first interval
Class Mark is the midpoint of a class interval. To obtain this point, add the lower limit and
the upper limit, then divide the sum by two.
Class Boundaries it is also known as exact limits¸ this can be obtained by subtracting 0.5
from the lower limit of an interval and adding 0.5 to the upper limit of the interval.
Relative frequency is also known as the percentage frequency. To compute, divide the
frequency of each class interval by the total number of observations N, then multiply the
results by 100.
Correlated Data is a table where two or more frequency are shown for easy comparison.
Example:
Time Series Data is a table where the data shows the change of a variable over a period of time .
Example:
Population of Region A, 2010-2020
Year Population
1 100 000
2 150 000
3 180 100
4 200 500
5 250 505
6 260 000
7 261 000
8 300 000
9 340 000
Graphs refer to diagram or drawings which show the successive values of a changing quantity.
Kinds of Graphs
1. Bar Graph
Rectangular figures or bars are used to show variations in the frequencies of
observations and it is usually presented to compare data or to determine
which class or interval is common or appear frequently in the text.
2. Line Graph
This graph is presented using a continuous line to show how one value
changes with respect with another one over successive period of time.
3. Pie Graph
One of the kinds of graphs that can be applied for grouped data is the
frequency histogram, it is represented by points in the vertical axis and the
class interval in the horizontal axis. The histogram is constructed as a
sequence of touching vertical rectangles which is drawn with a height
corresponding to the frequency.
5. Picto graph
In this graph the frequency of each item is represented by pictures or
graphics. It looks more interesting and attractive than frequency
distribution.
7. Frequency Polygon
Unlike in the frequency histogram where rectangles drawn side by side are
used, points connected by line segments are utilized in the frequency
polygon. It looks like line graph except for the labels in the horizontal axis
which are class intervals.
Criteria Points
Heading or title of the graph 2
Legends/label/ name of the content of each graph 3
Correctness of form or the graph 10
Textual Presentation 10
TOTAL 25
POST TEST
This test will measure your mastery in Lesson 2 covered in this module.
Directions: Copy the given below, complete the frequency distribution table
by writing the values for each column (15 points)
Learning Outcomes:
Time Allotment: The minimum allotted hours for you to work on this module is 3
hrs. Make sure that you have learned the process, completed the assigned task
and achieved expected learning outcomes in this module.
DISCUSSION
Four students are vying for scholarship. The College has agreed to give the
scholarship to the top student last semester. The table below gives the grades of the
four applicants in the five core courses.
A. Determine the top student by taking the arithmetic mean grade for each
student. (10 points)
B. Avel says that ranking must be used. As he top in three subjects, then he must
be the top student. For each subject, rank the student 1,2,3 and 4 and then add
up the ranks. Determine if Avel is the student with the lowest (best) ranking.
Use the table below. (20 points)
a. Weighted Mean
Example:
where f is the frequency of each data and xm is the class mark for ech
interval i. n is the number of the class intervals.
c. Midrange is the value midway between the endpoints of the set of data. It
is found by adding the least value in the set of data to the greatest value
and dividing the sum by 2.
Example:
The frequency distribution of the results of the short quiz in GE 5 Class is shown
below:
Score 10 11 12 13 14 15 16 17 19 21
Frequency 2 8 6 5 1 2 7 2 1 1
Midrange = L + G = 10+21 = 31 = 15.5
2 2 2
Example:
Median = xmd + ( )
Where xmd is the lower class boundary of the median class, <cfb less than
cumulative frequency below the median class, i is the class interval and n
is the total number of observations.
e. Mode
Example:
The grouped frequency distribution of scores of students in a math examination is
shown below. x f Lower class
45-49 2 44.5
40-44 7 39.5
35-39 11 34.5
30-34 13 = fmo 29.5 = xmo
25-29 12 24.5
20-24 8 19.5
15-19 2 14.5
i=5 n = 55
Mode = xmo + ( )
Where xmo is the lower boundary of the modal class, is the difference
between the frequency of the modal class and the frequency above it, is
the difference between the frequency of the modal class and the frequency
just below it, i is the class interval and n is the total number of
observations.
16
The table below shows the measures of central tendency that are used
using a specific level of measurement and their advantage and
disadvantages.
ACTIVITY 2:
Find the mean, median and mode of the height of the Covid 19 patients in
Hospital Z. (30 points)
POST TEST:
This test will measure your mastery in Lesson 3 covered in this module.
Directions: Read the following carefully. Do what is indicated.
17
B. Consider the bar graph below and answer the following. (5 points)
CAF… Population
CFAS
CCJE
CET
Ced
CBM
CAS
0 500 1000 1500 2000 2500
Learning Outcomes:
Time Allotment: The minimum allotted hours for you to work on this module is 3
DISCUSSION
The terms variability, spread and dispersion are synonyms. Measures of dispersion tell
us how much the data tend to disperse or scatter.
4.1.1 Range
The range for a set of data is found by subtracting the smallest value from the largest
value in the given set of data. R = HS - LS
Example: 18
The grades of the ten students are show below. The respective range for male and for female
students is also shown in the table. Note that the man is 85.
Female 82 83 85 87 88 Range= HS-LS = 88-82 = 6
Male 70 82 86 90 97 Range= HS-LS = 97-70 = 27
Answer this!
a. If the range is zero, what can you say about a set of data?
b. What can you say about the data with respect to the mean, if the measure of
variability is small?
c. If the measure of variability is large, what can you say about the data with respect to
the mean?
4.1.2 Variance
The two commonly uses measures of variation that consider all scores in a given set
of data are variance and standard deviation.
The measures of dispersion that eliminates negative signs by squaring all deviations
of each number from the mean and getting the average squared deviation is the variance.
∑
= variance in a populationI, where m is the population mean and n is the
number of cases
∑
= variance in a sample, where x is the sample mean and n-1 is the
number of cases which gives an unbiased estimate of variance
Example:
The grades of the ten students are show below. The respective range for male and for female
students is also shown in the table. Note that the man is 85.
Male 70 82 86 90 97
Note: Calculating variance is an important part of many statistical applications and analyses.
It is the first step in calculating standard deviation.
The standard deviation formula is obtained by simply getting the square root of the
variance. The formula for the standard deviation in a population is given by the parameter
∑
= √ where m is the population mean and n is the number of cases
The formula for the standard deviation in a sample is given by the parameter 19
∑
= √ variance in a sample, where x is the sample mean and n-1 is the
Example:
The grades of the ten students are show below. The respective range for male and for female
students is also shown in the table. Note that the man is 85.
Female 82 83 85 87 88
ACTIVITY 1
Give the mean, range, variance and standard deviation of scores of Raynold in eight
basketball games; 16, 18, 14, 19, 17, 20, 22, 18. (10 points)
POST TEST
Give the mean, range, variance and standard deviation of the following.(15 pts.)
1) 12, 13, 15, 15, 16, 18, 21
2) 200 mg, 200 mg, 210 mg, 210 mg, 220 mg, 220 mg
3) Machine A and B produce balls. The standard deviations of the weights of the
identical balls produced by machine A and B are 1 mg and 2 mg, respectively.
Which machine produces balls with consistent weight? Explain your answer.
Learning Outcomes
At the end of the chapter, you can:
1. show how inferential statistics is used to reach conclusions from the analysis
of the obtained data;
2. determine the most appropriate statistical tool and procedure in testing data;
3. testing hypothesis concerning one population mean; and
4. testing hypothesis concerning two population means .
20
Time Allotment:
The minimum allotted hours for you to work on this module is 3 hrs. Make sure that you
have learned the process, completed the assigned task and achieved expected learning
outcomes in this module.
DISCUSSION
Test statistics – a sample statistics computed from the data. The value of the test
statistics is used in determining whether or not the null hypothesis is rejected
Critical or rejection region – a range of test statistics values for which the null
hypothesis should be rejected. This range of values will indicate that there is a
significant or large enough difference between the hypothesized parameter value and
the corresponding point estimate for the parameter.
Critical value – first value in the critical region. The set of values that are not in the
critical region is called the region of acceptance (noncritical or non-rejection).
Illustrative examples
1. New software is being integrated into the teaching of a course with the hope that it will
help to improve the overall average score for this course. The historical average score
for this course is 70.
Ho : the overall average score for this course is 70. (μ = 70)
Ha : The overall average score for this course is higher than 70. (μ > 70)
2. A real estate agent claims that the average price for socialized houses in a certain
subdivision is P 1.8 M. you believe that the average price is lower. You plan to test his
claim by taking a random sample of the prices of the homes in the subdivision;
formulate the set of hypotheses.
Ho: The average price for socialized houses in the subdivision is P 1.8 M.
(μ = P 1.8 M )
Ha : the average price for socialized houses in the subdivision is lower than P 1.8 M .
(μ < P 1.8 M )
3. An advertisement on the TV claims that a certain brand of tire has an average lifetime
of 50,000 miles. Suppose you plan to test this claim by taking a sample of tires and
putting them on test. What is the correct set of hypotheses to set up?
Ho : the average lifetime of this brand of tire is 50,000 miles.
(μ = 50,000 miles )
Ha : The average life time of this brandWPU-QSF-ACAD-82A
of tire is not 50,000 miles.
Rev. 00 (09.15.20)
(μ ≠ 50,000 miles )
Type 1 and Type 11 Errors
Notes:
A type 1 error will be committed when the true null hypothesis is rejected.
A type 11 error is committed when a false null hypothesis is not rejected.
A. One –tailed Test: a test of any statistical hypothesis where the alternative
hypothesis is one- sided. This is also known as Directional Alternative
Hypothesis. It could be left-tailed or right-tailed. ( Ha : > or < ).
B. Two –tailed Test: A test of any statistical hypothesis where the alternative
hypothesis is two-sided. Non-directional Alternative Hypothesis is concerned
with the two sides of the distribution. ( Ha : ≠ )
ACTIVITY 1
Directions: State the appropriate null hypothesis (Ho ) and alternative hypothesis (Ha ) for
each of the following. ( use another paper for your answers )
5.2 t-TEST 23
= √𝒏
where
t= t –value
x= sample mean
µ= population mean
s= sample standard deviation
n= number of cases less than 30
ACTIVITY 1:
PROBLEM #1
1. H0:_____________________________________________________________________________
_______________________________________________________________________________
4. Computation:
5. df =
6. t- value (table for t- value) tv = __________
24
7. Decision: ____________________________________________
_____________________________________________________
_____________________________________________________
_______________________________________________
8. Conclusion: __________________________________________
_____________________________________________________
_____________________________________________________
_______________________________________________
___________________________________________________
2. The average length of time for people to vote using the old procedure during a
presidential election period in precinct A is 55 minutes. Using
computerization as a new election method, a random sample of 20 registrants
was used and found to have a mean length of voting time of 30 minutes with a
standard deviation of 1.5 minutes. Test the significance of the difference
between the population mean and the sample mean.
PROBLEM #2
1. H0:_____________________________________________________________________________
_______________________________________________________________________________
H1:_____________________________________________________________________________
________________________________________________________________________________
2. = _____
3. _________________________________________
4. Computation:
Reference
Marcelino, D.B. & Ocampo, S.R.,2011, “Contemporary Mathematics”
Lorimar Publishing, Inc.
Parreño, E.B. & Jimenez, R.O.,2006, “Basic Statistics: A Worktext” C
& E Publishing, Inc.
Freund, J.E., 2003,“Modern Elementary Statistics” (10th edition),
Pearson Education, Inc.
Walpole, R.E., reprinted 2002,“Introduction to Statistics” (3rd
edition), Pearson Education, Inc.
Ferguson, G.A. & Takane,Y., 1989, “Statistical Analysis in Psychology
and Education” (6th edition), McGraw Hill Book Co.
Privacy Notice
For this module, we collect your names, year and section, contact number, facebook
account and messanger account when you register for purposes of coordination. All
personal information collected will be stored in a secure location and only authorized
staff will have access to them.
Student’s Information
Name:
Program:
Year and Section:
Contact No.:
E-mail address:
Facebook Account:
WPU-QSF-ACAD-82A Rev. 00 (09.15.20)
Messenger Account:
Vision 2020
WPU: the leading knowledge center for sustainable
development of West Philippines and beyond.
Mission
WPU commits to develop quality human resource and green
technologies for a dynamic economy and sustainable
development through relevant instruction,
research and extension services.
Creativity