Teaching Learning and Applying Geometry
Teaching Learning and Applying Geometry
Teaching Learning and Applying Geometry
HANDLING DATA
Written by:
1
PBMa121: LEARNING, TEACHING AND APPLYING GEOMETRY AND HANDLING
DATA
COURSE DESCRIPTION
Geometry is a critical component of mathematics education because student teachers are required
to relate concepts from geometry to geometric phenomena. It provides the necessary mathematical
tools for complex reasoning and solving problems in the sciences, technology, engineering, and
many skilled trades and professions.
Handling Data also provides tools for describing variability in data and for making informed
decisions. This course is designed to develop and consolidate the basic mathematical knowledge
and skills in the domain of Geometry and Handling Data taking into account uses of mathematics
in different local contexts as well as exploring learners’ misconceptions and difficulties in these
domains. Student teachers will be required to demonstrate good understanding of all the areas
covered by the senior high school core mathematics, especially areas where the chief examiners’
reports have highlighted as difficult. There is the need to do auditing of subject knowledge to
establish and address student teachers’ learning needs, perceptions and misconceptions in
Geometry and Handling Data.
These areas include, but not limited to, bearing – representing the given information on a correct
diagram; circle geometry and its applications; mensuration of plane and three dimensional shapes;
drawing required diagrams correctly; geometrical construction; geometry and basic trigonometry
with applications; representation of information in diagrams; congruence and similarities; finding
angles and distances; global mathematics, introductory statistics and probability; cumulative
frequency curve; drawing and reading from graphs; reading and answering questions from graphs;
probability: meaning and application in real-life situations.
The student teacher will also be required to demonstrate the ability to identify how their own
individual characteristics (culture, ethnicity, religion, family constellation, socio-economic
background, dis/ability, etc.). Differentiated approach to teaching will be used to ensure that
student teachers will be supported in the area of Geometry and Handling Data. The course will
focus on mathematical content on one hand and the strategies and learning experiences in doing
mathematics on the other hand. These will be combined to form an integrated instructional
approach that addresses the course learning outcomes. The instructional strategies will pay
attention to all learners, especially girls and students with Special Education Needs. The course
will be assessed using a variety of assessments methods including coursework (assignments,
quizzes, project works, and portfolio entries with presentation) and end of semester examination
to provide a comprehensive outlook of student teachers’ competencies and skills. References are
made to the following (NTS, 2b, 2f, 3j, 3m) and (NTECF p.30, p.39).
2
COURSE LEARNING OUTCOMES (CLOS) WITH INDICATORS
Outcomes Indicators
On successful completion of the course, 1.1 Select and use the most appropriate mathematical method(s)
student-teachers will be able to: or heuristics in carrying out tasks/exercises/problems in
1. Demonstrate deep understanding of Geometry and Handling data within the basic education
key mathematical concepts in mathematics foundation list.
Geometry and Handling Data 1.2 Make connections between mathematical concepts in
content domains in the basic school Geometry and Handling Data content domains and applying
mathematics curriculum them to solve real-life problems.
(professional values, knowledge & 1.3 Identify and resolve mathematics related learning
practice) (NTS, 2b) difficulties within Geometry and Handling Data content
domains such as inability to visualise geometrical shapes.
2. Use manipulatives and other TLMs 2.1 Use manipulatives and other TLMs in developing Geometry
including ICT in a variety of ways in and Handling data concepts.
learning mathematics concepts in 2.2 Use ICT as a tool in developing Geometry and Handling data
Geometry and Handling data concepts. E.g. Geometer Sketchpad, Geogebra
(practical skills, digital literacy, 2.3 Use drawing tools to conduct geometrical investigations
problem solving) (NTS, 3j); emphasising visualization, pattern recognitions and
conjecturing.
2.4 Solve mathematics problems using manipulatives and/or
technology related
strategies in a variety of ways.
3. Demonstrate value as well as 3.1 Both tutors and student-teachers do individual reflection on
respect equity and inclusivity as their knowledge of Geometry and Handling Data.
well as core skills in the 3.2 Identify and reflect on core skills applied in the mathematics
mathematics classroom classroom
(knowledge)(NTS, 2f) 3.3 Appreciate the contributions of, and supports, colleagues in
the mathematics classroom.
3.4 Cooperate with colleagues in carrying out mathematics tasks
in Geometry and Handling Data.
3.5 Engage in reflective thinking about how mathematics was
taught in student-basic and high school days.
4. Demonstrate awareness of socio- 4.1 Reflect and show how student-teachers’ mathematics
cultural issues in teaching and history influences their views of mathematics and its
learning mathematics in the content learning.
domains of Geometry and Handling 4.2 Identify appropriate TLMs for teaching topics in Geometry
data (knowledge) (NTS, 2f). and Handling data
4.3 Identify and use manipulates in Geometry and Handling data
lessons
3
COURSE CONTENT
4 Vectors and Algebra of vectors, vector Using worksheets on bearing to explore the
Bearing: representation notation components relationship between angles in bearing and
back bearing
4
Learning, of vector, vector operations, Discussing translation of word problems
teaching and magnitude and direction of a vector, into mathematical statements in vectors and
applying Teaching types of bearings and their bearing
applications
Mathematical vocabulary related to
vectors and bearing
5 Mensuration: Teaching parts of a circle. Project work – individual/group
Learning, Teaching measurement of length presentations on the application of circle
teaching and (arc length, radius, diameter, chord) concepts in real life situation
applying Teaching area of a sector, area of
segment, volume of cone, cylinder.
Application of mensuration in real
life problems
6 Global The earth as a sphere, lengths on Tutor led presentations on lengths on a
Mathematics: latitudes and longitudes. sphere
Learning, Using worksheets for practical investigation
teaching and to distinguish between latitudes and
applying longitudes
7 Introductory Teaching collection of data, Project work – individual/group
Statistics measures of central tendencies, presentations on data collection
(Patterns in measures of dispersion, graphical Discussion on establishing the relationship
data): representation (cumulative between the measures of central tendencies
Learning, frequency) and measures of dispersion
teaching and
applying
8 Basic Teaching basic concepts of Interactive collaborative group work on
probability: probability: sample space, events, probability.
Learning, mutually exclusive and independent Exploring the concept of probability through
teaching and events. experiments.
applying Applications to real life situation. Different ways of presenting probability
through games in mathematics lessons.
5
COURSE ASSESSMENT (EDUCATIVE ASSESSMENT: OF, FOR AND AS LEARNING)
COMPONENT 1: Examination
Summary of Assessment Method:
Students should be summatively assessed by an examination linked to the themes listed below:
• knowledge, understanding and applications of the key mathematical concepts in Geometry and
Handling Data within the basic school mathematics curriculum.
• use manipulatives and other TLMs including ICT in a variety of ways to establish Geometry and
Handling Data concepts in the classroom
• how their mathematics history influences their views of mathematics in the realm of social context
and how this affects their way of learning mathematics.
• relevant professional values and attitudes for teaching mathematics at basic school level
Weighting: 40%
Assesses Learning Outcome(s): CLO 1, 2, 3, 4; (NTS 2b, 2f, 3j)
Component 2: Coursework 1
Summary of Assessment Method:
Individual Assignments with Presentations: Student teachers may be asked to
• use ICT tools to conduct geometrical and statistical investigations emphasizing visualization, pattern
recognitions, conjecturing etc. in a variety of ways.
• select the most appropriate mathematical method(s) or heuristics (i.e. using mental strategies, models,
paper and pencil, etc.) in carrying out tasks / exercises / problems in Geometry and Handling Data in
the basic school mathematics curriculum.
• reflect on how Geometry and Handling Data were taught in their basic school days and compare with
current practice in basic schools.
• reflect on the core skills and competencies (e.g. communication and collaboration, critical thinking
and problem solving, digital literacy) teachers need to develop to make them good teachers.
• engage in peer assessment on awareness of core skills and competencies needed to enhance own
strengths and address limitations regarding the teaching and learning of Geometry and Handling Data.
Weighting: 40%
Assesses Learning Outcome(s): CLO 1- 4 (NTS 2b, 3j)
Component 3: Coursework 2
Summary of Assessment Method:
Self-Assessment (as part of their portfolio): Student-teachers should be given an assessment tool or
questionnaire at the onset and the end of the course to
• do self-assessment and compare their attitude towards learners, mathematics teaching and readiness
to support learners who have misconceptions or struggle with the subject.
6
• do self-assessment and compare their value as well as respect for equity and inclusivity in the
mathematics classroom.
• reflect critically on their own learning experiences and use them to plan for their own continuous
personal development.
• identify and reflect on mathematics related learning difficulties within the context of Geometry and
Handling Data.
Weighting: 20%
Assesses Learning Outcome(s): CLO 3, 4 (NTS 1a, 2f)
7
UNIT ONE
PLANE GEOMETRY (PATTERNS IN SHAPE): LEARNING, TEACHING AND
APPLYING
Introduction
You are welcome to the first unit of this course in Learning, Teaching and Applying Geometry.
This unit looks at fundamental concepts in Geometry of which angles, triangles, quadrilaterals,
polygons and tessellations are introduced.
You will agree with me that in a fundamental sense, geometry is a natural outgrowth of our
exposure to the physical universe and in particular to the natural world. In your interaction with
your environment, you encounter physical shapes, such as buildings and other objects. These you
can organize by Patterns into groups and classes. You can put the sun and moon into ‘’round’’
category and building into another separate category. The concepts of point, lines, planes, rays,
lines and line segment play an important role in geometry. They provide clarity, and consistency
in categorizing geometrical shapes. Have you read about them? Do not get worried if you have not
done so. This unit will introduce you to these concepts. It is our hope that you will have no
difficulties in understanding the contents of the unit.
In this Unit, we will look at Learning, Teaching and Applying Geometry made up of the following
Sections:
Section 1.1 Angles at a point and angles and parallel lines
Section 1.2 Angles and triangles and their properties
Section 1.3 Properties of quadrilaterals and other polygons
Section 1.4 Learning about 3- Dimensional shapes: comparing polyhedral, forming 3-dimensional
shapes
Section 1.5 Learning about 2-dimensional shapes.
Section 1.6 Tessellations and applying these to the teaching of the JHS Mathematics curriculum
It is my hope that after studying this unit you will:
8
• Use manipulatives and other TLMs and models in a variety of ways in developing and
learning geometrical concepts.
• Develop an appreciation for geometry as a means of describing the physical world
• Demonstrate awareness of the world outside the classroom as a rich source of geometrical
ideas
I think this unit will equip you with enough knowledge that can help you improve your teaching
of geometry in school mathematics.
Let us move on to section 1 of this unit.
9
SECTION ONE: ANGLES AT A POINT AND ANGLES AND PARALLEL LINES
Introduction: This section aims to present to you Angles at a point and angles and parallel lines
Objectives: After completing this section 1 you will
• Interpret and analyse definitions of given plane geometry concepts that appear in the basic
school mathematics curriculum.
• Identify, select (or design) and use manipulatives and other TLMs and models to develop
geometrical concepts, as well as, suggest such materials can be used to teach given
concepts in the basic school curriculum
• Outline the usefulness of geometry and justify why it should be taught both orally and in
writing
What fundamental concepts related to points, lines and angles and their properties do you know?
What riddles do you know about points, lines, line segments, rays, and angles?
10
first, and another point on a ray.
Plane: a plane is a collection of points lying on a flat surface, which extends indefinitely in all
directions.
K
B B
A
m D
Parallel lines are equidistant (the same distance) from each other and never intersect. Right angles
are formed when two lines intersect and are perpendicular to each other. The symbol for a right
angle is a small square at the vertex of the angle (the endpoint shared by the two segments that
form the angle).
When segments, rays or lines meet or intersect, angles are created. Angles whose measure is
between 00 and 900 are called acute angles. A 900 angle is called a right angle. If the measure of
an angle is between 900 and 1800, the angle is an Obtuse angle. An angle whose measure is exactly
1800 is called a straight angle.
Straight angnles
Right angles
Acute angles
Obtuse angle
11
A protractor is used to measure angles. There are two rows of numbers on a protractor. Place the
crosshairs of the protractor on the vertex of the angle so that the horizontal line of the crosshairs
lies on one of the rays of the angle. The other ray of the angle will fall on one of the angle. The
other ray of the angle will fall on one of the numbers of the protractor. If the ray lands on the
numbers 700 and 1100, the measure of the angles is 700 if the angle is acute or 1100 if it is obtuse.
The angle in the next diagram Measures 1100 because it is an obtuse angle.
Use your protractor to verify these measurements. The rays that form angles continue indefinitely.
Extend the rays if you need more length to measure an angle. If the sum of two angles is 900, the
angles are complementary angles. Think of complementary and corner. The angles form a right
angle like the inside corner of a room. The complement of a 67° angle is a 23° angle (67°+ 23° =
90°). The complement of a 12° angle is a 78° angle (12° + 78° = 90°). If the sum of two angles is
180°, the angle are supplementary angles. Think of supplementary and straight. The angles form
a straight angle that appears as a straight line. The supplementary of a 146° angle is a 34° angle
(146° + 34° = 180°). The supplement of a 89° angle is a 91° angle (89° + 91° = 180°). Two
intersecting lines form four angles. Vertical angles are equal in measure. Angles that have the sane
measure are congruent. In the diagram below, angle a and c are vertical angles, as are angles b and
d. Any two of these angles are either congruent or supplementary. Suppose the measure of angle
a is 40°. That makes angle b its supplement with a measure of 140°. Angles b and c are
supplementary. If angle b measures 140°, then angle c measures 40°. You can continue this
discussion for angles c and d and for angles d and a. In this case, the pair of vertical angles a and
c are 40°, and angles b and d are each 140°.
b
c
a
d 12
b
When two parallel lines are cut by a transversal (a line that cuts across parallel lines), eight
(separate) angles are formed. Any two of these angles are either supplementary or congruent.
1 2
k
3 4
5 6
m
7 8
r
Imagine that you can place line m on k. then the pairs of angles 1 and 5, 2 and 6, 3 and 7, and 4
and 8 are called corresponding angles and are congruent. Angles 3 and 6, and angles 4 and 5, are
called alternate interior angles. They are on opposite sides of the transversal and are between the
two parallel lines. These are congruent. Angles 2 and 7, and angles 1 and 8, are called alternate
exterior angles. And they also are congruent. These angles are on opposite sides of the transversal
but lie outside the parallel lines. If the measure of angle 3 is 30°, you can find the measure of angle
8. You know that angles 3 and 7 are corresponding angles and that angles 7 and 8 are
supplementary. Thus angle 8 measures 150° 𝑡ℎ𝑎𝑡 𝑖𝑠 (180° − 30° = 150°).
Activities.
Activities
1. Find the angles marked a, b, and c in the diagram below.
c
a
100
………………………………………………………………………………………………………
13
………………………………………………………………………………………………………
………………………………………………………………………………………………………
………………………………………………………………………………………………………
………………………………………………………………………………………………………
………………………………………………………………………………………………………
2. Find the sizes of the angles marked a, b, c and d in the diagram below.
b d
70
°
………………………………………………………………………………………………………
………………………………………………………………………………………………………
………………………………………………………………………………………………………
………………………………………………………………………………………………………
………………………………………………………………………………………………………
Summary
Geometry is built upon a series of undefined terms. These terms are those which we accept as
known in order to define other terms.
Point: Although we represent points on paper with small dots, a point has no size, thickness, or
width.
Line: A line is a series of adjacent points which extends indefinitely. A line can be either curved
or straight; however unless otherwise stated, the term ‘’line’’ refers to a straight line.
Plane: A plane is a collection of points lying on flat surface, which extends indefinitely in all
directions.
14
The ceiling and floor of a room suggest parallel planes while the ceiling and side wall of a room
suggest perpendicular plane.
Ray: A ray is a series of points that lie to one side of a single endpoint.
Angle: An angle is a degree measure of a turn.
Vertical angles: Vertical angles are formed when two lines intersect. These angles are equal
Adjacent angles: Adjacent angle are two angles with a common vertex and a common side, but
no common interior points.
Right angle: A right angle is an angle whose measure is 90°
An acute angle: An acute angle is an angle whose measure is larger than 0°, but less than 90°
An obtuse angle: An obtuse angle is an angle whose measure is larger than 90° but less than 180°
A straight angle: A straight angle is an angle whose measure is 180°. Such an angle is, in fact, a
straight line.
A reflex angle: A reflex angle is an angle whose measure is greater than 180° but less than 360°.
Complimentary angles: Complimentary angles are two angles whose measures total 90°
Supplementary angle: Supplementary angle are two whose measures total 180°
Congruent angles: Congruent angles of equal measure.
Classifying angles according to their relative measures through exploration and mathematical
discourse.
15
SECTION TWO: ANGLES AND TRIANGLES AND THEIR PROPERTIES
Introduction
Dear learners, you are warmly welcome to unit 1 of section 2. In this section, we would discuss
angles and triangles and their properties. We advise that you investigate these properties with your
pupils in class as a prospective teachers.
Indicators: after going through this section you will be able to:
• Interpret and analyse types and properties of angles and trinagles of given plane geometry
concepts that appear in the basic school mathematics curriculum.
• Identify, select (or design) and use manipulatives and other TLMs and models to develop
geometrical concepts, based on angles, triangles and their properties.
• Outline the usefulness of geometry and justify why it should be taught both orally and in
writing
Exploring different ways of classifying triangles and their properties. (Paper folding, designs in
fabrics, etc)
TRIANGLES
Suppose we are given the lengths of three lines segments. Can we build a triangle with these
segments? This introduces us to an interesting property of triangles which states that the sum of
the lengths of any two sides is always greater than the length of the third side. You can have a
triangle whose sides are 4, 5, and 7 because (4+5) > 7, (4+7) > 5, and (5+ 7) > 4. You cannot have
a triangle whose sides are 3, 4, and 10 because (3+ 4) < 10. Take thin strips of paper that are 3
inches, 4 inches, and 10 inches long. Then try to arrange the strips to create a triangle. It can’t be
done!
16
Tell whether it is possible to form a triangle with sides of the given lengths.
1 1 3
3. 33, 19, 14 4. 122, 282, 154
All triangles have three angles and three sides. An equilateral triangle has three congruent sides
and three angles equal in measure. An isosceles triangle has two congruent sides and two congruent
angles. A scalene triangle has three sides of different lengths. One way to classify triangles is by
how many congruent sides they have. Scalene triangles have no congruent sides; isosceles
triangles have at least two congruent sides; and equilateral triangles have three congruent sides.
A second way to classify triangles is by their largest angle. All triangles have at least two acute
angles. A right triangle has one right angle; an obtuse triangle has one obtuse angle; and all the
angles are acute in an acute triangle. By combining the terms acute, obtuse, and right with
equilateral, isosceles, and scalene, we can describe any triangle. Therefore we can classify
triangles according to both sides and angles. Sketch an example of each of the following, or write
‘’impossible’’ if appropriate.
ACTIVITIES
1. Explain why it is impossible to have a triangle whose three sides are 6cm, 8cm, and 15cm.
2. Find the measure of the third angle of the triangle is the first two angles contain
a) 60°, 40° b) 100°, 20° c) 50°, 70°
3. Show whether the three angles can be the three angles of a triangle.
a) 30°, 70°, 80° b) 70°, 80°, 90° c) 30°, 110°, 40°
4. Find the number of degrees in each angle of an equiangular triangle.
17
5. Is it possible to have: a) two right angles? b) two obtuse angles? c) one right and one obtuse
angle? Why and why not?
6. What is the sum of the measures of the two acute angles of a right triangle?
7. If two angles in one triangle contain the same number of degrees as two angles in another
triangle, what must be true of the third pair of angles in the two triangles? Why?
8. In each part find the measures of the three angles of a triangle whose measures are in the
given continued ratio.
a) 1: 2 : 3 b) 1: 4 : 7 c) 2 : 3 : 4 d) 4 : 5 : 9 e) 1 : 3 : 4
9. In a triangle the measure of the second angle is 3 times the measure of the first angle, and
the measure of the third angles is 5 times the measure of the first angle. Find the number
of degrees in each angles of the triangle.
10. The longest side of a triangle is opposite the angles; the shortest side is opposite the smallest
angle.
11. If one angle of a triangle is obtuse, can another also be obtuse? Why or why not?
12. If one angle in a triangle is acute, can the other two angles also be acute? Why or why not?
13. Can a triangle have two right angles? Why or why not?
14. If a triangle has one acute angle, is the triangle necessarily acute? Why or why not?
Summary
• Triangles are closed figures containing three angles and three sides area is the measure of
the total surface of an object.
• When two straight lines meet at a point, they form an angle.
• Angles in any triangle add to 180°
18
Introductions
Hello our hardworking learner, you welcome to section unit 1 of section three. In the previous
section, you learnt about angles and triangles and their properties. We hope you have revised
section two and other previous sections extensively. In this section, you will learn about the idea
of properties of quadrilateral and other polygons.
INDICATORS: After going through this section you will be able to:
• Interpret and analyse definitions and properties of quadrilaterals and other polygons that
appear in the basic school mathematics curriculum.
• Identify, select (or design) and use manipulatives and other TLMs and models to develop
properties of quadrilaterals and other polygons, as well as, suggest such materials can be
used to teach given concepts in the basic school curriculum
• Outline the usefulness of quadrilaterals and other polygons and justify why it should be
taught both orally and in writing.
Parallelograms
If a quadrilateral has two pairs of opposite sides that are parallel, then the quadrilateral is a
parallelogram. A rectangle has two pairs of parallel sides, four right angles, and congruent
diagonals. A square is a special instance of a rectangle that has four congruent sides, four congruent
right angles, and congruent diagonals. A rhombus is a parallelogram with four congruent sides.
Trapezoid
If a quadrilateral has one pair of parallel sides, it is a trapezoid. Depending on how it is constructed,
19
a trapezoid may or may not have a right angle. It will be an isosceles trapezoid where the
nonparallel sides are equal in length and the base angles are equal.
A rectangle is parallelogram in which all four angles are right angles. A rhombus is also a
parallelogram in which all the sides are equal in length. A square is a rectangle all of whose sides
are equal in length. We can also describe a square as a rectangle which is a rhombus. It is also
possible to describe a square as a rhombus which has four right angles.
POLYGONS
Poly- is Greek words for ‘’many’’ and –gon comes from the Greek word for knee, bend, or angle.
Polygons get their names from the number of angles they have. Since they have the same number
of angles as sides, we identify a polygon by the number of sides it has.
Numbers of Names of
Angles Figure
3 Triangle
4 Quadrilateral
5 Pentagon
6 Hexagon
7 Heptagon
8 Octagon
9 Nonagon
10 Decagon
11 Undecagon
12 Duo-decagon
20
N n-gon
SUMMARY
1. Rectangles, rhombuses, and squares are members of the family of parallelograms.
Therefore, any property of the family of parallelograms must also be a property of
rectangles, rhombuses, and squares.
2. A square is a member of the family of rectangles. Therefore, any property of the family of
rectangles must also be a property of squares.
3. A square is a member of the family of rhombuses. Therefore, any property of the family of
rhombuses must also be a property of square.
4. All squares are parallelograms, but not all parallelograms are squares.
21
5. All squares are rectangles, but not all rectangles are squares.
6. Every square is a rhombus, but not all rhombuses are squares.
7. Opposite sides of a parallelogram are equal in length (or congruent).
8. Opposite angles of a parallelogram are equal in measure (or congruent).
9. Consecutive angles of a parallelogram are supplementary.
22
SECTION FOUR: LEARNING ABOUT 3-DIMENSIONAL SHAPES: COMPARING
POLYHEDRAL, FORMING 3- DIMENSIONAL SHAPES.
Introduction
You are welcome to section four of the first unit of this course. How did you find the previous
section? I hope you found it quite interesting. Section four promises to be even more interesting.
This is because most of the things discussed in this section are familiar to you. This section in a
way revises geometrical shapes. Geometrical shapes make up everything around us. Geometrical
shapes can be both two-dimensional, like the screen of your computer, and three-dimensional, like
a child’s ball. Each geometric shape has its own properties that make it different from other shapes.
However geometric shapes may share properties with other shapes, requiring them to be further
described to distinguish them from other shapes.
INDICATORS: After going through this section you will be able to:
• Interpret and analyse definitions of 3-Dimensional shapes that appear in the basic school
mathematics curriculum.
• Identify, select (or design) and use manipulatives and other TLMs and models to develop
3-Dimensional shapes, as well as, suggest such materials that can be used to teach given
concepts in the basic school curriculum
3-DIMENSIONAL SHAPES
Recall that polygons and polygonal regions are different. A polygon refers to the boundary,
whereas a polygonal region is the union of the boundary and the interior. Similarly, a
distinction is made with three-dimensional figures. Most of the geometric figures you have
worked with so far have been flat plane figures with two dimensions-base and height. In this
sections you will work with solids, like rocks and plants, are very irregular, but many others
are geometric. Some real-world geometric solids occur in nature: viruses, oranges, crystals, the
earth itself. Others are human-made: books, buildings, footballs, milk tins, ice cream cones.
Three-dimensional geometry plays an important role in the structure of molecules. For
example, when carbon atoms are arranged in a very rigid network, they form diamonds. A
solids formed by polygons that enclose a single region of space is called a polyhedron. The flat
polygonal surfaces of a polyhedron are called its faces. Although a face of a polyhedron
23
includes the polygon and its interior region, we identify the face by naming the polygon that
encloses it. A segment where two faces intersect is called an edge. The point of intersection of
three or more edges is called a vertex of the polyhedron. Just as a polygon is classified by its
number of sides, a polyhedron is classified by its number of faces. The prefixes for polyhedron
are the same as they are for polygons with one exception: A polyhedron with four face faces
is called a tetrahedron. Here are some examples of polyhedrons: Hexahedrons, Heptahedrons
and Decahedrons. If each face of a polyhedron is enclosed by a regular polygons, and each
face is congruent to the other faces, and the faces meet at each vertex in exactly the same way,
then the polyhedron is called a regular polyhedron.
Prism
A prism is a polyhedron formed by two congruent polygonal bases in parallel planes connected
by three or more parallelogram-shaped regions. A prism with lateral faces that are rectangular
regions is a right prism.
Pyramid
A pyramid is a polyhedron that has a polygonal base. Its lateral faces are triangular regions
with a common vertex.
The following table summarizes the different polyhedra and the numbers of faces, vertices, and
edges they have.
Figure Faces Vertices Edges
Cube 6 8 12
Triangular prism 5 6 9
Rectangular prism 6 8 12
Pentagonal prism 7 10 15
Triangular pyramid
Square pyramid
24
Models of 3-D shapes and their nets.
25
Cone
Activities
1. Prisms and pyramids are special types of polyhedra. In what ways are polyhedral like
polygons? In what ways are they different?
2. What is the least number of faces that can meet at a vertex of any polyhedron? What is
the least number of edges that can meet at a vertex of any polyhedron?
3. How are pyramids like prisms? How are they different
4. Sketch a net for a pyramid in which the base is five-sided. Name the pyramid
5. As the number of sides in the base of prism increases, what shape does the prism
approach?
6. As the number of sides in the base of a pyramid increases, what shape does the pyramid
approach
7. Complete the table
Regular polyhedron Faces Vertices Edges
Cube 6 8 12
Tetrahedron
Octahedron
Icosahedron 12 30
26
Dodecahedron 20 30
SUMMARY
In this section, we have name and classified the basic kinds of three-dimensional geometric
figures. From the explorations, you can see that a variety of seemingly different figures are
actually prisms. You have seen that prisms and cylinders are closed related. You have seen that
pyramids and cones are closely related.
27
SECTION FIVE: LEARNING ABOUT 2-DIMENSIONAL SHAPES: POLYGONS (
Introduction
This the fifth section of the first unit. You are going to look at two-dimensional shapes. Since you
have already looked at three-dimensional shapes in section 4, this section should not be difficult.
LEARNING INDICATORS: After going through this section you will be able to:
• Interpret and analyse definitions of 2-Dimensional shapes that appear in the basic school
mathematics curriculum.
• Identify, select (or design) and use manipulatives and other TLMs and models to develop 2-
Dimensional shapes, as well as, suggest such materials can be used to teach given concepts in
the basic school curriculum
• Outline the usefulness of geometry and justify why it should be taught both orally and in
writing
• Present and analyse designs in our fabrics and pictures of architectural and artistic, designs
that can be found in our communities
• Outline the influence of translation, rotation, and reflection on polygons in a plane;
• Recognise and analyse the importance of symmetry, congruence and similarity in real life
situations
TWO-DIMENSIONAL FIGURES
Think of geometric figures that people generally find pleasing. In this section, we will learn more
about geometric figures and shapes and you will learn and discover some of the secrets of the
patterns in the objects you have seen. We will examine how shapes can be put together to make
interesting designs you have seen, such as those in quilts, plants and animals, and buildings. Let
us look at some examples of two-dimensional figures.
SQUARES
28
A square is a quadrilateral with some special characteristics.
• First, all of sides are congruent. This is shown by the matching black on the sides of the
figure.
• Second, a square has four right angles. This is shown by the red right symbols symbols
• Finally, pairs of opposite sides are parallel. You can extend the sides, and the opposite sides
will never intersect.
PROPERTIES OF A SQUARE
PROPERTIES OF RECTANGLE
PROPERTIES OF RHOMBUS
Activities
1. In what ways are a square and rectangle alike? In what ways are they different?
2. In what ways are a square and parallelogram alike? In what ways are different?
3. In what ways are the triangle and hexagon alike? In what ways are different?
Summary
30
▪ The nature of side lengths can determine the types of a geometrical shape.
▪ Properties of some triangles
▪ Properties of some special quadrilaterals
31
SECTION SIX: TESSELLATIONS AND APPLYING THESE TO THE TEACHING OF
THE JHS MATHEMATICS CURRICULUM.
Introduction
What do you know about polygons that tessellate and those that cannot tessellate a plane?
Looking around your house, school or other places what designs have you come across that portray
tessellation? In this section we shall learn about tessellations and how to apply tessellations to the
teaching of the JHS mathematics curriculum.
LEARNING INDICATORS: After going through this section you will be able to:
▪ Outline the usefulness of geometry and justify why it should be taught in school
▪ Identify, select (or design and use manipulatives and other TLMs and models to develop
geometrical concepts.
32
overlapping or leaving a gap. What about regular heptagons? In any regular polygon with more
than six sides, each angle has a measure greater than 120°, so no more than two angles can fit about
a point without overlapping. So the only regular polygons that create monohedral tessellations are
equilateral triangles, squares, and regular hexagons. A monohedral tessellation of congruent
regular polygons is called a regular tessellation. Tessellations can have more than one type of
shape. You may have seen the octagon-square combination at right. In this tessellation, two regular
octagons and a square meet at each vertex. Notice that you put your pencil on any vertex and the
point is surrounded by one square and two octagons. So you can call this a 4.8.8 or a 4.8 2 tiling.
The numbers give the vertex arrangement, or numerical name for tiling. When the same
combination of regular polygons (two or more kinds) meet in the same order at each vertex of a
tessellation, it is called a semi-regular tessellation.
Activity
1. What do we mean when we say a shape tessellate?
………………………………………………………………………………………………
………………………………………………………………………………………………
………………………………………………………………………………………………
………………………………………………………………………………………………
2. Briefly explain the term regular tessellation
………………………………………………………………………………………………
………………………………………………………………………………………………
………………………………………………………………………………………………
………………………………………………………………………………………………
Summary
We have come to the end of section 6 and Unit 1. What have you learnt in this section? Remember
that a tessellation that uses only one shape is called a monohedral tiling. A monohedral tessellation
of congruent regular polygons is called a regular tessellation. Tessellations can have more than
one type of shape for a shape to fill the plane without gaps or overlaps, their angles, when arranged
33
around a point, must have measures that add up to exactly 360°. I am sure you are conversant with
the concept of tessellation. Well done!
34
UNIT TWO
GEOMETRICAL CONSTRUCTIONS
Learning, teaching and applying
SECTION ONE: CONGRUENCE, SIMILARITIES AND CONSTRUCTIONS
(TEACHING SYMMETRY, CONGRUENCE, SIMILARITY AND CONSTRUCTION OF
SHAPES).
Welcome to the first section of Unit 2. The whole unit will focus on teaching geometrical
constructions. In this section, we are going to have a look at congruence, similarities and
constructions (teaching symmetry, congruence, similarity and construction of shapes). Before we
proceed, let us ask ourselves: what is congruence and similarity of shapes? I hope you will enjoy
this section to the fullest.
LEARNING OUTCOMES
• Recognize professions and artisans who use the knowledge and skills derived from geometric
construction in their work
• Demonstrate knowledge and understanding of fundamental concepts based on constructions of
angles as found in the basic school mathematics curriculum.
• Demonstrate awareness of the world outside the classroom as a rich source of geometrical ideas
LEARNING INDICATORS
• Examine the usefulness of geometric construction and be able to justify the need for studying
this in basic school.
• Identify and provide sound argument about angles that are possible to construct and those that
cannot be constructed.
• Identify, select (or design) and use appropriate tools and equipment that can be used in solving
problems based on geometric construction and loci in the basic school mathematics curriculum.
Symmetry, congruence, similarity and construction of shapes
The concepts of congruence, similarity, and symmetry can be understood from the perspective of
geometric transformation. Fundamental are the rigid motions: translations, rotations, reflections,
and combinations of these, all of which are here assumed to preserve distance and angles (and
therefore shape in general). Reflections and rotations each explain a particular type of symmetry,
35
and the symmetries of an object offer insight into its attributes as when the reflective symmetry of
an isosceles triangle assures that its base angles are congruent.
Symmetry
In geometry, a figure is symmetrical if an operation can be done to it that leaves the figure
occupying an identical physical space. This can be accomplished in two ways (line of symmetry
and rotational symmetry). Line symmetry occurs when a line may be passed through an object
such that both halves of the object perfectly mirror each other. Consider the triangle below. An
equilateral triangle has three instances of line symmetry: one from each vertex to the midpoint of
the opposite side.
Rotational symmetry occurs when a shape may be rotated to occupy the same space as the original.
Let's take the same triangle again. An equilateral triangle has three degrees of rotational symmetry:
at 120, 240, and 360 degrees. Once we have determined rotational symmetry to 360 degrees, we
can stop, as the pattern will repeat itself after that.
36
Introducing Similarity
A similarity transformation is a transformation in which the image has the same shape as the pre-
image. Specifically, the similarity transformation are the rigid motions (reflection, translations and
rotation) as well as dilations.
Two plane figures are similar if and only if one can be obtained from the other by similarity
transformations (that is by a sequence of reflection, translation, rotation and dilations). The symbol
of similarity is ~. As with congruence, it is customary to write a similarity statement so that
corresponding vertices of the figures are listed in the same order.
If ΔA¹B¹C¹ is the image of ΔABC after a dilation with centre O and scale factor k, then the two
triangles are similar and you write ΔABC ~ΔA¹B¹C¹.
37
congruent. Once these triangle congruence criteria are established using rigid motions, they can be
used to prove theorems about triangles, quadrilaterals, and other geometric figures.
Recall that when two figures are similar, there is a sequence of similarity transformation that maps
one figure to the other. In particular, gives ΔABC ~ΔDEF, you can first a dilation to ΔABC to
make both triangle the same size. Then you can apply a sequence of rigid motions to the dilated
image of ΔABC to map it to DEF.
Because the similarity transformations that map ΔABC ~ ΔDEF preserve angle measure, you can
say that corresponding angles are congruent. Thus, ΔABC ~ΔDEF implies,
Also, the initial dilation that makes the two triangles the same size shows a that each side of ΔDEF
is larger or shorter than the corresponding side of ΔABC by the ratio given by the scale factor.
Assuming the dilation has the scale factor k, this means that:
𝐷𝐸 𝐸𝐹 𝐷𝐹
k = 𝐴𝐵 , k = 𝐵𝐶 and k = 𝐴𝐶
38
𝐷𝐸 𝐸𝐹 𝐷𝐹
= =
𝐴𝐵 𝐵𝐶 𝐴𝐶
Given that ΔRST~ΔUVW, write congruence statements for the corresponding angles and
proportions for the corresponding sides.
A. Corresponding angles are listed in the same position in each triangle name.
B. Corresponding sides are pair of letters in the same position in each triangle name.
ΔRST ~ ΔUVW
𝑈𝑉
=
𝑅𝑆
You have seen that when two triangles are similar, corresponding angles are congruent and
corresponding sides are proportional. The converse is also true. That is if you are given two angles
and you know that the corresponding angles are congruent and corresponding sides are
proportional, you can conclude that the triangles are similar
As with congruence, there are some shortcuts that make it a bit easier to prove that two triangles
are congruent. The most important of these is known as the AA similarity.
AA Similarity Criterion
If two angles of one triangle are congruent to two angles of another triangle, then the triangles are
similar.
39
Activities
…………………………………………………………………………………………………
…………………………………………………………………………………………………
…………………………………………………………………………………………
2. If you know that two figures are similar, can you conclude that corresponding angles are
congruent? Why or why not?
………………………………………………………………………………………………
………………………………………………………………………………………………
………………………………………………………………………………………………
…………………………………………………………………………………...
3. A student identified ̅̅̅̅ ̅̅̅̅as corresponding sides. The student wrote 𝑅𝑆 = 𝑉𝑊. Is this
𝑅𝑆 and 𝑈𝑉 𝑈𝑉 𝑆𝑇
a correct proportion? Why or why not? If the proportion is not correct, explain how to write
correctly.
………………………………………………………………………………………………
………………………………………………………………………………………………
………………………………………………………………………………………………
……………………………………………………………………………………
4. Is triangle similarity transitive? That is, if ΔABC ~ΔDEF and ΔDEF~ΔGHK, can you
conclude that ΔABC~ΔGHK? Explain.
………………………………………………………………………………………………
………………………………………………………………………………………………
……………………………………………………………………………………………..
5. What can you conclude about similar triangles and how can you prove triangles is similar?
………………………………………………………………………………………………
………………………………………………………………………………………………
………………………………………………………………………………………………
Summary
40
We have come to the end of section 1 of Unit 2. What have you learnt in this section? Remember
that similarity transformation is a transformation in which the image has the same shape as the
pre-image. Two plane figures are similar if and only if one can be obtained from the other by
similarity transformations (that is by a sequence of reflection, translation, rotation and dilations).
I am sure you are conversant with symmetry, similarity and congruence of shapes in geometry.
Congratulations!
41
SECTION TWO: TEACHING MEASUREMENT OF A LINE
Hello my dear students. It is very assuring that we are progressing steadily through this course.
How have you being doing so far with your study and the activities. Well done! I wish to remind
you that geometric construction is an important concept in geometry. In the previous section, we
looked at the concept of congruence and similarity in geometry. Am sure by now you are
conversant with these concepts. In this section, we are going to discuss construction and bisecting
a line segment, and constructing a perpendicular from a point outside a line to the line. Let’s look
at these concepts in details.
Geometric Construction
An important part of Euclidean geometry is the construction of plane geometric figures using only
two tools. Straightedge and a compass. A straightedge is similar to a ruler except that it does not
contain any marks used for measuring length. A compass is a tool that can be used to copy a line
segment, draw a circle of a given radius, and draw an arc. We will now demonstrate the use of
both of these tools, through seven simple constructions.
Construction of a line
Let us look at how to construct a line segment.
5cm
Procedure
a. Using a sharp pencil and a ruler, draw a line of length longer than 5cm
b. With the aid of a ruler and a pencil mark the line towards the left end and label that
point A
c. With your pair of compasses, measure 5cm on your ruler
42
d. With the point of compasses at A, draw an arc to cut the line towards the right hand
end.
e. Label the intersection of the arc and the line as B.
f. Using a pair of dividers, check that the line segment AB constructed is of length 5cm.
Procedure
43
Constructing a perpendicular from a point outside a line to the line.
Procedure
a. Draw a line AB
b. Choose a point P above AB
c. Open your pair of compasses to a convenient radius and place the point a P, construct two
arcs to cut AB at C and D.
d. With centre C and a convenient radius, draw an arc to cut the first one at Q.
e. Join P and Q to cut AB at E
f. PE is the perpendicular from P to AB and the shortest distance from P to AB.
Activities
44
2. How will you bisect a given line segment?
………………………………………………………………………………………………
………………………………………………………………………………………………
……………………………………………………………………………………….
Summary
Let us pause here and reflect on what we have learnt so far. What have we covered in this section?
In this section, we dealt with how to construct and bisect a given line segment and how to construct
a perpendicular from a point outside a line to the line. I am sure you have enjoyed our discussion.
Let us now look at the third section of Unit 1.
45
SECTION THREE: BISECTION OF A LINE AND ANGLES AND CONSTRUCTION
OF BASIC ANGLES (60°, 90°, 30°, 15°, 45°).
Hello my dear students, well done for working successfully through the first two sections. I
welcome you to section 3 of Unit 2. In this section, we will focus on another important aspect
of geometric construction that deals with bisecting and construction of angles.
Procedure
a. With the compass pin placed on O, the vertex of the angle, draw arcs on the arms of the
angle with the same radius.
b. The point of intersection of the arcs on the arms of the angle are labelled A and B.
c. Above the angle that is to be bisected, draw two intersecting arcs using the same radius
with their centres (where the compass pin is placed) being the points A and B.
d. Label the point of intersection of the arcs R.
e. Join O and R. this line OR divides angle ‘a’ into two equal halves.
46
Constructing an angle of 90° and 45°
Procedure
a. Draw a straight line and mark a point O, the vertex of the angle to be constructed.
b. With the compass pin on O, draw a semicircle and mark its ends on the line as A and B.
Angle AOB is a straight angle (180°) whose arms are OA and OB.
c. With the same radius, put the compass pin on any of A or B (say B) and draw an arc on the
semicircle. Label this point C.
d. Draw a line through O and C. Angle BOC=60°, automatically angle AOC = 120° (60° and
120° are pair of supplementary angles and they share a common arm OC)
e. Bisect angle BOC =30°
f. Bisect angle BOL= 15°
47
Activities
Summary
In this section, we have discussed the bisection of a line and angles and construction of basic
angles (60°, 90°, 30°, 15°, 45°). I urge you to try your hands on these concepts for better
understanding. Good luck in your attempt.
48
SECTION FOUR: CONSTRUCTION OF OTHER ANGLES
For any angle to be constructed, by the use of a pair of compasses and a ruler, a straight
angle (180°) is the first to construct.
Two angles that sum up to 180° are called supplementary angles and any pair of supplementary
angles can be constructed the same way. The two angles share the one common arm and their other
arms are the different arms of the straight angle.
Procedure
49
Constructing an angle of 75˚
a. To construct an angle of 75˚ at point O, bisect the angle between the 60˚ and 90˚ lines (60˚
+15˚) =75˚.
b. Trisect a straight angle (180˚) into three (60°s).
c. Bisect the middle 60° into two (30 ˚s). i.e. ∠POD = ∠DOQ = 30°.
d. Note that this same bisector also bisects the straight angle into (90°s). ∠AOC = ∠BOD
e. Bisect any of the two (30˚s) into two (15˚s). ∠AOC = ∠TOQ = 15°
f. ∠BOT = 75° . i.e ∠AOD + ∠DOT
50
Constructing an angle of 150°
Activities
51
………………………………………………………………………………………………
………………………………………………………………………………………………
………………………………………………………………………………………………
………………………………………………………………………………………………
………………………………………………………………………………………………
………………………………………………………………………………
Summary
In this section you have learnt how to construct other angles such as 75 ˚, 105 ˚, 135 ˚, 150 ˚ etc.
We went through the various procedures of constructing these angles. I am sure you will practice
and master the construction of these and other angles. Well done!
52
SECTION FIVE: TEACHING CONSTRUCTION OF TRIANGLES
I am pleased to welcome you to section 5 of Unit 2. How have you being doing you are doing well
in your study and activities? In this section, we are going to examine the construction of triangles.
I am sure this concept is not new to you. You must have come across it many times in your
academic pursuit.
Construction of Triangles
The construction of a triangle with the length of all the three sides (given)
Worked Example
Procedure
53
Note: You can choose to start with any of the three given sides.
Construction of a triangle with the length of two sides and an interior angle given
Worked Example
Construct ∆ PQR with ∠RPQ = 60°, │PQ│= 10cm and │PR│= 12cm
Procedure
54
Construction of a triangle with two interior angles and the length of a side given
Example
Construct ∆ABC with ∠CAB = 75°, │AB│= 12cm and angle ABC = 45°.
Procedure
55
Activities
…………………………………………………………………………………………………
…………………………………………………………………………………………………
…………………………………………………………………………………………………
…………………………………………………………………………………………………
……………………………………………………………………………………
2. Explain how to construct a triangle with the length of two sides and an interior angle given.
………………………………………………………………………………………………
………………………………………………………………………………………………
………………………………………………………………………………………………
……………………………………………………………………………………
3. Construct a triangle with two interior angles and the length of a side given.
………………………………………………………………………………………………
………………………………………………………………………………………………
………………………………………………………………………………………………
…………………………………………………………………………………….
Summary
Congratulations for working through this section on the construction of triangle. We have also
explain how to construct a triangle with the length of two sides and an interior angle given. Finally
we had a discussion on constructing a triangle with two interior angles and the length of a side
given. It is my hope that all these discussions have gone well with you. Well done!
56
SECTION SIX: TEACHING QUADRILATERALS AND LOCI AND THEIR
APPLICATIONS IN THE BASIC SCHOOL MATHEMATICS CURRICULUM
Hello, my dear students. I welcome you to the last section of Unit 2. Our focus of the unit has
been on teaching geometric construction. I am sure you have seen some appreciable level of
improvement in your skills and understanding of geometric construction. Congratulations! In this
sections, we will concentrate on construction of quadrilaterals. Am sure you know what
quadrilaterals are? What about examples of quadrilaterals? Well done.
Construction of Quadrilaterals
Construction of square
Worked Example
Procedure
NB: If there is no error, then the two diagonals must be equal in length.
Construction of a Rectangle.
Worked Example
Procedure
NB: the pair of the opposite sides have the same lengths.
Construction of a parallelogram
Worked Example
Construct a parallelogram ABCD with │AB│= 12cm, and │BC│= 10cm and angle ABC=105°,
BAD =75°.
Procedure
Loci
59
A locus is the set of all possible positions of a point, which varies according to certain conditions.
The plural of locus is loci. All the points on these positions have a common property/properties.
This locus is a circle whose centre is a fixed point with a given radius (the equal distance).
Worked Example
A point moves in a plane so that its distance from a fixed point is 3cm. Construct this locus.
Solution
60
Locus of a points equidistant from two fixed points.
This locus is the perpendicular bisector of the line joining two given points. All the
points on this line have common property. Let P be any variable point on this line
(locus). If the two points are Q and R then │PQ│=│PR│.
Worked Example 1
│P1Q│=│P1R│
│P2Q│=│P2R│
│P3Q│=│P3R│
Worked Example 2
A point moves in a plane such that it is always equidistant from two given points A and B, which
are a fixed distance of 8cm apart. Describe briefly and construct its locus.
Solution
The locus of the point will be the perpendicular bisector of the line joining A and B
61
Locus of points equidistant from two intersecting straight lines
The locus is also the angle bisector of the angle between the two intersecting straight lines.
Let OT and OQ be two lines intersecting at O. the locus is the bisector of angle TOQ. E very point
on this bisector has a common property. If P is a variable point on the bisector, then the length of
the perpendicular from P to OT, is the same as the length of the perpendicular from P to OQ.
Worked Example 3
If point P, moves in a plane such that its distance from two given intersecting straight lines AB
and AC is equidistant, what is its locus?
Solution
62
Locus of points equidistant from two parallel lines.
This locus is the third parallel line which is midway the two given parallel lines.
If AB is parallel to CD, then the third line indicating the locus is parallel to both and is half way
the perpendicular distance between them.
Worked Example
A point moves in a plane such that its distance from two given parallel lines is constant. Given that
the distance between the parallel lines is 12cm, describe and construct the locus of the point.
Solution
a. From points A and B, draw perpendicular lines to meet the other parallel line at C and D
such that │AC│=│BD│=12cm.
63
b. Adjust your compass such that the radius is 6cm (half of 12cm).
c. With the compass pin at A, draw an arc on AC and name it P1
d. Repeat the process with compass pin at B and mark on BD a point P2, 6cm from B.
e. Join P1 and P2. This is the required line (locus) which is equidistant from AB and CD.
Activity 1
………………………………………………………………………………………………………
………………………………………………………………………………………………………
………………………………………………………………………………………………
Activity 2
64
………………………………………………………………………………………………
……………………………………………………………………………………
2. a quadrilateral ABCD such that │AB│=8cm, │BC│=│AD│= 6.5cm, ABC=60˚ and angle
BAD=75˚. Construct the locus l1 of points equidistant from A and B. construct the locus l2
of points equidistant from B and C. Locate a point P where P is the point of intersection of
l1 and l2.
3. a parallelogram with sides 6cm and 9cm, the angles between these sides being 60°.
Measure the diagonals.
4. a rectangle measuring 7.4cm by 10.3cm and measure the length of the diagonal.
Summary
I hope you have by this time realized that to construct a shape or a diagram means to draw it
accurately. The most accurate way to construct a diagram is to use compasses and a ruler only.
Note that it is not advisable to draw lines which are longer than the required segments, using a
ruler or a pair of compasses to mark off the required points. Read more about construction of
quadrilaterals so that you can improve upon your construction skills. Congratulations!
65
UNIT 3
TRIGONOMETRY, VECTORS AND BEARINGS
Introduction
Dear student, you are welcome to the third unit of this course. The first three sections of the unit
will introduce you to basic trigonometry. It covers concepts such as right-angle triangles,
Pythagorean triples and basic trigonometric ratios (sine, cosine and tangent). You are encouraged
to use your prior learning of Pythagoras rule, measure of angles in various quadrants of a circle
and right-angle triangles to help you achieve the section objectives. We know after completing the
entire unit, you will come to love these concepts and accept that indeed they are relevant concepts
with real-life applications.
Unit Content
In this unit, we will explore the following concepts in six sections;
Section 1: Angles
Section 2: Right-angled Triangles
Section 3: Trigonometry Ratios and their Application to real life situations
Section 4: Algebra of Vectors and Vector Notations
Section 5: Vector Operations
Section 6: Magnitude and Direction of Vectors
SECTION 1: ANGLES
Dear student, consider a situation where we have to measure the height of a skyscraper without
necessarily climbing it or to measure the height of an electric pole in front of a lecture block. What
about measuring the height of the faculty block? Imagine a boy wants to swim across a fast-flowing
river. If he swims directly across, he will be swept downstream away by the high current. In your
view, where should he aim at so that he can safely swim to the opposite ends of the river? The
solution in all these scenarios involve the application of the concept of trigonometry. These
therefore makes the learning of trigonometry imperative. This section focuses on trigonometric
angles as they are foundational to the whole concept of trigonometry.
66
Section objectives
By the end of this section, you will be able to;
• Explain angles
• Identify various kinds of angles
• Explain angular measures in degrees and radians and their relationships
• Use the three basic trigonometric ratios to determine angular value s in degrees in the
quadrants of a circle
We will take a brief insight into where the concepts of trigonometry came from, how they were
discovered, and how their uses in the past relate to how they are currently used and taught. This
will provide you with the extra understanding you need to put these concepts to use, be it as
measuring devices, or as functions. With this better insight, one would be able to see the value in
studying trigonometry as a component of mathematics, instead of a detached unit from the subject.
The term “trigonometry,” although not of native Greek origin, it derives its name from the Greek
words trigonon, meaning “triangle,” and metria, meaning “measurement.” It means “measurement
of a triangle”. As the name implies, trigonometry ultimately developed from the study of right
triangles by applying the relationships between the measures of its sides and angles to the study of
similar triangles. Trigonometry is concerned with the relationship between the size of the angles
and the length of sides of a plane figure such as triangles. Therefore, we can conclude that
trigonometry is that branch of mathematics concerned with the measurement of sides and angle of
a plane triangle and the investigations of the various relations which exist among them and their
applications. Hipparchus of Nicaea is the known figure in trigonometry. He is regarded as the
father of trigonometry though other scholars made contributions in the field.
ANGLES
An angle is simply the degree of turn from a fixed point. It is also defined as the measure of the
rotation of a line from one position to another about a fixed point on it.
An angle is also formed by rotating a ray around its endpoints. The two rays are known as the
initial side and the terminal side. These two rays form the sides of the angle. The endpoint then
becomes the vertex of the angle..
67
Figure 3.1
In figure 3.1, the first position OX is called initial line (position) and second position OP is called
terminal line or generating line(position) of XOP. If the terminal side resolves in anticlockwise
direction the angle described is positive as shown. If terminal side resolves in clockwise direction,
the angle described is negative.
Angle Rotations
Positive Angles
An angle, becomes a positive angle if it is generated by a counter clockwise rotation about the
origin.
Negative Angles
An angle is a negative angle if it is generated by a clockwise rotation about its origin.
Standard angle
An angle is described as being in standard position when the starting side (initial side) coincides
with the positive x-axis with its vertex at the origin.
Types of angles
Angles can take one of the following forms;
68
e) An angle is a reflex angle if it is between 180⁰ and 360⁰. Thus, 1800<<3600
Two positive angles can also sum up to form a right angle, thus 90◦. Such angles are said to be
complementary angles while two positive angles that sum up to form a straight line, thus 180◦ are
said to be supplementary angles.
69
Example: 45° 7’ 12’’ is read as forty-five degrees, 7 minutes and 12 seconds.
Which when interpreted is 45° + 7/60° + 12/3600°
2𝜋 rad = 360°. Therefore, a radian has an angle equal to180/𝜋. A radian is approximately 57°.
In trigonometry, angles are measured in an anticlockwise direction from the positive x-axis from
0⁰ to 360⁰ using trigonometric ratios. What are these trigonometric ratios? The trigonometric ratios
are of three basic types. These are;
i. Sine (Sin)
ii. Cosine (Cos)
iii. Tangent (Tan)
These ratios may be positive or negative depending on the quadrant in which it lies. The sign of
the ratio in a particular quadrant can be remembered from the diagram below;
70
From the diagram,
In the 1st quadrant (0⁰-90⁰) all the ratios are positive
In the 2nd quadrant, (90⁰-180⁰) only sine is positive
In the 3rd quadrant (180⁰-270⁰) only tan is positive
In the 4th quadrant (270⁰-360⁰) only cos is positive
Quadrants:
Two mutually perpendicular straight lines divide the plane into four equal parts, each part is called
quadrant. Thus, there is the 1st, 2nd, 3rd and 4th quadrants respectively.
In first quadrant, the angle varies from 0o to 90o in anti-clockwise direction and from – 270o to –
360o in clockwise direction.
In second quadrant, the angle varies from 90o to 180o in anti-clockwise direction and – 180o to –
270o in clockwise direction.
In third quadrant, the angle varies from 180o to 270o in anticlockwise direction and from – 90o to
– 180o in clockwise direction.
In fourth quadrant the angle Fig.3.4 vary from 270o to 360o in anticlockwise direction and from 0o
to – 90o in clockwise direction.
71
Activity 1
1. Determine the type of angle with the following measurement;
a) 56⁰ b) 107⁰ c) 180 d) 278 e) 349 f) 360
3. Using your answers in (2), determine the type of angle for each letter.
Suggested answers
1. a) Acute b) obtuse c) straight line d) reflex e) reflex f) circle
Summary
In a nut shell, we have learnt that;
• The term “trigonometry’ comes from the Greek word trigonon, meaning “triangle,” and
the Greek word -metria, meaning “measurement.”
• Trigonometry therefore means the “measurement of a triangle”
• Hipparchus of Nicaea is the father of trigonometry
72
• An angle is formed by rotating a ray about a fixed point.
• Two positive angles can also sum up to form a right angle, thus 90◦. Such angles are said
to be complementary angles while two positive angles that sum up to form a straight line,
thus 180◦ are said to be supplementary angles
• An angle, becomes a positive angle if it is generated by a counter clockwise rotation about
the origin.
• Also, an angle is a negative angle if it is generated by a clockwise rotation about its origin.
• An angle is described as being in standard position when the starting side (initial side)
coincides with the positive x-axis with its vertex at the origin.
• Angles can be either acute, obtuse, straight, right or reflex depending on its measure in
degrees.
• Angles are usually measured in degrees and radians
• The central angle that is subtended by an arc with same length as the radii is called a
radian.
• The symbol for radian is rad
• The use of degree as a unit of measurement of angles was developed by the Babylonians
about 4000 years ago.
• Angles may form within four main quadrants, 1st, 2nd, 3rd or 4th and that determines whether
it will be negative or positive.
73
Chapter 1 SECTION 2: RIGHT ANGLE TRIANGLES
Introduction
Dear students, we want to continue our study of trigonometry in this section by looking at triangles,
and for a while we will consider only right triangles. Once we have understood right triangles, we
will know a lot about other triangles as well. It is worth knowing that there are some triangles that
have no right angles. Such triangles are known as the oblique triangles. They usually contain either
three acute angles or two acute and one obtuse angle. Take note of these triangles to help you
differentiate them from what our discussion today will be about.
Section objectives
By the end of this section, you should be able to;
74
A fundamental property of right-angled triangles is that Pythagoras’ Theorem holds. This
triangle has three sides; a longest side which is always called the hypotenuse, a side of length x
and one of length y. You also have three angles; the right-angle itself (depicted by the small
square) and two others given by (theta) and (phi). If you look carefully you can see that the
hypotenuse is the side that is opposite the right angle – in a way the hypotenuse plays no part in
the right-angle which is defined by the join between sides x and y.
The hypotenuse is always opposite the right angle in a right-angled triangle. Definitions of adjacent
and opposite sides are different for the angle . Can you see that the side x is opposite the angle
and that side y is adjacent to ? This is the crucial point here, when you are defining the opposite
and adjacent sides of an angle, the sides you choose depend on the angle you are interested in.
Moreover, the hypotenuse always remains the longest side, the one opposite the right-angle. This
idea of sides which are opposite and adjacent with respect to certain angles is essential in defining
the trigonometric ratios. The longer of the sides (x or y) is always opposite the larger of the angles
().
75
If we know the length of two sides of a 90° triangle, we can determine the angular measure of the
reference angle . The reference angle determines which side we refer to as opposite and which is
referred to as adjacent. Wherever angle is, this is the reference point. The side opposite of Ɵ is
simply referred to as the “side opposite” and the side connected to is called “side adjacent”
Consider the diagrams below;
Square of the hypotenuse = sum of the squares of the other two sides.
𝑐2 = 𝑎2 + 𝑏2
𝑏2 = 𝑐2 - 𝑎2 OR b = √𝑐 2 − 𝑎2
NB: the longest side or the side opposite to the right angle is always the hypotenuse
Example 1
Solution
By Pythagoras’ theorem;
|𝑃𝑄|2 = |𝑅𝑃|2 + |𝑄𝑅|2
= 122 + 52
= 144 + 25
= 169
|𝑃𝑄|2 = 169
|𝑃𝑄| = √169
= 13 cm
Example 2
Find the length of AC in the figure below;
77
Solution
By Pythagoras’ theorem;
|𝐵𝐶|2 = |𝐴𝐵|2 + |𝐴𝐶|2
52 = 32 + |𝐴𝐶|2
25= 9 + |𝐴𝐶|2
|𝐴𝐶|2 = 25 - 9
|𝐴𝐶|2 = 16
|𝐴𝐶| = √16
|𝐴𝐶| = 4 cm
Example 3
Solution
|𝑄𝑇|2 = |𝑃𝑇|2 + |𝑄𝑃|2
|𝑃𝑇| = √64
= 8 cm
Example 4
In the triangle ABC below, |𝐴𝐵| = |𝐵𝐶| = 5cm and |𝐴𝐶| = 8cm. find |𝐵𝐷|
Solution
From the right-angle triangle ABD
|𝐴𝐵|2 = |𝐴𝐷|2 + |𝐵𝐷|2
|5|2 = |4|2 + |𝐵𝐷|2
1 1
Since |𝐴𝐷| = of |𝐴𝐶| , |𝐴𝐷| = 2 of 8 = 4
2
|𝐵𝐷|2 = 52 - 42 = 25-16 = 9
|𝐵𝐷|2 = 9
|𝐵𝐷| = √9 = 3cm
Example 5
In the diagram, <PQR = PRS = 90⁰, |𝑃𝑆|= 13cm, |𝑃𝑄|= 3cm and |𝑄𝑅|= 4cm. find |𝑅𝑆|
79
Solution
By Pythagoras’ theorem;
|𝑃𝑅|2 = |𝑃𝑄|2 + |𝑄𝑅|2
|𝑃𝑅|2 = |3|2 + |4|2
|𝑃𝑅|2 = 9 + 16 = 25
132 = 52 + |𝑅𝑆|2
169- 25 = |𝑅𝑆|2
|𝑅𝑆|2 = 144
EXAMPLE 6
Show that a triangle with sides 3, 4, and 5 is a Right angled-triangle
Solution
We can apply statement to see if it is a right triangle. In
fact, 52 = 25 = 32 +42 ,
so, the angle opposite the side of length 5 is a right angle.
NB: that we can use statement of the Pythagorean theorem to solve this problem.
80
APPLICATION OF PYTHAGORAS THEOREM
Problems
1. A ladder leans against a vertical wall of height 12m. If the foot of the ladder is 5m away
from the wall, calculate the length of the ladder.
2. A ladder of 4.5 m long. The foot of the ladder is 2m away from the base of the wall. How
far up the wall is the top of the ladder?
Suggested Answers
1. 13m long
2. 4.03 (2 d.p) up the wall
1. A) In the quadrilateral ABCD below, |𝐴𝐵| = 3cm, |𝐵𝐶|=4cm, |𝐶𝐷| = 12cm and angle
ABC = 90⁰ and ACD=90⁰.
Calculate
i. The perimeter of ABCD
ii. The area of ABCD
81
ii.
3. An isosceles triangle has equal sides 6cm long and a base 4cm long. Find the altitude of
the triangle, leaving your answer in surd form.
4. The sides of a rectangular floor are x m and (x+7) m. the diagonal is (x+8) m. calculate
i. The value of x
ii. The area of the floor
5. A rectangular box has length 10 cm, breadth 9cm and 6cm high. What is the longest length
of the longest stick that will fit into the box?
Suggested answers
1. i. 32cm
2. i. 15.68cm ii. 13cm
82
3. 4√2
4. i. 5m ii. 60m2
5. 14.73m
6. 16cm
Pythagorean Triple
Sometimes, the three lengths of a right-angled triangle are all whole numbers. When this occurs,
they are called a Pythagorean Triple or Triad. The most commonly known of these is (3,4,5)
representing the triangle.
Pythagorean triple is a set of three integers which obey the Pythagoras theorem. Thus, the sum of
the squares of two of the sides of a right-angles triangle equals the square of the third side.
For any two positive integers a and b, where a>b. the three sides of a right-angle triangle can be
expressed in terms of a and b to generate the Pythagorean triplets as shown below;
However, if only one side of the right-angled triangle is known, then the Pythagorean triple can
be generated using the sides.
1 1
{ ( 𝑎2 − 1), ( 𝑎2 + 1) }
2 2
To generate Pythagorean triplet, the formula is
1 1
{ ( 𝑎2 + 1)} 2
≡ {2 ( 𝑎2 − 1)} 2
+ 𝑎2 where a is an odd integer greater than 1.
2
83
i. [3, 4 and 5]
ii. [5, 12 and 13]
iii. [6, 8 and 10]
iv. [9, 12 and 15]
v. [12, 16 and 20]
vi. [15, 20 and 25]
vii. [7, 24 and 25]
viii. [8, 15 and 17]
ix. [15, 36 and 39]
x. [9, 40 and 41]
Other Triads can be based on multiples of the base Triads; the Triads (6,8,10), (9,12,15), (12,16,20)
... are based on the base Triad (3,4,5).
This means that to get other triplets, multiply each of (3,4,5) by a whole number.
For instance, (3×2=6), (4×2=8) and (5×2=10) hence the next triplets are (6, 8 and 10)
1 2 3 4 5 6
2 4 6 8 10 12
3 6 9 12 15 18
4 8 12 16 20 24
5 10 15 20 25 30
6 12 18 24 30 36
Keys
Each of the colored columns of the chart represent a set of Pythagorean triples.
For instance;
The set for Green {9,12 and 15} =92 + 122 = 152 , etc.
Proof
For these numbers to be Pythagorean triplets, then the sum of the squares of two of the sides
must equal the square of the third side;
Thus
𝑐2 = 𝑎2 + 𝑏2
Or
𝑎2 + 𝑏2 = 𝑐2
For [3, 4 and 5]
𝑎2 + 𝑏2 = 𝑐2
Where a=3, b=4 and c=5
32 + 42 = 52
9 + 16 = 25
25 = 25 VALID
Hence, the numbers [3, 4 and 5] are Pythagorean triplets.
Example 1
Find the Pythagorean triplets if x=3 and y=2
Solution
𝑥 2 = 32 = 9
𝑦 2 = 22 = 4
𝒙 𝟐 − 𝒚𝟐 = 9 - 4 = 5
2xy = 2 (3×2) = 12
85
𝒙𝟐 + 𝒚𝟐 = 9 + 4 = 13
Therefore, the Pythagorean triples are {5, 12 and 13}
Example 2
The sides of a right-angled triangle are (x-1) cm, (x+3) cm and (x+7) cm. Find the value of x
Solution
𝑥 2 -10x – 39 = 0
𝑥 2 -13x + 3x-39=0
x(x-13) + 3 (x-13) = 0
(x+3) (x-13)=0
(x+3) =0
X=-3
(x-13) =0
X=13
Hence x=13
Example 3
The diagonals of a rhombus are 8cm and 10cm long. Calculate correct to one decimal place the
length of side of the rhombus.
Solution
86
Let the vertices of the rhombus be ABCD
By Pythagoras’ theorem;
87
2. Generate Pythagorean triplets using the following pairs of positive integers.
a) 3 and 1
b) 4 and 1
c) 4 and 2
d) 5 and 2
3. If three sides of a triangle are given as 𝑥 2 − 𝑦 2 , 2xy and 𝑥 2 + 𝑦 2 . Show that the triangle
is a right angles triangle.
Suggested answers
1. i. Yes ii. Yes iii. Yes iv. No
v. Yes vi. Yes vii. No viii. No
3. Clue: 𝑥 2 + 𝑦 2 = 𝑥 2 − 𝑦 2 + 2xy
Summary
As a recap, we learnt in this section that;
• A right-angled triangle is therefore any triangle which has one of its angles measuring 90⁰
(right-angle)
• This triangle has three sides; a longest side which is always called the hypotenuse, the
adjacent and the opposite.
• The hypotenuse is always opposite the right angle in a right-angled triangle.
• There are some other triangles that may not have right angles. Such triangles are known as
the oblique triangles.
• Oblique triangles usually contain either three acute angles or two acute and one obtuse
angle.
• A fundamental property of right-angled triangles is that Pythagoras’ Theorem holds
• The Pythagorean theorem is named after the Greek philosopher Pythagoras
• Pythagoras’ theorem states that for any right-angled triangle: Square of the hypotenuse is
equal to the sum of the squares of the other two sides.
• We can apply the theorem of Pythagoras to determine the unknown sides of a right-angle
triangle.
• Pythagorean triple is a set of three integers which obey the Pythagoras theorem.
88
• For any three sides of a right triangle to be a Pythagorean triple, the sum of the squares of
two of the sides of a right-angled triangle must be equal to the square of the third side.
• The most commonly known of the Pythagorean triple is (3,4,5) representing a triangle.
• Other Triads can be based on multiples of the base Triads; the Triads (6,8,10), (9,12,15),
(12,16,20) ... are based on the Triad (3,4,5).
• This means that to get other triplets, multiply each of (3, 4,5) by a whole number. For
instance, (3×2=6), (4×2=8) and (5×2=10) hence the next triplets are (6, 8 and 10)
• For any three numbers to be Pythagorean triplets, then the sum of the squares of two of
the sides must equal the square of the third side; Thus, 𝑐 2 = 𝑎2 + 𝑏2
2 2 2
Or 𝑎 + 𝑏 = 𝑐
89
Chapter 2 SECTION 3: TRIGONOMETRIC RATIO AND ITS APPLIACTION
Dear students, you are warmly welcome to section 3 of unit 3 of the course. In this section, we will
take a closer look at the trigonometric ratios that was mentioned in the beginning sections of this
unit. We will also establish the relationship among the three basic trigonometric ratios and how
they are applied to real life situations such as finding the height of a vertical object without
measuring, through investigations.
Section objectives
By the end of this section, you will be able to;
TRIGNOMETRIC RATIO
The trigonometric ratios are of three basic types.
These are;
i. Sine (Sin)
ii. Cosine (Cos)
iii. Tangent (Tan)
These ratios may be positive or negative depending on the quadrant in which it lies.
90
The simplest way to remember this is
Sine of an angle = SOH
Where S= Sine of an angle
O= Opposite
H= Hypotenuse
𝑠𝑖𝑑𝑒 𝑜𝑝𝑝𝑜𝑠𝑖𝑡𝑒 𝑡𝑜 𝑡ℎ𝑒 𝑎𝑛𝑔𝑙𝑒
Sine of an angle = SOH = ℎ𝑦𝑝𝑜𝑡𝑒𝑛𝑢𝑠𝑒
|𝑅𝑃|
Sin Q = |𝑃𝑄|
Definition: For any acute angle α, we draw a right triangle that includes α. The sine of α,
abbreviated sin a, is the ratio of the length of the leg opposite this angle to the length of the
hypotenuse of the triangle.
For Example, in the right triangle ABC (diagram above), sin a= a/c.
We can see immediately that this definition has a weak point: it does not tell us exactly which
right triangle to draw. There are many right triangles, large ones and small ones that include a
91
given angle a.
Let us try to answer the following questions.
C =20
A =1O
For Example, we might let the length of the hypotenuse be 20. Then the length of the side
opposite the 30° angle measures 10 units. So
10 1
sin 30° = 20 = = 0.5
2
We know, from geometry, that whatever the value of the hypotenuse, the side opposite the 30°
angle will be half this value. so, sin 30° will always be 1/2. This value depends only on the
measure of the angle, and not on the lengths of the sides of the particular triangle we used
|𝑅𝑄|
Cos Q = |𝑃𝑄|
Definition: In a right triangle with acute angle a, the ratio of the leg adjacent to angle a to the
hypotenuse is called the cosine of angle a, abbreviated cos a.
Notice that the value of cos a, like that of sin a, depends only on a and not on the right triangle
that includes a. Any two such triangles will be similar, and the ratio cos a will thus be the same
in each.
93
Tangent of an angle = TOA
Where T= Tangent of an angle
O = Opposite
A= Adjacent
𝑠𝑖𝑑𝑒 𝑜𝑝𝑝𝑜𝑠𝑖𝑡𝑒 𝑡𝑜 𝑡ℎ𝑒 𝑎𝑛𝑔𝑙𝑒
Tangent of an angle = TOA = 𝑠𝑖𝑑𝑒 𝑎𝑑𝑗𝑎𝑐𝑒𝑛𝑡 𝑡𝑜 𝑡ℎ𝑒 𝑎𝑛𝑔𝑙𝑒
|𝑅𝑃|
Tan Q = |𝑅𝑄|
You may notice that the tangent represents the gradient of a line (usually defined as the change
in y divided by the change in x. the word “tangent” is shortened to “tan” with tan denoting the
tangent of the angle , again there is no multiplication involved.
Interestingly you can use the definitions sine and cosine above to show that
In other words:
The tangent of an angle is the sine of that angle divided by its cosine
TRY QUESTIONS
1. Use the figure below to answer the questions
a) Find sin(θ).
b) Find cos(θ).
c) Find tan(θ).
d) Find sin2(θ)+ cos2(θ).
94
5
2. Assume that sin(α) = , where α is acute. (A)
13
a) Find cos(α).
b) Find tan(α).
c) Find sin2(α)+ cos2(α)
Solution
Using Ө as our reference, we can see that side opposite was given, 6cm. Also, the hypotenuse
was given, 10cm. To determine angle Ɵ, we will use sine to help us, because it uses the side
opposite and hypotenuse to determine angle Ɵ
𝒔𝒊𝒅𝒆 𝒐𝒑𝒑𝒐𝒔𝒊𝒕𝒆 𝒕𝒐 𝒕𝒉𝒆 𝒂𝒏𝒈𝒍𝒆
Sine of an angle = SOH = 𝒉𝒚𝒑𝒐𝒕𝒆𝒏𝒖𝒔𝒆
𝟔
Sine of an angle = = 0.6
𝟏𝟎
𝑆𝑖𝑛−1(0.6) = 36.87⁰
Q.1 If sin θ = 2/3, and the terminal side of the angle lies in the second quadrant, find the
remaining trigonometric ratios of θ
95
Q.2 If sin θ = 3/8, and the terminal side of the angle lies in the second quadrant, find the
remaining trigonometric ratios.
√3
Q.3 If cos θ = − and the terminal side of the angle lies in the third quadrant, find the
2
remaining trigonometric ratios of θ.
Q.4 If tan θ = 3/4, and the terminal side of the angle lies in the third quadrant, find the remaining
trigonometric ratios of θ.
Suggested Answers
96
ANGLE OF ELEVATION
Imagine a course mate is on the 3rd floor of the Faculty block. If you, standing on the ground floor
looks up to find him, there is an angle formed between the ground level and the line of your sight.
Again, Mr. Abaidoo looked upwards at an orange hanging on top of his tree. There is an angle
formed between the horizontal and the line of his sight in order for him to see the top of the tree.
This angle is what is referred to as the Angle of Elevation.
The angle of Elevation of an object, B is from an observer at A, who is below the level of B, is the
angle which AB makes with the horizontal.
For instance
The length of the shadow of the faculty block, 89m tall is 93cm. find the angle of elevation of the
sun.
Solution
𝑨𝑩 𝟖𝟗
TanB= =
𝑨𝑪 𝟗𝟑
B = 𝑡𝑎𝑛−1 (0.957)
= 43.74⁰
Therefore, the angle of elevation is 44⁰
ANGLE OF DEPRESSION
97
In the same but opposite vein, if a course mate is on the 3rd floor of the Faculty block of UEW and
you standing on the ground floor, the course mate looks downwards to find you, there is an angle
formed between the line of his sight and the ground level. Again, if Mr. Baidoo happened to be on
top of the tree and looked downwards at an orange that has fallen on the ground. The angle formed
between the line of his sight and horizontal ground in order for him to see the orange now becomes
the Angle of Depression.
In other words, if you look downwards at an object, the angle formed between the horizontal and
your sight in order to see the object below is called the angle of depression.
The angle of depression of an object Q from an observer at R, who is above the level of Q, is the
angle which RQ makes with the horizontal
EXAMPLE 1
An airplane is 1200m above the ground, its angle of elevation from point P on the ground is 30⁰.
How far is the airplane from P by line of sight?
Solution
𝑨𝑩
= sin30 = 𝑨𝑷
𝟏𝟐𝟎𝟎
= sin30 = 𝑨𝑷
98
𝟏𝟐𝟎𝟎
= 0.5 = 𝑨𝑷
𝟏𝟐𝟎𝟎
= AP = = = 2400m
𝟎.𝟓
EXAMPLE 2
If the shadow of a pole 12m high is 2/3 its length, what is the angle of elevation of the sun, correct
to the nearest degree?
Solution
𝑶𝑵 𝟏𝟐
Tan= =
𝑶𝑴 𝟖
= 𝑡𝑎𝑛−1(1.5)
= 56.31⁰
Therefore, the angle of elevation is 56⁰
EXAMPLE 3
A man standing 40m from a vertical pole observes that the angle of elevation of the top of the pole
is 18⁰. Assuming that his eye is 1.8m above the base of the pole, calculate to the nearest metre, the
height of the pole.
Solution
99
Form the diagram, BD represent the Pole and OA represent the man. Hence,
𝒔𝒊𝒅𝒆 𝒐𝒑𝒑𝒐𝒔𝒊𝒕𝒆 𝒕𝒐 𝒕𝒉𝒆 𝒂𝒏𝒈𝒍𝒆
tan = TOA = 𝒔𝒊𝒅𝒆 𝒂𝒅𝒋𝒂𝒄𝒆𝒏𝒕 𝒕𝒐 𝒕𝒉𝒆 𝒂𝒏𝒈𝒍𝒆
𝑩𝑪 𝒉
Tan18⁰= =
𝑨𝑪 𝟒𝟎
h = 40 × tan 18
h= 40 × 0.3249
h= 12.996m
therefore, the height of the pole BD
= 12.996 + 1.8
=14.796m
The height of the pole =15m
EXAMPLE 4
A ladder leans against a wall. The end of the ladder touches the wall 12m from the ground. The
foot of the ladder is 9m away from the foot of the wall. Find
i. The length of the ladder
ii. The angle that the ladder makes with the ground
Solution
100
Let the length of the ladder be x
Let the angle that the ladder makes with the ground be
i. Applying Pythagoras theorem;
|𝑥|2 = |12|2 + |9|2
|𝑥|2 = 144 + 81
|𝑥|2 = 225
|𝑥| = √225
X= 15
Therefore, the length of the ladder is 15m
ii. From the diagram,
𝒔𝒊𝒅𝒆 𝒐𝒑𝒑𝒐𝒔𝒊𝒕𝒆 𝒕𝒐 𝒕𝒉𝒆 𝒂𝒏𝒈𝒍𝒆
tan = TOA = 𝒔𝒊𝒅𝒆 𝒂𝒅𝒋𝒂𝒄𝒆𝒏𝒕 𝒕𝒐 𝒕𝒉𝒆 𝒂𝒏𝒈𝒍𝒆
𝟏𝟐 𝟒
Tan= = =
𝟗 𝟑
= 𝑡𝑎𝑛−1(1.333)
= 53⁰
Therefore, the angle that the ladder makes with the ground is 53⁰
Example 5
Two observers, P and Q 32m apart observe a kite (K) in the same vertical plane and from the same
side of the kite. The angle of elevation of the kite from P and Q are 35⁰ and 52⁰ respectively. Find
the height of the kite correct to two decimal places.
Solution
101
Let |𝑂𝑄| = x km
From triangle OPK
𝑶𝑲
Tan32⁰= 𝑶𝑷
𝑂𝐾
0.7002=32+𝑋
𝑂𝐾
1.2799 = 𝑋
X= 38.652
OK= 1.2799x
= 1.2799 (38.652)
= 49.471m
Therefore, the height of the kite from the ground is 49.47m
Summary
In sum, we have learnt that;
• When someone looks upwards at an orange hanging on top of his tree. There is an angle
formed between the horizontal and the line of his sight in order for him to see the top of
the tree.
• This angle is what is referred to as the Angle of Elevation.
• When someone at the top of a tree looks downwards at an orange that has fallen on the
ground.
• The angle formed between line of his sight and horizontal ground in order for the person
at the top of the tree to see the orange on the ground now becomes the Angle of
Depression.
103
Chapter 3 SECTION 4: ALGEBRA OF VECTOR AND ITS NOTATION
Hello dear, you are now on unit 3 section 4 of this course. Before we proceed to this section to
explore other aspects of vectors, let’s try to understand what the term” vector” in mathematics
means. What do you think a vector is?
Section objectives
By the end of this section, you will be able to;
Definition of vectors
Otoo and Aboagye (2019) explained vector as” the translation or displacement of a certain
distance in a certain direction.” Two key terminologies are emphasized in this definition. What
are they? Obviously, they are Distance and Direction. This means that a vector describes any
quantity that has magnitude and a direction.
A vector is a line segment. It has a starting point and an ending point. A vector has direction
and length. It may be easiest to think of a vector as a line on a road map. There are usually
several ways to describe the route. Look at the diagram below;
Components of Vectors
Telling someone to get from the beginning to the end, you could say “Go 5 miles at an angle (in
standard position) of 45 degrees.” This direct route is a vector. In this case, 5 is the magnitude, or
length, of the vector, and 45° is the amplitude, or direction, of the vector.
Often, when we travel, we cannot go along the vector. Suppose you want to tell a neighbor how to
go from your apartment to the library. In the diagram above, you may tell them to “Go east 3.5
104
miles then north 3.5 miles.” You are breaking the directions into components. We often break
vectors into two components; the i component is the number of units in the x-axis direction and
the j component is the number of units in the y-axis direction.
A vector (also called a direction vector) is just something that has both magnitude (length, or
size) and direction. It is different from a regular number, since it really has two components to it.
We see vectors represented by arrows, so we can remember that we need to get a length of a vector
(the magnitude), as well as the direction (which way it’s pointing).
We use vectors in mathematics, engineering, and physics, since many times we need to know both
the size (length) of something and which way it’s going. For Example, with an airplane, we can
use a vector to measure the speed of the plane (the “size”) and the direction it’s flying.
Scalar Quantity
One key concept that relate to vector is Scalar. A scalar is a mathematical quantity with magnitude
only (in physics, mass, pressure or speed are good examples).
Let us remember that two-dimensional vectors can be represented in different ways. Among these
ways are; To show that a quantity is a vector, it is printed in bold as m or underline as m. Whenever
the displacement is of x unit in the x-direction and y-unit in the y-direction, the vector is written
as; (𝑦𝑥 )
a. Geometrical Notation
Here, we use an arrow to represent a vector. Its length is its magnitude, and its direction is
indicated by the direction of the arrow.
The vector here can be written OQ (bold print) or → with an arrow above it. Its magnitude (or
𝑂𝑄
length) is written |𝑶𝑸| (absolute value symbols). An alternate notation is the use of two-unit
vectors I (1, 0) and j (0, 1)
105
b. Rectangular Notation (a, b)
A vector may be located in a rectangular coordinate system, as is illustrated here.
The rectangular coordinate notation for this vector is v (6, 3) or → (6, 3). Note the use of angle
𝑉
brackets here.
Amplitude is the angle of the vector. We measure the angle as if the vector was the terminal side
of an angle in standard position.
Forms of vectors
A vector can take one of the following forms;
1. Distance bearing form
Example: (k, )
2. Component form
Example: (𝑦𝑥 )
3. Cartesian form
Example: (xi + yj)
106
Similarly, a vector from the origin, O to the points A and B is written as → and → with components
𝑂𝐴 𝑂𝐵
(𝑦𝑥 ) and (𝑎𝑏) respectively.
→ =→ - →
𝐴𝐵 𝑃𝐵 𝑃𝐴
EXAMPLE 1
If the coordinate of the points P and Q from the origin O are (-3,5) and (7,2) respectively, find →
𝑃𝑄
Solution
The vector PQ should be calculated as the distance from the origin (O) hence;
→ = → - →
𝑃𝑄 𝑂𝑄 𝑂𝑃
→ = (72) - (−35)
𝑃𝑄
= (7+3
2−5
)
10
= ( −3 )
EXAMPLE 2
Solution
→ =→ - →
𝑅𝑄 𝑂𝑄 𝑂𝑅
→ =→ - → ( since P is common)
𝑅𝑄 𝑃𝑄 𝑃𝑅
= (−68) - (−2
4
)
= (−6+2
8−4
)
= (−44)
(iii) → (𝑖𝑣) →
𝐶𝐴 𝐶𝐵
(𝑉) →
𝐵𝐴
Suggested answers
1. i. Vector: northwest (direction) and 60 miles (distance)
ii Scalar: only distance (110 miles)
iii Vector: straight down (direction) and 12 miles(distance)
Summary
In this section, we have learnt that;
108
• Also, a vector from the origin, O to the points A and B is written as → and → with
𝑂𝐴 𝑂𝐵
components (𝑦𝑥 ) and (𝑎𝑏) respectively.
• When the origin of the point A and B is say P instead of O, then → =→ - →
𝐴𝐵 𝑃𝐵 𝑃𝐴
109
Chapter 4 SECTION 5: VECTOR OPERATIONS
My dear students, you have done very well so far and I hope you can recall all that we have learnt
from section 4 of this unit. This is the fifth section of unit 3 and we want to explore the various
operations with vectors. We shall look at addition, subtraction and scalar multiplications involving
vectors.
Section objectives
• Addition
Addition of vectors can be expressed by a diagram. Placing the vectors end to end, the vector from
the start of the first vector to the end of the second vector is the sum of the vectors. One way to
think of this is that we start at the beginning of the first vector, travel along that vector to its end,
and then travel from the start of the second vector to its end. An arrow constructed between the
starting and ending points defines a new vector, which is the sum of the original vectors.
Algebraically, this is equivalent to adding corresponding terms of the two vectors:
Addition of vectors is also called resultant of vectors. Resultant is a single vector that gives the
total effect of number of vectors. Resultant can be found by using
a) Triangle law of vectors
110
b) Parallelogram law of vectors
c) Polygon law of vectors
Two vectors can be added either by triangle law or parallelogram law of vectors. Addition
follows the parallelogram construction. Subtraction (a- b) is defined as the addition (a + (-b)). It
is useful to remember that the vector a - b goes from b to a. The following results follow
immediately from the above definition of vector addition:
(a) a + b = b + a (commutativity)
(b) (a + b) + c = a + (b + c) = a + b + c (associativity)
(c) a + 0 = 0 + a = a, where the zero vector is 0 = [0, 0, 0].
(d) a + (-a) = 0
• Subtraction
Subtraction of vectors can be shown in a diagram form by placing the starting points of the two
vectors together, and then constructing an arrow from the head of the second vector in the
subtraction to the head of the first vector. The result of vector subtraction is called the difference
of the two vectors. Algebraically, we subtract corresponding terms:
NB: Vectors are added or subtracted by adding or subtracting their corresponding components
i. a+b
ii. a-c
iii. b-c
iv. a–c+b
Suggested Answers
i. a + b = (−7
0
) + (55)
= ( −7 + 5
0+ 5
)
−2
= ( 5)
ii. a – c = (−7
0
8
) - (−2 )
−7 − 8
= ( 0 + 2)
= (−152
)
• Scalar-Vector Multiplication
Another operation is scalar multiplication or scalar-vector multiplication, in which a vector is
multiplied by a scalar (i.e., number), which is done by multiplying every element of the vector by
the scalar. Scalar multiplication is denoted by juxtaposition, typically with the scalar on the left,
as in
Where
112
→ = (𝑦𝑥 ) then k → = (𝑘𝑦 𝑘𝑥
) where k is a scalar or number which can be negative or positive
𝐴𝐵 𝐴𝐵
whole number or fraction. When k is positive, it implies the vectors are parallel and in the same
direction. When k is negative, it means the vectors are parallel but in opposite directions. The
length of the new vector is k times the length of the original vector.
NB: To find the scalar multiplication of a vector, multiply each component of the vector by the
scalar.
Example 1
Solution
2 → = (2×(−1)
2 ×5
) =( −2
10
)
𝐴𝐵
and
3 → = (3×(−1)
3 ×5
) =( −3
15
)
𝐴𝐵
3 −2
2. If 𝒓 = ( 10
) and s = ( 5
), calculate 6(r + 2s)
Suggested answers
8 7 14
1. i. (−10) ii. ( 5
) iii. ( −8 )
−6
2. (120 )
Summary
In this section, we learnt that;
113
• Vector addition is associative when (a + b) + c = a + (b + c). We can therefore write
both as a + b + c.
• Adding the zero vector to a vector has no effect, a + 0 = 0 + a = a.
• Subtracting a vector from itself yields the zero vector, a - a = 0.
• To find the scalar multiplication of a vector, multiply each component of the vector by the
scalar.
114
Chapter 5 SECTION 6: MAGNITUDE AND DIRECTION OF A VECTOR
Dear student, we welcome you to the last section of unit 3, which is section 6. The section will
consist of two parts. The first part of this section will look at magnitude of vectors while the last
part will talk about its direction. We encourage you to use your prior learning of coordinate
geometry of determining the magnitude of a line to help you understand the magnitude of a vector
better. Do well to also link and apply our previous lesson on trigonometric ratio to the subsequent
lesson on direction of a vector.
In this first part of the section, we seek for you to gain a deeper understanding of magnitude as a
component of vector. Do you remember what we learnt earlier about the component of vectors?
Obviously, you recall that a vector quantity has magnitude and direction. Displacement, velocity,
momentum, force, and acceleration are all vector quantities. For Example, in Greek mathematics
textbooks, a vector is determined by three components: magnitude, path, sense, whereas, in Greek
physics textbooks, a vector quantity is determined by two components: magnitude and direction
(including the concepts of path and sense)
Section objectives
By the end of this section, you will be able to;
Magnitudes of Vectors
The length of a vector is its magnitude. The symbol for magnitude of a vector can be written → or
𝑉
like an absolute value, |𝑣| , or just written “magnitude”. If you are given the components of a
vector, use the distance formula to find the magnitude.
To calculate the magnitude (length) of a vector, we may use some plane geometry. First, sketch
your vector on the coordinate plane. Because the angle between the x- and y-axes is 90º, it should
be straightforward to draw a right triangle with the arrow as hypotenuse and sides parallel to the
axes. Then the length of the hypotenuse may be calculated from the lengths of the sides using the
Pythagorean theorem 𝑐 2 = √𝑎2 + 𝑏 2 as in the Example below.
115
Example
What is the magnitude of vector A drawn in the diagram below?
Solution
If you zoom in on the arrow, the blown-up picture looks like: where the vector arrow is the
hypotenuse of a right triangle that has a horizontal side with length of 4 units and vertical side of
length 1 unit.
Plugging this into the Pythagorean theorem:
𝑐 2 = √𝑎2 + 𝑏 2
√42 + 12
√16 + 1
√17
If you wish, you could also plug the coordinates into the distance formula,
√(𝒙𝟐 − 𝒙𝟏 )2 + (𝒚𝟐 − 𝒚𝟏 )2
As this formula is derived by putting generic coordinate points into the Pythagorean theorem, the
calculation is essentially identical to the geometric method used in the Example. In the Example
above, if you plug the points ( 𝒙𝟏 , 𝒚𝟏 ) = (5,1) and ( 𝒙𝟐 , 𝒚𝟐 ) = (1, 2) into the distance formula, you
will calculate the same result as √17
Since the magnitude of a vector is like a length measurement, the magnitude is always positive.
116
Example 1
if → =(−68), find |→ |
𝑃𝑄 𝑃𝑄
Solution
|→ | is the length of →
𝑃𝑄 𝑃𝑄
|→ | = √𝑥 2 + 𝑦 2
𝑃𝑄
= √(−6)2 + 82
= √36 + 64
= √100
= 10 unit
If A (𝑥1, 𝑦1, ) and 𝐵(𝑥2, 𝑦2, ) are two given points, then the vector joining A and B is given by
→ =→ - → = a-b
𝐴𝐵 𝑂𝐵 𝑂𝐴
Also,
→ =d–c
𝐶𝐷
NB: Note the order of the subtraction thus, the vector → = position vector of B – position
𝐴𝐵
vector of A
Check Your Progress
Example 1
117
Example 2
A triangle ABC has vertices A (-2, -4), B (10,1) and C (3,8). Find the length of the sides AB and
AC. Show that the triangle is isosceles.
Example 3
A triangle XYZ has vertices X (5,5), Y (6,2) and Z (4,8). Find whether triangle XYZ is an
equilateral triangle.
Suggested Answers
1. → = → - → = =(−1
5
) - (48)= (−5
−3
)
𝐴𝐵 𝑂𝐵 𝑂𝐴
→ = → - → = =(48) - (−1
5
) = (53)
𝐵𝐴 𝑂𝐴 𝑂𝐵
Therefore, since the two sides of the triangle are of the same length=13units, the triangle is
isosceles because two of its sides are equal.
3. For XYZ to be an equilateral triangle, each of the length of its sides should be of the same
length. Find the length of AB, BC and AC to be sure they are of same length.
Summary
In a nut shell, we have learnt that;
• If A (𝑥1, 𝑦1, ) and 𝐵(𝑥2, 𝑦2, ) are two given points, then the vector joining A and B is given
by → = → - → = a – b
𝐴𝐵 𝑂𝐵 𝑂𝐴
• The magnitude of a vector is its length.
• The symbol for magnitude of a vector can be written → or like an absolute value, |𝑣| , or
𝑉
just written “magnitude”
• We use the distance formula to find the magnitude of a vector
• To find the length of the hypotenuse, you calculate from the lengths of the sides using the
Pythagorean theorem 𝑐 2 = √𝑎2 + 𝑏 2
• Since the magnitude of a vector is like a length measurement, the magnitude is always
positive.
118
DIRECTION OF A VECTOR (BEARINGS)
Dear student, we welcome you once more to the second part of section 6 of unit 3. In this part of
the section, we aim at helping you to gain a deeper understanding of another major component of
vectors. Do you remember what we learnt earlier about the component of vectors? Obviously, you
recall that a vector quantity has magnitude and direction. In the first part of this unit, we learn a
lot about the magnitude of a vector. We now want to throw a spotlight on the second component
of a vector; Direction. Imagine a motorist riding from left of a point P to another point Q of 300m
on an angle of 120⁰ observed a beer bar (R)150m away on angle of 050⁰. How far west do you
think is P from the beer bar? How far north is also the point R from Q.
The ideas from these situations in our introduction bring to mind the idea of bearings. What then
is a bearing? Let us look at it. A bearing is an angle measurement made with reference to the north
and measured in clockwise direction usually expressed as a three-digit number of figures. This
means that its measurement starts from the north pole
A bearing gives direction in terms of an angle.
The four cardinal directions are
i. North (000⁰)
ii. South (180⁰)
iii. East (090⁰)
iv. West (270⁰)
The directions NE, NW, SW, SE are frequently used for compass bearings.
119
Kinds of bearing
There are two kinds of bearing
i. Compass bearings
ii. Three-figure bearings
A. Compass Bearing
Compass bearings are based on the four main cardinal points; North, South, East and West. It is
an angle measured from the north or south. The directions half-way between the cardinals are
marked in the figure below;
For Example
N 50⁰ E means from N measure 50⁰ towards E where N and E are North and East respectively.
S 40⁰ W from S measure 40⁰ towards W.
N 20⁰ E is pronounced as 20⁰ north east.
The compass bearing of N 32⁰ E means an angle of 32⁰ measured from the north towards east and
the 3-digit bearing is (90⁰ - 32⁰) = 58⁰
Again, a compass bearing of N50⁰W means an angle of 50⁰ measured from the north towards west
and the 3-digit bearing is (360⁰ - 50⁰) = 310⁰
120
Also, a compass bearing of S 15⁰ E means an angle of 15⁰ measured from the south towards east
and the 3-digit bearing is (180⁰ - 15⁰) = 165⁰
Figure 1
Trial Question
Explain the compass bearing of S75⁰W. Find the 3-digit bearing.
B. Three-Figure Bearings
When we talk of bearings nowadays, we mean “three-figure bearings”. A three-figure bearing is
always:
121
Bearing of 065⁰ Bearing of 235⁰
Types of Bearing
Back Bearing
Consider heading out to some place and have to return along the same line of travel. This is where
the idea of back bearing becomes so useful.
Back bearing as the name indicates, is the reverse of a given bearing. For Example, the back
bearing of the bearing P from Q is the bearing Q from P. In general, if the bearing of P from Q is
, then the bearing of Q from P is
122
2. When the direction of travel bearing is Greater than 180⁰. If for Example, your bearing
is 225⁰, which is greater than 180⁰, then your back bearing works out to be 45⁰
Back bearing = (Direction of travel bearing- 180⁰)
BB = B - 180⁰
BB = 225⁰ - 180⁰
= 45⁰
Note
Example 1
The bearing of Y from X is 320⁰. Find the bearing of X from Y.
123
Solution
The bearing of X from Y = 320⁰ - 180⁰ = 140⁰
Example 2
The bearing of a point B from a point C is 062⁰. What is the bearing of C from B?
Solution
The bearing of C from B = 62⁰ + 180⁰ = 242⁰
Example 3
A ship is on a bearing of S65⁰W from a harbor. What is the bearing of the harbor from the ship?
Solution
The back bearing in cardinal S65⁰W is the same as N65⁰E
So, the back bearing of the harbor from the ship is N65⁰E
Example 4
If P is equidistant from Q and R. The bearing of Q from P is 80⁰ and the bearing of R from P is
130⁰. What is the bearing of R from Q?
Solution
From the question |𝑃𝑄|=|𝑃𝑅|since P is equidistant from Q and R. The bearing of R from Q is
measured from the north pole of Q in the clockwise direction to the line RQ
∠𝑃𝑄𝑅 = 65⁰
The bearing of R from Q = 270⁰ - (10⁰+65⁰)
= 195⁰
Example 5
Solution
= (15km, 240⁰)
→ = → - →
𝐶𝐷 𝑂𝐷 𝑂𝐶
= (75) - (53)
= (22)
125
= |→ | = √(2)2 + (2)2
𝐶𝐷
= √4 + 4
= √8
= 2√2
𝑜𝑝𝑝𝑜𝑠𝑖𝑡𝑒 2
tan 𝜃 = 𝑎𝑑𝑗𝑎𝑐𝑒𝑛𝑡 = = 𝑡𝑎𝑛−1 (1) = 45⁰
2
2. Three towns P, Q and R are such that P is 3km on a bearing of 090⁰ from Q. R is 9km due
south of P. find the bearing of Q from R.
3. A jet fighter flew from its base to the enemy’s camp, a distance of 510km on a bearing of
060⁰. It was then forced to fly a distance of 450km on a bearing of 150⁰ to a fuel station
to refill. Calculate
i. The distance between the fuel station and the base (2 decimal place)
ii. The bearing of the base from the fuel station to the nearest whole number.
5. Three towns P, Q and R in the same plane are such that → = (32km, 222⁰) and → =
𝑃𝑄 𝑄𝑅
(56km, 312⁰). Calculate
a) |→ |
𝑃𝑅
b) The bearing of Q from R
c) A town M is on → such that |→ | : |→ | = 2:3, find |𝑀𝑄|
𝑃𝑅 𝑃𝑀 𝑀𝑅
126
6. A (-3,5) and B (-1,2) are two points. Express AB in the form (k, 𝛼) where k is the
magnitude and 𝛼 is the bearing)
d) The points M, N and R are three villages in the same plane of a district. |→ |= (ykm,
𝑀𝑁
042⁰), |→ |= (= (22km, 315⁰) and |→ |= 32km.
𝑅𝑁 𝑀𝑅
a. Find y
b. T is a point on |→ | such that
𝑁𝑅
∠𝑀𝑇𝑁 = 30⁰ find |→ |
𝑀𝑇
Suggested Answers
i. From ∆𝐵𝐶𝑆
= √462600
= 680.147
The distance between the base and the fuel station is 680.147km
510
𝛽 = 𝑡𝑎𝑛−1 (450)
= 48.58⁰
Bearing of the base from the fuel station
= 360⁰ - (48.58⁰ + 30⁰)
= 360⁰ - 78.58⁰
= 281.42⁰ = 281⁰
Summary
Recapping our lesson in this section, we learnt that;
128
UNIT 4
MENSURATION
Introduction
Dear student, you are heartily commended for journeying with us through the beginning units to
the unit 4 of this course. It is our hope that you will be able to synchronize all that you have learnt
in the first three units to help you gain conceptual grounds in this unit. In this fourth unit, you will
be introduced to mensuration and it is a promise that you will come to love this unit also. The unit
reintroduces the fundamentals of geometry; the three dimensions, and the origins of length, area
and volume calculations. Concepts such as measurement of length (arc length, radius, diameter,
chord), teaching area of a sector, area of segments volume of cone, cylinder as well as application
of mensuration to real life problems will be brought up. It will finally reinforce the mathematical
vocabulary related to the concept of mensuration.
Unit Content
This unit is about:
Section 1: Concept of Mensuration
Section 2: Parts of a Circle
Section 3: Measurement of Length (arc length, radius, diameter and a chord)
Section 4: Area of a Sector and Segment
Section 5: Volume of Cone and Cylinder
Section 6: Mathematical Vocabularies in Mensuration.
129
Chapter 6 SECTION 1: MENSURATION
Introduction
Dear student, welcome to the first section of unit 4 of this course. In this unit, we will take a walk
to the world of mensuration to explore what the term is and the major component of mensuration.
Section objectives
By the end of this section, you will be able to;
Mensuration
Dear student, you must have noted that since the beginning of civilization, man has explored the
question of "how much"? Man started measuring land, water or milk etc. This was the origin of
mensuration which later became the basis of architecture and engineering. Mensuration is a branch
of mathematics which deals with lengths of lines, area of surfaces and volume of solids. A student
studying mensuration should be conversant with metric system of units and their conversion. The
concepts of area and volume should be known to the student. This brings us to another important
unit of the course. In this unit, we shall discuss about mensuration, its teaching, learning and
application. You have studied about perimeters and areas of plane figures like rectangles, squares,
triangles, trapeziums, circles, kites etc. These are called plane figures because each of them lies in
a plane. However, most of the objects that we come across in daily life do not lie in a plane. Some
of these objects are bricks, balls, ice cream cones, drums, and so on. These are called solid objects
or three-dimensional objects. The figures representing these solids are called three dimensional or
solid figures. Some common solid figures are cuboids, cubes, cylinders, cones and spheres. Many
students seem to get confused by whether they should use m2, cm3 or whatever. They also
sometimes confuse about the formulae for volume/area/length. This discussion is designed to
review and reinforce the difference between these concepts.
Mensuration is the aspect of geometry that emphasize measurement of length of lines, perimeter,
area, surface area and volume of both solid and plane shapes. The idea of geometry is useful to us
as student so we can apply this in our real life of today.
Mensuration is a therefore a branch of mathematics which deals with the lengths of lines, areas of
surfaces and volumes of solids.
Types of Mensuration
130
The two basic types of mensuration are:
1. Plane Mensuration
It deals with the sides, perimeters and areas of plane figures of different shapes.
2. Solid Mensuration
It deals with the areas and volumes of solid objects
Suggested Answers
1. Mensuration is a branch of mathematics which deals with the lengths of lines, areas of
surfaces and volumes of solids. (Search online for other definitions)
2. We think solid mensuration should be fundamental since that is what is real to learners and
is used in their daily activities
3. lines, length, areas, surfaces, volumes, cones, circles, triangles, sphere, cubes, cylinder,
cuboid etc.
Summary
We have studied in this section that;
131
Chapter 7 SECTION 2: A CIRCLE
Introduction
Dear students, you are now in section 2 of this unit. Your background knowledge of circles will
be very helpful in this section. In this section, we shall talk extensively about circles and learn
some basic terminologies that are associated with circle in mensuration. Stay with us as we go
through these concepts together because of its applicability in the learning of the subsequent
sections of this unit.
Section objectives
• Identify the major parts of any circle using the right terminology
• Apply circle terminologies in the next section to solve mensuration problems
A CIRCLE is the locus of a set of points which are equidistant (equal distance) from a fixed point.
The circular part of a circle is called the CIRCUMFERENCE. Thus, it is the perimeter of the
circle. See fig A
Any part of the circumference of a circle is called an ARC. Thus, the distance along the
circumference is the arc. See fig
The fixed point is known as the CENTRE. In the figure above, the centre is the dot and usually
termed as the origin (o)
132
The distance from the centre to any part or point of the circumference is called the RADIUS. Thus,
a line joining the centre of a circle to any of the points on the circle is known as the radius. See fig
A and B
A line that divides the circle into two equal parts is called a DIAMETER. A diameter is formed
from two radius. Each of the radius is represented below as (r)
A straight line joining two points on the circumference is called a CHORD. Thus, the chord AB
is the Diameter. The line CD forms a chord which is not a diameter. Can you explain why?
A chord divides the circle into two parts called the SEGMENT. A segment is an area bounded by
a chord and intercepted by an arc.
133
Image source: https://www.mathlearnit.com/perimeter-of-segment.html
The area bounded by two radii of the circle is called a SECTOR of the circle. The unshaded region
in figure above is a sector of the circle. See the two figure below;
The shaded part of the figure below is called a minor sector of the circle. See fig below;
The shaded part in the circle below is called the major sector of the circle.
134
When a circle is divided into four equal parts, one quarter of the area of the circle is called a
QUADRANT. The shaded part of the figure below forms one of the four quadrant of a circle.
A SEMI-CIRCLE is half the area of the circle. Thus, when the circle is divided into two equal
parts, each part becomes a semi-circle. See fig G
The names of the principal parts of a circle are shown below;
Suggested answer
1. Search online for other definitions of the terms.
2. Draw to show each term. Be accurate in your drawing
Summary
In summary, we have learnt that a circle has many key parts that are used in mensuration.
Among these key parts of a circle are the Circumference, Arc, Origin of a circle, Radius,
Diameter, Chord, Segment, Minor Segment, Major Segment, Sector, Quadrant and a Semi-circle.
It is our hope that you will always remember these concepts to be able to apply them in our next
section.
136
Chapter 8 SECTION 3: MEASUREMENT OF LENGTH
Introduction
Congratulation, my dear student for painstakingly going through the first two section of the unit 4
with us. Welcome to section 3 of unit 4 and in this section, we shall explore various concepts of
length in mensuration. These concepts include learning about the length of arc, perimeter,
circumference, and sector. It is our hope that your prior knowledge of circle theorem and parts of
a circle will be used to aid the effective learning of measurement of length in this section.
Section objectives
In the figure, the length of arc AB of the sector AOB is l. if 𝜃 is the angle of the sector ad r is the
radius of the circle from which the sector is, then the length of the arc of the sector will be the
product of the ratio of the angle subtended by the arc to the total angle of the circle and the
circumference of the circle.
Hence, the length of arc is given by
𝜃
× circumference of a circle
360
137
𝜃
× 2𝜋𝑟
360
Example 1
Find the length of an arc that subtend an angle of 80⁰ at the centre of a circle with radius 7m
Solution
𝜃
the length of arc =360 × 2𝜋𝑟
= 9.78m
Hence, the length of the arc of that circle is 9.78m
Suggested Answers
1. 45.5
2. 1400
138
Chapter 10 B. Perimeter / Circumference of A Circle
The circumference of a circle is the perimeter of that circle. Thus, it is the total distance around
the circle. The circumference of a circle with radius (r) is given by
C=2𝜋𝑟
22
Where 𝜋 = 3.142 𝑜𝑟 and r = radius
7
= C= 𝜋d where d=diameter or 2r
In conclusion, the circumference of a circle with radius (r) or diameter(d) is given by;
C = 2𝝅𝒓 or
C= 𝝅𝒅
Example 1
A circle has radius 7cm. Find its circumference.
Solution
C = 2𝜋𝑟
22
Where 𝜋 = 3.142 𝑜𝑟 and r= radius
7
C= 2 × 3.142 × 7
= 14 × 3.142
= 43.988cm
Example 2
Find the perimeter of a circle whose radius is 3.2m.
Solution
Perimeter of that circle is the circumference of a circle
139
C = 2𝜋𝑟
22
Where 𝜋 = 3.142 𝑜𝑟 and r= radius
7
C= 2 × 3.142 × 3.2
= 6.4 × 3.142
= 20.1m
Suggested answer
Circumference = 2𝜋𝑟
22
Where 𝜋 = 3.142 𝑜𝑟 and r= radius
7
C= 2 × 3.142 × 8
= 16× 3.142
= 50.272 cm
The perimeter of a sector is the sum of all the length of the boundaries of the sector. The
perimeter of a sector whose length is L in a circle with a radius r is given by
P= length of the arc + 2 of its radius
P= L + 2r
Where P= perimeter, L=length of arc and r=radius
EXAMPLE
If the radius of a sector is 21cm and its sector angle is 65⁰. Calculate the
a) Length of its arc
b) Perimeter of the sector
140
SOLUTION
a) The length of arc is given by
65
L=360 × 2𝜋𝑟
65
L=360 × 2(3.142 ×21)
65
L= 360 × 2(65.982)
13
=72 × 2(65.982)
L= 23.8cm
Suggested answers
1. 63cm
2. 1800
141
Summary
In brief, we have learnt from the section that;
142
Chapter 12 SECTION 4: MEASUREMENT OF AREA
Introduction
Dear students, we are going to study about how to find and calculate areas of various shapes in
this section. We must always remember that the unit for measuring area is a square. Examples of
area are 𝒄𝒎𝟐 , 𝒎𝟐 , etc
Section objectives
By the end of this section, you will be able to calculate for the;
• Area of a circle
• Area of a given sector
• Area of a given segment
A= 𝝅𝒓𝟐
Or
𝟏
A= 𝟒 𝝅𝒅𝟐 where 𝝅 = 3.142 𝑎𝑛𝑑 𝒅 = 𝑑𝑖𝑎𝑚𝑒𝑡𝑒𝑟 (d=2r)
EXAMPLE 1
22
Find the area of a circle with radius 7cm. (Take 𝜋 = )
7
Solution
22
A= 𝜋𝑟 2 where r-=7 and 𝜋 = 7
22
A= × 72
7
A= 22 × 7
= 154𝑐𝑚2
143
EXAMPLE 2
Substituting
22 2
154 = 𝑟
7
154 × 7 = 22𝑟 2
1078
𝑟2 = 22
r= √49
r= 7cm
C= 2 × 3.142 × 7
= 14 × 3.142
= 43.988cm = 44cm
Suggested Answers
144
Chapter 14 Area of A Sector
𝜃
A= × 𝜋𝑟 2
360
EXAMPLE
Find the area of a sector that subtends an angle of 40⁰ at the centre of circle with radius 8cm.
22
(Take 𝜋 = 7 )
Solution
𝜃
A= × 𝜋𝑟 2
360
22
Where 𝜃 = 40⁰, 𝜋 = and r = 8
7
40 22
= × × 82
360 7
1
= 9 × 201.143
=22.35𝑐𝑚2
Example 2
Find the area of the sector that has radius 7.5 with an angle of 150⁰.
Solution
The area of a sector is given by
𝜃
A= × 𝜋𝑟 2
360
22
Where 𝜃 = 150⁰, 𝜋 = and r = 7.5
7
150 22
= × × 7.52
360 7
5
= 12 × 176.786
=73.66𝑐𝑚2
145
Check your progress
1. Find the area of sector OAB given that angle AOB is 60 and the radius is 7cm.
2. The area of the circle is 48𝑐𝑚2 . Find the area of its sector.
The segment is the area formed by a chord and an arc. Consider the diagram below;
The shaded portion is the minor segment while the other portion is the major segment
The area of the minor segment is given by
Area of a sector POQ- Area of the triangle OPQ
𝜃 1
= 360 × 𝜋𝑟 2 - 𝑟 2 𝑠𝑖𝑛𝜃
2
Example 1
In the diagram below, MN is a chord of a circle of radius 5cm. the chord subtends an angle of
96⁰ at the centre. Find the area of the minor segment cut off by MN.
146
Solution
𝜃 1
Area of a segment = × 𝜋𝑟 2 - 𝑟 2 𝑠𝑖𝑛𝜃
360 2
96 22 1
= 360 × × 52 - 52 𝑠𝑖𝑛96
7 2
= 20.95 – 12.43
=8.52𝑐𝑚2
Example 2
22
Calculate the area of the shaded segment of the sector in the diagram below. (Take 𝜋 = )
7
147
Solution
𝜃 1
Area of a segment = 360 × 𝜋𝑟 2 - 𝑟 2 𝑠𝑖𝑛𝜃
2
75 22 1
= 360 × × 122 - 122 𝑠𝑖𝑛75
7 2
= 94.29 – 69.55
=24.74c𝑚2
2. Six horse are tethered with ropes measuring 14m each to the four corners of a rectangular
grass land 21m × 24m in dimension. Find;
a) The maximum area that can be grazed by the horses
b) The area of the grass that remains ungraced.
Suggested Answers
3.. i. Length of its arc=15.71cm
hence perimeter of quadrant = l + 2r = 15.71 + 20= 35.71 cm
148
𝜃
• The Area of a sector of a circle with centre angle 𝜃 is given by 𝐴 = × Area of a
360
𝜃
circle and this is the same as A= × 𝜋𝑟 2
360
• The segment is the area formed by a chord and an arc.
𝜃
• The area of a segment is calculated as Area of a sector - Area of the triangle = 360 × 𝜋𝑟 2
1
- 𝑟 2 𝑠𝑖𝑛𝜃
2
149
Chapter 16 SECTION 5: MEASUREMENT OF VOLUME
Introduction
Dear students, we are going to study about how to find and calculate the volume, surface areas
and total surface area of various shapes in this section. We must always remember that the unit
for measuring volume is a cube. Examples of the unit for measuring volume are 𝒄𝒎𝟑 , 𝒎𝟑 , etc
Section objectives
By the end of this section, you will be able to calculate for the;
Chapter 17 Cylinder
A cylinder is a plane figure that has a curved surface and two parallel and congruent bases. This
means that a cylinder has three surfaces; a top, bottom and middle. Most cylinders have circular
base which means the top and bottom are circles. A cylinder can also be viewed as a prism with a
circular cross-section. An Example of a cylindrical objects are milk or milo tin.
150
• Volume
To find the volume is to find the amount of space within the cylinder. First, find the area of the
two circular shapes at the top and bottom and multiply it by the height of the cylinder. The two
circular bases are circular hence their area is 𝜋𝑟 2 = 𝜋𝑟 2
= 𝜋𝑟 2 ℎ
=𝑐𝑖𝑟𝑐𝑢𝑚𝑓𝑒𝑟𝑒𝑛𝑐𝑒 × ℎ𝑒𝑖𝑔ℎ𝑡
=2𝜋𝑟 ×ℎ
=2𝜋𝑟ℎ
=2𝜋𝑟2+2𝜋𝑟ℎ
151
Example 1
A water tank with one end open has diameter 140mm and height 50mm. calculate
i. Area of the base
ii. Curved surface area
iii. Total surface area
iv. Volume of the tank.
Solution
1 1
Given the diameter of 140mm, radius=- of the diameter= - 2×140 = 70mm
2
=770,000mm3
152
Example 2
A closed cylinder has radius 7cm and height 15cm. find
a) The area of the curved surface
b) The area of its base
c) The total surface area
d) The volume of the cylinder
Solution
Diagram
=659.82𝑐𝑚2
=967.736𝑐𝑚2
3. Kofi Asare has a cylindrical tank of volume 1100𝑚3 . If the diameter of the base is 10m,
22
determine its height. (Take 𝜋 = 7 )
4. Mr. Asare half-filled his cylindrical bucket with water. If the diameter of the bucket is
22
14cm with height 20cm, calculate the volume of the water in that bucket. (Take 𝜋 = 7 )
Suggested answers
1. 188.57cm3
2. i. 15,400 𝑚𝑚2 ii. 22,000m𝑚2 iii. 37,400𝑚𝑚2 iv. 770, 000𝑐𝑚3
3. Height is 7 cm
4. 3080 𝑚2
Chapter 18 Cone
A cone is a type of geometric shape. There are different kinds of cone. They all have flat surfaces
on one side that tapers to a point on the other side. In order to calculate the surface area and
volume of the cone, we first need to know these;
(A) RADIUS (r) is the distance from the center of the circle to the edge of the circle at the
end
(B) HEIGHT(h) is the distance from the center of the circle to the tip of the cone
(C) SLANT HEIGHT (L) is the length from the edge of the circle to the tip of the cone.
22
(D) PI (π) is a special number approximately equal to 3.142 or 7
• Volume
Volume is how much space takes up the inside of cone. The answer is always in cubic unit.
Since the base of the cone is a circle, it has a radius(r). When a line is drawn from the apex of the
cone, it forms a height (h). The circular base of the circle has an area = 𝜋𝑟 2
1
= 3 of 𝜋𝑟 2 𝑜𝑓 ℎ
1
= 3𝜋𝑟 2 ℎ
Example 1
The slant height and the height of a cone is 40 cm. If the radius of the cone is 32cm. Calculate
a) The volume of the cone
155
b) The curved surface area
c) The total surface area
Solution
1
a) Volume of a cone = 3𝜋𝑟 2 ℎ
1
=3 × 3.142 × 322 × 40
= 42898.77𝑐𝑚3
=7239.168𝑐𝑚2
OR
Example 2
A conical tent is of the diameter 24 m at the base and its height is 16.m.
a) Find the slant height
b) The canvas required in square meters
c) At most how many persons can the tent accommodate if each person required 54 m3 of air?
22
(Take 𝜋 = 7 )
Solution
Ask 1: What data is given in the problem?
The diameter of the cone = 24 m
156
The height of the cone = 16 m
The volume of air required by each person = 54 m3
A) Slant height. I= √𝑟 2 + ℎ2
=√122 + 162
= √144 + 256
= √400
=20m
= 44.7
157
=45 persons
Check Your Progress
1. A cone is 14cm deep and the base radius is 4.5cm. calculate the volume of water that is
22
exactly half the volume of the cone. (Take 𝜋 = 7 )
2. The height of a right circular cone is 4cm. the radius of its base is 3cm. find its curved
22
surface area. (Take 𝜋 = 7 )
3. A sector containing an angle of 120⁰ is cut from a circle of radius 21cm and folded into a
cone. Find
a) Curved surface area of the cone
b) Total surface area of the cone
4. Let O and C be the center of the base and the vertex of a right circular cone. Let B be any
point on the circumference of the base. If the radius of the cone is 6cm and if angle OBC
=60, find
a) The height of the cone
b) The curved surface area of the cone
5. Find the total surface area of a cone of base radius 4cm and with slant height 9cm.
Suggested Answers
1. 148.5 cm3
2. 47.14cm2
5. Total surface area = 𝜋𝑟 2 + 𝜋𝑟𝑙
= 3.142 × 42 + (3.142×4×9)
= 50.272 + 113.112
=163.384𝑐𝑚2
Cube
A cube is a three-dimensional figure with six faces. All its angles are right –angled. All the faces
of a cube are equal. It is a shape covered by six squares. In a cube, all the lengths are equal since
it is formed from a square.
158
Area
To find the area of the face of a cube, multiply the length by the other length= 𝑙𝑒𝑛𝑔ℎ𝑡 × 𝑙𝑒𝑛𝑔𝑡ℎ
=𝑙 ×𝑙
=𝐿2=L2
Volume
The cube is a special case of a cuboid, where l = b = h. Hence, volume of cube = L× L × L =𝐿3
=6𝐿2
Example 1
If the volume of a cube is 2197, find;
a) length of one of its sides
b) surface area.
Solution
i. Let the edge (length) of the cube be x cm.
159
So, its volume = 𝑎 𝑐𝑚3
Therefore, from the question, we have
a 𝑐𝑚3 = 2197
3
a 𝑐𝑚3 = 13×13×13 or √2197
So, a = 13
Therefore, the length of the cube = 13 cm
= 1014 𝑐𝑚2
Suggested Answers
1. a) 4 × 4 × 4 = 64𝑐𝑚3
b) 1.5 × 1.5 × 1.5 = 3.375𝑐𝑚3
2. 125cm3
3. 2 cm
4. 96cm3
160
Chapter 19 Cuboid
A cuboid is also a three-dimensional object with six faces. All its angles are right-angled and
opposite faces are equal. A cuboid is the same as a rectangle prism or a rectangular solid. In a
cuboid, the length, width and height are different.
• Volume
To find the volume of a cuboid, multiply the dimensions; length, width and height.
V =𝑙𝑤ℎ
• Surface Area
To calculate the surface area of the cuboid, we first calculate the area of each of the six faces and
add up all the area to get the total surface area.
First, find the area of the rectangle at the top and bottom surface. the top and bottom have only
length and breadth, hence its area is calculated by multiplying the length by the breadth =
𝑙 × 𝑏 = 𝑙𝑏
TOP BOTTOM
𝑙𝑏 + 𝑙𝑏 = 2𝑙𝑏
Secondly, find the area of the front and back surface. The front and back has only length and
height = 𝑙 × ℎ = 𝑙ℎ
FRONT BACK
𝑙ℎ + 𝑙ℎ = 2𝑙ℎ
Thirdly, find the area of the two other side surfaces. The two-side surface has only breadth and
height = 𝑏 × ℎ = 𝑏ℎ
161
𝑆𝐼𝐷𝐸 𝑆𝐼𝐷𝐸
𝑏ℎ + 𝑏ℎ = 2𝑏ℎ
=2𝑙𝑏+2𝑙ℎ+2𝑏ℎ
Example 1
Find the height of a cuboid whose volume is 275𝑐𝑚3 and base area is 25𝑐𝑚2 .
Solution
Volume of a cuboid = Base area × Height
Hence height of the cuboid
Volume of cuboid
= 𝐵𝑎𝑠𝑒 𝑎𝑟𝑒𝑎
275
=
25
= 11 cm
Height of the cuboid is 11 cm
Example 2
Example 9: A key soap box is in the form of a cuboid of measures 60 m × 40 m × 30 m. How
many cuboidal boxes can be stored in it if the volume of one box is 0.8 m3?
Solution
Volume of one box = 0.8 m3
Volume of key soap box = 60 × 40 × 30 = 72000 m3
Number of boxes that can be stored in the key soap box =
Volume of key soap box
𝑣𝑜𝑙𝑢𝑚𝑒 𝑜𝑓 𝑜𝑛𝑒 𝑏𝑜𝑥
162
72000
0.8
= 90,000
Hence the number of cuboidal boxes that can be stored in the key soap box is 90,000.
Example 3
Consider a rectangular box with breadth 5cm and length 4cm. If the box has a height of 3cm,
find
a) its total surface area
b) its volume
Solution
= 94𝑐𝑚2
= 60𝑐𝑚3
Example 4
Find the volume of a cuboidal stone slab of length 3m, breadth 2m and thickness 25cm.
Solution
Here, l = 3m, b = 2m and h = 25/100 cm =1/4m
Note that here we have thickness as the third dimension in place of height) So, required
163
volume = l×b×h
= 3×2×1/4m
= 1.5𝑐𝑚3
1. The base area of a rectangular box is 30𝑐𝑚2 and its height is 3 𝑐𝑚2 . Find its volume
2. The volume of a key soap box which is 4cm long and 6cm high is 72𝑐𝑚3 . Find its breadth
3. Find the volume of a rectangular block 15 cm long, 5cm wide and 10cm long.
4. A cuboid of base 12.5cm by 20cm holds exactly 1 litre of water. What is the height of the
cuboid? Note that 1000cm3= 1 litre
5. The area of a rectangular floor is 13.5𝑚2 . One side is 1.5m longer than the other. Calculate
the dimensions of the floor.
Suggested answers
1. 90𝑐𝑚3
2. 3 cm
3. 750 𝑐𝑚3
4. 4cm
5. 3 m × 4.5 m
Sphere
A sphere is a perfectly round geometrical object that is three dimensional, with every point on its
surface equidistant from its center. Many commonly-used object such as balls, globes are spheres.
• Volume
164
It takes two cones of water to fill a sphere. But we know the volume of a cylinder is the area of
the base multiplied be the height.
𝑣𝑜𝑙𝑢𝑚𝑒 𝑜𝑓 𝑎 𝑐𝑦𝑙𝑖𝑛𝑑𝑒𝑟 = 𝜋𝑟 2 ℎ
But it takes three times the volume of the cone to get the volume of a cylinder. This is given as;
𝑣𝑜𝑙𝑢𝑚𝑒 𝑜𝑓 𝑎 𝑐𝑜𝑛𝑒 = 𝑣𝑜𝑙𝑢𝑚𝑒 𝑜𝑓 𝑐𝑦𝑙𝑖𝑛𝑑𝑒𝑟/3
𝜋𝑟 2 ℎ
= 3
1
= 3 𝜋𝑟 2 ℎ
So, since it takes two cones to fill the sphere, the height of the cone is equal to the height of the
corresponding cone which is also equal to the height of the sphere. So, adding the volume of the
two cones will be equal to the volume of the sphere. That is
2𝜋𝑟 2 ℎ
= 3
The r + r = height and the height in a sphere is also equal to the diameter
Which is r + r = diameter
2r = diameter
Since the diameter (2r) is equal to the height, substitute 2r in place of h
2𝜋𝑟 2 ℎ
= sphere = but h=2r
3
2𝜋𝑟 2 (2𝑟)
= 3
4𝜋𝑟 3
= 3
4𝜋𝑟 3
Therefore, the volume of a sphere = 3
= 2𝜋𝑟ℎ
But the cylinder has height equal to that of the sphere. Therefore, the height of the sphere is equal
to the diameter of the sphere. So, the height is equal to the sum of two radius which is
mathematically represented by height
h= r+r
h = 2r
Now substituting h= 2r into the formula above is;
= 2𝜋𝑟(2𝑟)
= 2𝜋𝑟 × 2𝑟
= 4πr 2
Example 1
Find the curved surface area of a solid hemisphere that has a total surface of 675𝜋 𝑐𝑚2 .
Solution
From the question, the total surface area of that hemisphere is 675𝜋 𝑐𝑚2
= 𝑟 2 = 225𝑐𝑚2 .
= 1413𝑐𝑚2
166
1. Find the area available if a hollow sphere in which a circus motorist performs his stunt
has an inner diameter of 14m.
2. A solid circular cylinder has radius 14cm and height 8cm. finds its;
a) Curved surface area
b) Total surface area
3. The total surface area of a solid circular cylinder is 1540𝑐𝑚2 . If the height if five times
the radius of the base, then find the height of the cylinder.
4. Calculate the surface area of a sphere of radius 7 cm
5. A hollow sphere was completely filled with 1 litre of oil. Calculate the internal radius of
the sphere in cm
Suggested Answers
4. 616 𝑐𝑚2
5. 6.2 cm (2 significant figures)
Chapter 20 Pyramid
A pyramids are solids whose base is a polygon and the faces are triangles. There are three important
parts in any pyramid namely; base, face and apex. The base of a pyramid may be of any shape.
The faces of the pyramid are mostly isosceles triangles. All the triangular faces meet at a single
point called the apex.
There are different types of pyramid which are named using their base shape of the pyramid. Some
of these pyramids are below;
3 triangle
Triangular pyramid
4 square
Square pyramid
5 pentagon
167
Pentagonal pyramid
6 hexagon
Hexagonal pyramid
7 heptagon
Heptagonal pyramid
8 octagon
Octagonal pyramid
9 nonagon
Nonagon pyramid
10 decagon
Decagonal pyramid
Then after finding the area of a triangle, you multiply the result by the number of lateral
triangular faces that is represented by a variable (n). So, area of lateral triangular face of the
pyramid is equal to the area of one triangular face multiplied by the number of lateral faces
represented by n. Representing the formula mathematically is,
Area of lateral triangular faces of n pyramid = area of one triangular face x {n} lateral
faces
1
Where {n} is the number of lateral faces = 𝑏ℎ × {𝑛}
2
1
= 𝑏ℎ𝑛
2
168
Therefore, the total surface area of a pyramid is equal to the sum of area of lateral triangular
faces and the area of the pyramid base.
Total surface area of a pyramid = area of the lateral triangular faces + area of pyramid base
Volume of A pyramid
Pyramids can be compared with prisms with the same base and height. But it takes three times
the number of a pyramid to get the volume of a prism. So, if 3 pyramid = 1 prism
1
Then 1 pyramid = 𝑜𝑓 𝑎 𝑝𝑟𝑖𝑠𝑚
3
1
So, the volume of a pyramid is equal to the product of of the area of base of the pyramid and
3
the height.
Therefore,
1
Volume of a pyramid = x area of the base x height
3
Example 1
The height of a pyramid on a square base is 15cm. if the length of a side of the base is 5cm, find
the volume of the pyramid.
Solution
1
Volume of a pyramid = x area of the base x height
3
169
2. The height of a right pyramid on an equilateral triangular base is 7.8cm. If the length of a
side of the base is 18cm, find
a) Volume
b) Surface area
3. A right pyramid is on a square base of side 4 cm. the slanting edge of the pyramid is 3.5
cm. Calculate the volume of the pyramid.
4. The height of a pyramid on a square base is 15 cm. If the volume is 80 cm3, find the
length of the sides of the base.
Suggested answers
3. 56 𝑐𝑚3
4.. 4 cm
Summary
In the section 5 of this unit, we learnt that;
• A cuboid is a three-dimensional object with six faces and its opposite faces being equal.
• The volume of a cuboid is 𝑙𝑒𝑛𝑔ℎ𝑡 × 𝑤𝑖𝑑𝑡ℎ × ℎ𝑒𝑖𝑔ℎ𝑡 =LWH and its total surface area is
2 (𝑙𝑏 + 𝑙ℎ + 𝑏ℎ)
• A cylinder is a solid figure that has a curved surface and two parallel and congruent
bases.
• The volume of a cylinder is 2𝜋𝑟 2 ℎ, 𝑤ℎ𝑖𝑙𝑒 𝑡ℎ𝑒 Curved Surface Area is 2𝜋𝑟ℎ and the Total
Surface Area is 2𝜋𝑟 (𝑟 +ℎ)
1
• The volume of a cone is given as 3𝜋𝑟 2 ℎ, its curved surface area=𝜋𝑟𝑙 while the total
surface area is also given as 𝜋𝑟 2 + 𝜋𝑟𝑙
• To find the area of the face of a cube, multiply the length by the other length
• The volume of a cube is also computed as L× L × L
• To calculate for the total Surface Area of the cube, dd up the area of all the six square
faces =𝐿2 +𝐿2 +𝐿2 +𝐿2 +𝐿2 +𝐿2 = 6L2
• A sphere is a perfectly round geometrical object that is three dimensional, with every
point on its surface equidistant from its center.
1
• Volume of a pyramid = 3 x area of the base x height
4𝜋𝑟 3
• The volume of a sphere is , its curved surface area is = 4πr 2
3
• Pyramids are solids whose bases are polygons and the faces are triangles
• Area of lateral triangular faces of n pyramid = area of one triangular face x {n}
170
1 1
lateral faces = 𝑏ℎ × {𝑛} = 𝑏ℎ𝑛
2 2
171
SECTION 6: MATHEMATICAL VOCABULARY ASSOCIATED WITH
MENSURATION
Introduction
Good job! You have finally made it to the last section of unit 4. We want to conclude the entire
unit in this section by looking at some of the mathematical vocabulary that are key to the unit on
mensuration. Do well to go through all of these terms with us.
Section objectives
By completing this unit, you will be able to:
Mensuration: It is a branch of mathematics which deals with the lengths of lines, areas of
surfaces and volumes of solids.
Plane Mensuration: It deals with the sides, perimeters and areas of plane figures of different
shapes.
Solid Mensuration: It deals with the areas and volumes of solid objects.
Rectangle: A rectangle is a plane; whose opposite sides are equal and diagonals are equal. Each
angle is equal to 90˚
Square: A square is a plane figure bounded by four equal sides having all its angle as right
angles
Solid: Bodies which have three dimensions in space are called solid. For example, a block of
wood. A body, which has the three dimensions length, breadth and height, is a solid, whereas a
body with only two dimensions (length and breadth) is not a solid.
Prism: A prism is a solid, bounded by plane faces of which two opposite sides known as bases
are parallel and congruent polygons.
Base: The congruent and parallel faces of a prism are called its bases. The other faces of a prism
can be either oblique to the faces or perpendicular to them.
Right prism: A right prism is a prism in which lateral sides are rectangular or perpendicular to
their bases.
Lateral faces: The side faces of a prism are called its lateral faces.
Lateral surface area: The sum area of all the lateral faces of a prism is called its lateral surface
area.
172
Pyramid: A solid of triangular lateral sides having a common vertex and plane rectilinear bases
with equal sides is called pyramid.
Height of the pyramid: The length of the perpendicular drawn from the vertex of a pyramid to
its base is called the height of the pyramid. The side faces of pyramid form its lateral surface.
Regular pyramid: If the base of a pyramid is a regular figure i.e., a polygon with all sides equal
and all angles equal, then it is called a regular pyramid.
Right pyramid: If the foot of the perpendicular from the vertex of a pyramid to its base is the
centre of the base then it is called a right pyramid.
Slant height of a regular right pyramid: The slant height of a regular right pyramid is the
length of the line segment joining the vertex to the mid-point of one of the sides of the base.
Tetrahedron: When the base of a right pyramid is a triangle, then it is called a tetrahedron.
Regular tetrahedron: A right pyramid with equilateral triangle as its base is called a regular
tetrahedron.
Activity 2
Make a sketch to represent each of the terminologies learnt.
Congratulations for going through the activity with us.
Suggested answer
Refer to the lesson on unit 4 section 6
173
Summary
In brief, we have studied in this section of unit 4 of the course, that the concept of mensuration is
imbedded with lot of terminologies which we need to have at finger tips as student teachers.
Knowing these terminologies helps us to be able to pose and also apply them in solving real life
problems that involve the application of mensuration in teaching real life phenomena. We hope
after painstakingly going through the entire unit with us, you have now come to agree with us that
indeed the concept of mensuration is not meant for elective mathematics students only because of
its applicability in our daily life activities. You are encouraged once again to go through the
remaining units for more interesting concepts of this course.
REFERENCES
Allen, H. (1977). The Teaching of Trigonometry in the United States and Canada: A
Consideration of Elementary Course Content and Approach and Factors Influencing
Change. Thesis: Rutgers University-Graduate School of Education. Vol. 1.
Amihere, K.A. (2017). Mathematics Made Easy. Winneba; UEW Printing Press.
Aseidu, P. (2017). Aki-Ola Series Mathematics for Junior high Schools. (6th ed.). Accra: Aki-Ola
Publications
Ball, W. & Rouse, W. (1960). A Short Account of the History of Mathematics. New York:
Dover Publications, Inc.
Bell, E. T. (1965). Men of Mathematics. New York: Simon and Schuster, Inc.
Blitzer, R. (2004). Precalculus. 2nd Edition. Upper Saddle River: Prentice Hall.
Boyer, C. B. (1991). A History of Mathematics. Toronto:
Hogben, L. (1960). Mathematics for the Million, New York: W.W. Norton & Company Inc.,
John Wiley and Sons, Inc
Otoo, D. & Aboagye, S. (2019). Concise Core Mathematics for Senior High Schools. Sunyani:
Goyal Brothers Prakshan
174
UNIT 5 GLOBAL MATHEMATICS AND INTRODUCTORY STATISTICS (PATTERNS
IN DATA)
1. The earth as a sphere and measuring latitudes
2. Measuring longitudes and solving problems
3. Teaching ways of collection of data
4. Teaching measures of central tendencies and locations
5. Teaching measures of dispersion
6. Teaching graphical representation (cumulative frequency)
UNIT INTRODUCTION
Dear Learners,
In this unit, we are going to learn the basic concepts in global mathematics and introductory
statistics. Therefore, our learning indicators are:
1. the earth as a sphere and measuring latitudes
2. measuring longitudes and solving problems
3. teaching ways of collection of data
4. teaching measures of central tendencies and locations
5. teaching measures of dispersion
6. teaching graphical representation (cumulative frequency).
Take your resources such as ruler, pencil, calculator and paper as we explore these concepts.
175
UNIT 5 SECTION 1 THE EARTH AS A SPHERE
INTRODUCTION
In bearing, we learned that it is a clockwise angular movement between two distance places. And
the three rules to using bearings taking reading in bearing starts from the North Pole in clockwise
direction and ends also at the North pole (i.e., North – North Pole reading), all angles formed while
taking reading in bearing is equal to 360 degrees, and all questions in bearing leads into the
formation of a triangle. In this section, our learning indicators are:
• Explain the concept of the earth as a sphere and the great circle.
• Explain the concept of the latitude and measure distances.
176
When we measure distances that a boat or aircraft travels between any two places on the Earth, we
do not use straight line distances, since we need to go around the curve of the Earth from one place
to another. This is going to be a lot less than the distance a plane flies around the surface of the
Earth.
A great circle is defined as any circle drawn on a globe (or another sphere) with a centre that
includes the centre of the globe. Thus, a great circle divides the globe into two equal halves. Since
they must follow the circumference of the Earth to divide it, great circles are about
40,000 km (24,854 miles) in length along meridians. At the equator, great circle is a little bit longer
as the Earth is not a perfect sphere.
In addition, great circles represent the shortest distance between two points anywhere on the Earth's
surface. Because of this, great circles have been important in navigation for hundreds of years but
their presence was discovered by ancient mathematicians.
Also, great circles are easily based on the lines of latitude and longitude. Each line of longitude,
or meridian, is the same length and represents half of a great circle. This is because each meridian
has a corresponding line on the opposite side of the Earth. When combined, they cut the globe into
equal halves, representing a great circle. For example, the Prime Meridian at 0° is half of a great
circle. On the opposite side of the globe is the International Date Line at 180°. It too represents
half of a great circle. When the two are combined, they create a full great circle which cuts the
Earth into equal halves.
Furthermore, the only line of latitude, or parallel, characterized as a great circle is the equator
because it passes through the exact centre of the Earth and divides it in half. Lines of latitude north
and south of the equator are not great circles because their length decreases as they move toward
the poles and they do not pass through Earth's centre. As such, these parallel lines are considered
small circles.
The commonest use of great circles is for navigation because they represent the shortest distance
between two points on a sphere. Due to the earth’s rotation, sailors and pilots using great circle
routes must constantly adjust their route as the heading changes over long distances. The only
places on Earth where the heading does not change is on the equator or when traveling due north
or south. Because of these adjustments, great circle routes are broken up into shorter lines, which
cross all meridians at the same angle.
Latitude is a measurement on a globe or map of location north or south of the Equator. Technically,
there are different kinds of latitude—geocentric, astronomical, and geographic (or geodetic)—but
177
there are only minor differences between them. In most common references, geocentric latitude is
implied.
Given in degrees, minutes, and seconds, geocentric latitude is the arc subtended by an angle at
Earth’s centre and measured in a north-south plane from the Equator. Thus, a point at 3015'20' ' N
subtends an angle of 3015'20' ' at the centre of the globe.
The arc between the Equator and either geographic pole is 90° (one-fourth the circumference of
1
the Earth, or 360 ), and thus the greatest possible latitudes are 90N and 90S . As aids to
4
indicate different latitudinal positions on maps or globes, equidistant circles are plotted and drawn
parallel to the Equator and each other, known as parallels of latitude.
Geographic latitude, which is the kind used in mapping, is calculated using a slightly different
process. Because Earth is not a perfect sphere (i.e. the planet’s curvature is flatter at the poles),
geographic latitude is the arc subtended by the equatorial plane and the normal line that can be
drawn at a given point on Earth’s surface. The normal line is perpendicular to a tangent line
touching Earth’s curvature at that point on the surface.
Different methods are used to determine geographic latitude, namely by taking angle-sights on
certain polar stars or by measuring with a sextant the angle of the noon Sun above the horizon.
The length of a degree of arc of latitude is approximately 111 km (69 miles) , varying, because of
the non-uniformity of Earth’s curvature, from 110.567 km (68.706 miles) at the Equator to
111.699 km (69.41 miles) at the poles.
o Measures the position of a point on the earth's surface in terms of the angular
distance between the equator and the poles.
o Indicates how far north or south of the equator a particular point is situated.
o North latitude: all points north of the equator in the northern hemisphere
o South latitude: all points south of the equator in the southern hemisphere
• Latitude is measured in angular degrees from 0° at the equator to 90° at either of the poles.
o A point in the northern hemisphere 40 degrees north of the equator is labeled Lat.
40° N.
o Forty degrees south of the equator, the label changes to Lat. 40° S.
• The north or south measurement of latitude is actually measured along the meridian which
passes through that location
178
o It is known as an arc of the meridian.
Example 1
A. Prime Meridian
B. Parallel of Latitude
C. Great Circle
D. Both 1 and 2
E. Both 2 and 3
179
A. They always meet at the poles
Parallels of latitude decrease in length with increasing latitude. Mathematically, the length of
parallel at latitude x is cos(x) (length of equator) .
The length of each degree is obtained by dividing the length of that parallel by 360°. For instance,
the cosine of 60° is 0.5. So the length of the parallel at that latitude is one half the length of the
equator.
Since the variation in lengths of degrees of latitude varies by only 1.13 km (0.7 miles) , the standard
length of 111.325 km (69.172 miles) is used. For instance, anywhere on the earth, the length
represented by 3° of latitude is 3111.325 km or 333.975 km .
When either latitude or longitude are expressed in degrees, minutes and seconds, the number of
degrees is followed by the symbol º , the number of minutes is followed by the symbol ´ , and the
number of seconds is followed by the symbol ´´. Thus, 3244'10' ' means 32 degrees, 44 minutes
and 10 seconds. (Since there are 60 minutes in a degree and 60 seconds in a minute, neither the
number of minutes nor the number of seconds can ever exceed 60.)
Therefore, it is easy to convert latitude or longitude expressed in degrees, minutes and seconds to
1 1 1
decimal form since 1 min ute = deg rees and 1 sec ond = deg rees
60 60 60
Example 2
Solution
180
1 1 1
4524'30' ' = 45 + 24 + 30 = 45.40833
60 60 60
Solution
1 1 1
2254'17' ' = 22 + 54 + 17 = 22.904722
60 60 60
Solution
Find the number of minutes in the remaining 0.40833333 by dividing by 1/60. i.e.
1
45.40833 = 45.40833 60 = 24 , with 0.4999998 of a minute remaining.
60
Find the number of seconds in the remaining 0.4999998minutes by dividing by 1/60. i.e.
0.4999998= 0.4999998 1 / 60 = 29.999988, or 30 seconds.
This conversion missed by a slight amount on the exact seconds because 455.4083333 was an
approximation of the original 4524'30' ' .
Activity 5.1
1. Describe the relationship between the Cartesian coordinate system and the geographic grid.
3. Explain why the length of a degree of longitude decreases as one approaches the poles.
Summary
181
• The Earth is very close to a sphere (ball) shape, with an average radius of 6,371 km .
• A great circle is defined as any circle drawn on a globe (or another sphere) with a centre
that includes the centre of the globe.
• Latitude is a measurement on a globe or map of location north or south of the Equator.
• Different methods are used to determine geographic latitude, namely by taking angle-sights
on certain polar stars or by measuring with a sextant the angle of the noon Sun above the
horizon.
• The length of a degree of arc of latitude is approximately 111 km (69 miles) , varying,
because of the non-uniformity of Earth’s curvature, from 110.567 km (68.706 miles) at the
Equator to 111.699 km (69.41 miles) at the poles.
• Geographic latitude can also be measured in degrees, minutes, and seconds.
• The length of each degree is obtained by dividing the length of that parallel by 360°.
• It is easy to convert longitude or latitude expressed in degrees, minutes and seconds to
1 1 1
decimal form since 1 min ute = deg rees and 1 sec ond = deg rees
60 60 60
182
INTRODUCTION
In the previous section, we learned that the Earth is very close to a sphere, with an average radius
of 6,371 km , the great circle is any circle drawn on a globe, latitude is a measurement on a globe
1
or map of location north or south of the Equator, and 1 min ute = deg rees or
60
1 1
1 sec ond = deg rees .
60 60
In this section, our learning indicators:
• Explain the concept of the longitude
• Discuss how to measure longitudes and distance.
Measured also in degrees, minutes, and seconds, longitude is the amount of arc created by drawing
first a line from the Earth’s centre to the intersection of the Equator and the prime meridian and
183
then another line from the Earth’s centre to any point elsewhere on the Equator. Longitude is
measured 180 both east and west of the prime meridian.
As aids to locate longitudinal positions on a globe or map, meridians are plotted and drawn from
pole to pole where they meet. The distance per degree of longitude at the Equator is about
111.32 km (69.18 miles) and at the poles, 0.
The longitude measures the position of a point on the earth’s surface east or west from a specific
meridian, called the prime meridian. The following are some other characteristics:
• The longitude of a place is the arc, measured in degrees along a parallel of latitude from
the prime meridian.
• The most widely accepted prime meridian is based on the Bureau International de l'Heure
(BIH) Zero Meridian:
o Defined by the longitudes of many BIH stations around the world.
o Passes through the old Royal Observatory in Greenwich, London, England.
o The prime meridian has the angular designation of 0° longitude.
o All other points are measured with respect to their position east or west.
o Longitude ranges from 0° to 180°, either east or west.
• Since the placement of a prime meridian is arbitrary, countries often use their own.
o For the purposes of measurement, no one prime meridian is better than another
o Widely accepted meridian allows comparison between maps in different areas.
• The distance represented by a degree of longitude varies upon where it is measured.
o The length of a degree of longitude along a meridian is not constant because of polar
flattening.
o At the equator, the approximate length is determined by dividing the earth’s circumference
(24,900 miles) by 360 degrees: 111.05 kilometers (69 miles).
o The meridians converge at the poles, and the distance represented by one degree decreases.
o At 60° N latitude, one degree of longitude is equal to about 55.52 kilometers (34.5 miles).
o The Radius (R) of the Earth is 6400 km
184
Because the earth is not a perfect sphere, the equatorial circumference does not equal that of the
meridians.
On a perfect sphere, each meridian of longitude equals one-half the circumference of the sphere.
Measurement along meridians of longitude accounts for the earth’s polar flattening:
Since a difference in latitude or longitude of one second is quite small (less than 1/50 of a mile),
then longitudes and latitudes of geographical locations like cities are usually expressed only in
degrees and minutes, not seconds.
Example 1
These lengths are based on an ellipsoid and are similar to the lengths computed with the spherical
formula.
When calculating distances over large areas, the authalic sphere can be used as a reference surface.
The shortest distance between two points on a sphere is the arc on the surface directly above the
true straight line.
The difference between the sphere and ellipsoid is important when working with large areas.
185
At a scale of 1 : 40,000,000 , a 23 km error in distance would equal a pen line (0.5 mm) on paper.
Example 2
1. Two towns P(42W , 30 N ) and Q(18E, 30 N ) are on the surface of the earth. Find the time
it will take a pilot flying at 600kmh −1 to travel from P to Q.
Solution
18°
42° E
W
30°
N
Q
Q
P
186
1
The radius of the latitudes is r = R cos or r = 6400cos30 , where = .
2
60
The distance between P and Q is PQ = 2r = 2 (6400 cos 30)
360 360
60
Therefore, PQ = 2 (6400 cos 30) = 5,804.91
360
The points P(40 N , 22E ) and Q(40 N , 48W ) are on the earth surface. Assuming the earth is a
sphere of 6,400 km and = 3.142 , calculate:
Solution
22°
40° E
W
40°
N
P
Q
The
T 187
α
r r
O
If r and R are the radii of the small and great circles respectively, then:
70
PQ = 2 3.142 4,902.4 = 5,990.1881 km
360
Activity 5.2
1. Two towns P(45W , 30 N ) and Q(25E, 30 N ) are on the surface of the earth. Find the time
it will take a pilot flying at 500kmh −1 to travel from P to Q.
2. Two towns P(48W , 60 N ) and Q(22E, 60 N ) are on the surface of the earth. Find the time
it will take a pilot flying at 700kmh −1 to travel from P to Q.
3. The points P(50 N , 25E ) and Q(50 N , 48W ) are on the earth surface. Assuming the earth is
a sphere of 6,400 km and = 3.142 , calculate:
4. The points P(30 N , 28E ) and Q(30 N , 52W ) are on the earth surface. Assuming the earth is
a sphere of 6,400 km and = 3.142 , calculate:
188
i. the radius of the circle of latitude through P and Q
Summary
• The longitude of a place is the arc, measured in degrees along a parallel of latitude from
the prime meridian.
• Since the prime meridian is arbitrary, other countries have often used their own.
• The distance represented by a degree of longitude varies upon where it is measured.
• Because the earth is not a perfect sphere, the equatorial circumference does not equal that
of the meridians.
• The length of each degree is equal to the circumference divided by 360.
• Measurement along meridians of longitude accounts for the earth’s polar flattening.
• Degree lengths along meridians are not constant:
• Mathematically, Length of a degree of longitude is cos(latitude) 111.325 km .
• If r and R are the radii of the small and great circles respectively, then r = R cos , and
R = 6,400 km .
• The distance, |PQ| between the points, P and Q is given by PQ = 2r .
360
Dis tan ce
• The time taken for a moving object on the earth surface is .
Speed
189
INTRODUCTION
190
Dear Students,
In Unit 5 Section 2, we learned that the longitude of a place is the arc, measured in degrees along
a parallel of latitude from the prime meridian and measurement along meridians of longitude
accounts for the earth’s polar flattening. In this section, we are going to delve into another area of
mathematics called ‘Statistics’. One school of thought regards ‘Statistics’ as numerical information
expressed in quantitative terms. This information may relate to objects, subjects, activities,
phenomena, or regions of space.
As a matter of fact, data have no limits as to their reference, coverage, and scope. At the national
and international level, these data share the contribution of agriculture, manufacturing, and
services in Gross Domestic Product (GDP). At local level, individuals, firms (small or large) and
institutions, produce extensive statistics on their operations. The annual reports of all institutions
contain variety of data on sales, production, expenditure, inventories, capital employed, and other
activities. These data are often field data, collected by employing scientific survey techniques and
regularly updated.
On another hand, a student knows statistics more intimately as a subject of study just economics,
mathematics, chemistry, physics, and others. To this extent, ‘Statistics’ is a discipline, which
scientifically deals with data, and is often described as the science of data. In both cases, statistics
has developed appropriate methods of collecting, presenting, summarizing, and analysing data.
191
The word ‘statistics’ is used in two senses--- plural and singular. In the plural sense, it refers to a
set of figures or data. In the singular sense, statistics refers to the whole body of tools that are used
to collect data, organise and interpret data and, draw conclusions from the data.
Statistics is a collection of methods for collecting, displaying, analyzing, and drawing conclusions
from data.
Statistics is a very broad subject, with applications in a vast number of different fields. In generally
one can say that statistics is the methodology for collecting, analyzing, interpreting and drawing
conclusions from information.
It should be noted that both aspects of statistics are important if the quantitative data are to serve
their purpose. If statistics, as a subject, is inadequate and consists of poor methodology, we could
not know the right procedure to extract from the data and the information they contain. Similarly,
if our data are defective or that they are inadequate or inaccurate, we could not reach the right
conclusions even though our subject is well developed.
Example 1
Statistics include numerical facts and figures:
a. The largest earthquake was measured 9.2 on the Richter scale.
b. Men are at least 10 times more likely than women to commit murder.
c. One in every 8 Africans has ever been infected with malaria virus.
d. By the year 2020, there will be 15 people aged 65 and over for every new baby born.
3. King defines ‘statistics’ as the science of method of judging collective, natural or social
phenomena from the results obtained by the analysis or enumeration or collection of estimates.
4. Seligman defines ‘statistics’ as a science that deals with the methods of collecting, classifying,
presenting, comparing and interpreting numerical data collected to throw some light on any sphere
of enquiry.
192
5. Spiegal defines ‘statistics’ as a scientific method for collecting, organising, summa rising,
presenting and analyzing data as well as drawing valid conclusions and making reasonable
decisions on the basis of such analysis.
6. Prof. Horace Secrist defines ‘statistics’ as the aggregate of facts, affected to a marked extent by
multiplicity of causes, numerically expressed, enumerated or estimated according to reasonable
standards of accuracy, collected in a systematic manner for a pre-determined purpose, and placed
in relation to each other.
7. The science of statistics deals with the collection, analysis, interpretation, and presentation of
data. We see and use data in our everyday lives.
Example 2
Identify a major flaw with each interpretation in the following statistics.
a). A new advertisement for fan yoghurt and fan ice introduced last year late February resulted in
a 30% increase in yoghurt sales for the months. Thus, the advertisement was effective.
A major flaw is that yoghurt consumption generally increases in the months of March, April and
May regardless of advertisements. This effect is called a history effect and leads people to interpret
outcomes as the result of one variable when another variable (in this case, one having to do with
the passage of time) is actually responsible.
b) The more churches in a city, the more crime there is. Thus, churches lead to crime.
193
A major flaw is that both increased churches and increased crime rates can be explained by larger
populations. In bigger cities, there are both more churches and more crime. This problem refers to
the third-variable problem. Namely, a third variable can cause both situations; however, people
erroneously believe that there is a causal relationship between the two primary variables rather
than recognize that a third variable can cause both.
c). 75% more inter-tribal marriages are occurring this year than 25 years ago. Thus, our society
accepts inter-tribal marriages.
A major flaw is that we do not have the information that we need. What is the rate at which
marriages are occurring? Suppose only 1% of marriages 25 years ago were inter-tribal and so now
1.75% of marriages are inter-tribal (1.75 is 75% higher than 1). But this latter number is hardly
evidence suggesting the acceptability of inter-tribal marriages.
In addition, the statistic provided does not rule out the possibility that the number of inter-tribal
marriages has seen dramatic fluctuations over the years and this year is not the highest.
Again, there is simply not enough information to understand fully the impact of the statistics.
As a whole, these examples show that statistics are not only facts and figures; they are something
more than that. In the broadest sense, ‘statistics# refers to a range of techniques and procedures for
analyzing, interpreting, displaying, and making decisions based on data!
Activity 5.3
1. ‘The word ‘statistics’ is used in two senses--- plural and singular’. Explain this statement.
2. There are many ways of defining statistics. Which of the definitions is best to you and why?
2. Data may relate to an activity of our interest, a phenomenon, or a problem situation under study.
They derive as a result of the process of measuring, counting and/or observing.
3. Statistical data, therefore, refer to those aspects of a problem situation that can be measured,
quantified, counted, or classified.
4. Any object subject, phenomenon, or activity that generates data through this process is termed
as a variable. In other words, a variable is one that shows a degree of variability when successive
measurements are recorded.
194
5. “Data” refers to the information that has been collected from an experiment, a survey, or an
historical record (i.e. “data” is plural and ‘datum’ is singular)
b). The types of data
In statistics, data are classified into two broad categories, namely quantitative data and qualitative
data. This classification is based on the kind of characteristics that are measured.
Quantitative data
Quantitative data are those that can be quantified in definite units of measurement. These refer to
characteristics whose successive measurements yield quantifiable observations. Depending on the
nature of the variable observed for measurement, quantitative data can be further categorized as
continuous and discrete data.
Quantitative data are numerical measurements that mostly arise from a natural numerical scale.
Continuous data
Continuous data represent the numerical values of a continuous variable. A continuous variable is
the one that can assume any value between any two points on a line segment, thus representing an
interval of values.
The values are quite precise and close to each other, yet distinguishably different. All
characteristics such as time, distance, weight, length, height, thickness, velocity, temperature,
tensile strength, and class scores represent continuous variables. Thus, the data recorded on these
and similar other characteristics are called continuous data.
It may be noted that a continuous variable assumes the finest unit of measurement. Finest in the
sense that it enables measurements to the maximum degree of precision.
Discrete data
Discrete data are the values assumed by a discrete variable. A discrete variable is the one whose
outcomes are measured in fixed numbers. Such data are essentially count data.
These are derived from a process of counting, such as the number of items possessing or not
possessing a certain characteristic. Common examples are the number of customers visiting a
departmental store everyday, the incoming flights at an airport, and the defective items in a
consignment received for sale, the number of students in a classroom and the performance rates.
Qualitative data
Qualitative data refer to qualitative characteristics of a subject or an object. A characteristic is
qualitative in nature when its observations are defined and noted in terms of the presence or
absence of a certain attribute in discrete numbers.
195
Qualitative data are measurements for which there is no natural numerical scale, but which consist
of attributes, labels, or other non-numerical characteristics.
Nominal data
Nominal data are the outcome of classification into two or more categories of items or units
comprising a sample or a population according to some quality characteristic.
Classifications of students according to sex (as males and females), of workers according to skill
(as skilled, semi-skilled, and unskilled), of objects according to colours (white, black, and yellow),
of employees according to the level of education (as diploma, first degree, and masters), of lecture
halls according to room numbers, of jerseys according to positions, of mobile numbers, and so on.
Given any such basis of classification, it is always possible to assign each item to a particular class
and make a summation of items belonging to each class. The count data so obtained are called
nominal data.
Rank data
Rank data, on the other hand, are the result of assigning ranks to specify order in terms of the
integers 1,2,3, ..., n. Ranks may be assigned according to the level of performance in a test (first,
second, third, etc.), a contest, a competition, an interview, or a show.
The candidates appearing in an interview may be assigned ranks in integers ranging from I to n,
depending on their performance in the interview. Ranks so assigned can be viewed as the
continuous values of a variable involving performance as the quality characteristic.
Example
1. Identify the following measures as either quantitative or qualitative:
i. The 30 high-temperature readings of the last 30 days.
ii. The scores of 40 students on a Statistics test.
iii. The blood types of 120 teachers in a basic school.
iv. The last four digits of social security numbers of all teachers in a class.
v. The numbers on the jerseys of 53 football players on a football team.
vi. The genders of the first 40 newborns in the Winneba Trauma hospital in one year.
vii. The natural hair colour of 20 randomly selected female teachers.
viii. The ages of 20 randomly selected students.
ix. The fuel economy in miles per gallon of 20 new cars purchased last month.
x. The political affiliation of 500 randomly selected voters.
Solution
196
i. Quantitative.
ii. Quantitative.
iii. Quantitative.
iv. Qualitative
v. Qualitative
vi. Qualitative.
vii. Qualitative.
viii. Quantitative.
ix. Quantitative.
x. Qualitative.
2. Classify the following variables as qualitative and quantitative and as discrete or continuous:
e). A survey conducted with 1,015 people indicates that 40 of them are subscribers to a broadband
Internet service.
f). The electronic indicates that the player radar last snapped ball 82,3mi / h
g). The time spent for a person to make a Takoradi drive to Tarkwa is approximately 2: 40h at an
average speed of 100km/h
Solution
197
a. Quantitative discrete
b. qualitative
c. quantitative continuous
d. quantitative continuous
e. quantitative discrete
f. qualitative
g. quantitative continuous
h. qualitative
i. quantitative continuous
j. quantitative continuous
k. quantitative discrete
i). The indicated variables, which are quantitative and which are qualitative?
Solution
i). a. Quantitative (A), (D), (E), (F), (H)
ii. Are continuous quantitative variables (E), (F) and optionally (A); the variable Age is also
continuous, it can take any value in a range, although it is usually treated as discrete)
198
Data sources are mainly of two types, namely secondary and primary. However, the third source
(internet) is a hybrid of the two sources. These can be defined as under:
Secondary data
They already exist in some form: published or unpublished -in an identifiable secondary source.
They are, generally, available from published source(s), though not necessarily in the form actually
required.
Primary data
These data do not already exist in any form, and thus have to be collected for the first time from
the primary source(s). By their very nature, these data require fresh and first-time collection
covering the whole population or a sample drawn from it.
Internet data
These data exist in the internet and can be collected for statistical analysis. By their nature, they
neither classified as primary nor secondary. Common examples are wiki, google, yahoo, Facebook
and Whatsapp.
Types of Statistics
There are two major divisions of statistics, namely descriptive statistics and inferential statistics.
Descriptive statistics
1. Descriptive statistics, also known as deductive statistics, is the branch of statistics that involves
organizing, displaying, and describing data.
2. Descriptive statistics is the branch of statistics that involves organizing, displaying, and
describing data.
3. Descriptive Statistics: deals with procedures used to summarize the information contained in a
set of measurements.
4. The term descriptive statistics deals with collecting, summarizing, and simplifying data, which
are otherwise quite unwieldy and voluminous. It seeks to achieve this in a manner that meaningful
conclusions can be readily drawn from the data.
5. Descriptive statistics is the branch of statistics concerned with describing and summarizing data,
and a set of statistics such as the mean, standard deviation, and skew that describes a distribution.
6. Descriptive statistics may be seen as comprising methods of bringing out and highlighting the
latent characteristics present in a set of numerical data. It not only facilitates an understanding of
the data and systematic reporting thereof in a manner; and also makes them amenable to further
discussion, analysis, and interpretations.
199
7. Descriptive statistics are numbers that are used to summarize and describe data. For example, if
we are analyzing birth certificates, a descriptive statistic might be the percentage of certificates
issued in Ghana, or the average age of the mothers. Any other number we choose to compute also
counts as a descriptive statistic for the data from which the statistic is computed. Several
descriptive statistics are often used at one time to give a full picture of the data.
8. Descriptive statistics are just descriptive. They do not involve generalizing beyond the data at
hand. Generalizing from our data to another set of cases is the business of inferential statistics.
Examples of descriptive statistics include measures of central tendency, dispersion, skewness, and
kurtosis.
9. Organizing and summarizing data is called descriptive statistics. Two ways to summarize data
are by graphing and by using numbers (e.g. finding an average).
Inferential statistics
1. Inferential statistics, also known as inductive statistics, goes beyond collecting, summarizing,
and meaningfully presenting the related data. Instead, it consists of methods that are used for
drawing inferences, or making broad generalizations, about a totality of observations on the basis
of knowledge about a part of that totality.
2. Inferential statistics is the branch of statistics that involves drawing conclusions about a
population based on information contained in a sample taken from that population.
3. Inferential statistics obtains a particular value from the sample information and uses it for
drawing an inference about the entire population.
4. Inferential statistics helps to evaluate the risks involved in reaching inferences or generalizations
about an unknown population on the basis of sample information. For example, an inspection of a
sample of five battery cells drawn from a given lot may reveal that all the five cells are in perfectly
good condition. This information may be used to conclude that the entire lot is good enough to buy
or not.
200
In a different situation, one may have to repeat this exercise for some future year and use the quick
estimate of average body weight for a comparison. This may be needed to decide whether the
weight of the students has undergone a significant change over the years compared.
Since inference is based on the examination of a sample of limited number of cells, it is equally
likely that all the cells in the lot are not in order. It is also possible that all the items that may be
included in the sample are unsatisfactory. This may be used to conclude that the entire lot is of
unsatisfactory quality, whereas the fact may indeed be otherwise. It may, thus, be noticed that there
is always a risk of an inference about a population being incorrect when based on the knowledge
of a limited sample.
The rescue in such situations lies in evaluating such risks. For these and other reasons, inferential
statistics provides the necessary methods. This requires an understanding of the what, why, and
how of probability and probability distributions to equip ourselves with methods of drawing
statistical inferences and estimating the degree of reliability of these inferences.
Population
A population is the complete set of observations a researcher is interested in. A population can be
defined in a manner convenient for a researcher. For example, one could define a population as all
girls in UEW, Winneba or the set of all girls in Level 100 students of UEW, Winneba.
A (statistical) population is the set of measurements (or record of some qualitative trait)
corresponding to the entire collection of units for which inferences are to be made.
Census
The desired information about a given population of our interest; may also be collected even by
observing all the units comprising the population. This total coverage is called census.
Getting the desired value for the population through census is not always feasible and practical for
various reasons. Apart from time and money considerations making the census operations
prohibitive, observing each individual unit of the population with reference to any data
characteristic may at times involve even destructive testing. In such cases, obviously, the only
recourse available is to employ the partial or incomplete information gathered through a sample
for the purpose.
Sample
201
A sample is a subset of a population, often taken for the purpose of statistical inference. Generally,
one uses a random sample.
A sample from statistical population is the set of measurements that are actually collected in the
course of an investigation.
The idea of sampling is to select a portion (or subset) of the larger population and study that portion
(the sample) to gain information about the population in details. Data are the result of sampling
from a population.
The totality of observations about which an inference may be drawn, or a generalization made, is
called a population or a universe. The part of totality, which is observed for data collection and
analysis to gain knowledge about the population, is called a sample.
Inferential statistics are computed from sample data in order to make inferences about the
population. Hence, to study the population, we select a sample.
Representative Sample
A representative sample is a sample chosen to match the qualities of the population from which it
is drawn. With a large sample size, random sampling will approximate a representative sample.
Usually, the stratified random sampling is used to make a small sample more representative.
The sample must contain the characteristics of the population in order to be a representative
sample. So, we must be interested in both the sample statistic and the population parameter in
inferential statistics. We use the sample statistic to test the validity of the established population
parameter.
In effect, a population is any specific collection of objects of interest. A sample2 is any subset or
subcollection of the population, including the case that the sample consists of the whole
population, in which case it is termed a census.
Parameter
Parameter is a value calculated in a population. For example, the mean of the numbers in a
population is a parameter. Compare with a statistic, which is a value computed in a sample to
estimate a parameter.
A parameter is a number that is a property of the population. Since we considered all mathematics
classes to be the population, then the average number of points earned per student over all the
mathematics classes is an example of a parameter.
202
A parameter is an unknown numerical summary of the population. A statistic is a known numerical
summary of the sample which can be used to make inference about parameters.
Statistic
Even though the term ‘statistics’ is a field of study concerned with summarizing data, interpreting
data, and making decisions based on data, there is a quantity, called ‘statistic’ calculated to estimate
a value in a population.
A statistic is a number that represents a property of the sample. For example, if we consider one
mathematics class to be a sample of the population of all mathematics classes, then the average
number of points earned by students in that one mathematics class at the end of the semester is a
statistic. The statistic is an estimate of a population parameter.
In effect, a parameter is a number that summarizes some aspect of the population as a whole, and
a statistic is a number computed from the sample data.
Example
1. Identify each of the following data sets as either a population or a sample:
a. The grade point averages (GPAs) of all students in a University.
b. The GPAs of a randomly selected group of students on North Campus of UEW, Winneba
c. The ages of the nine Supreme Court Justices of Ghana in 1999.
d. The gender of every second customer who enters a movie theatre.
e. The lengths of Keta boat undertaking fishing trip in the sea.
Solution
a. Population.
b. Sample.
c. Population.
d. Sample.
e. Sample.
2. A sociologist wishes to estimate the proportion of all adults in a certain region who have never
married. In a random sample of 1,320 adults, 145 have never married, hence 145 / 1320 = 0.11 or
about 11% have never married.
a. What is the population of interest?
b. What is the parameter of interest?
c. What is the statistic involved?
d. Based on this sample, do we know the proportion of all adults who have never married? Explain
fully.
203
Solution
a. All adults in the region.
b. The proportion of the adults in the region who have never married.
c. The proportion computed from the sample, 0.1.
d. No, not exactly, but we know the approximate value of the proportion.
Example
3. Determine what the key terms refer to in the following study.
We want to know the average (mean) amount of money first-year students spend on campus that
does not include school fees. We randomly survey 100 first year students at the college. Three of
those students spent ₵150, ₵200, and ₵225, respectively.
Solution
a. The population is all first year students on campus.
b. The sample could be all students enrolled in one section of a beginning statistics course
(although this sample may not represent the entire population).
c. The parameter is the average (mean) amount of money spent (excluding school fees) by first-
year students in this term.
d. The statistic is the average (mean) amount of money spent (excluding school fees) by first-year
students in the sample.
e. The data are the cedi amounts spent by the first-year students, namely ₵150, ₵200, and ₵225.
Solution
1. f; 2. g; 3. e; 4. d; 5. b; 6. c
204
5. Determine what the key terms refer to in the following study.
An insurance company would like to determine the proportion of all medical doctors who have
been involved in one or more malpractice lawsuits. The company selects 500 doctors at random
from a professional directory and determines the number in the sample who have been involved in
a malpractice lawsuit.
Solution
a. The population is all medical doctors listed in the professional directory.
b. The parameter is the proportion of medical doctors who have been involved in one or more
malpractice suits in the population.
c. The sample is the 500 doctors selected at random from the professional directory.
d. The statistic is the proportion of medical doctors who have been involved in one or more
malpractice suits in the sample.
e. The data are either: yes, was involved in one or more malpractice lawsuits, or no, was not.
Significance Level
In significance testing, the significance level is the highest value of a probability value for which
the null hypothesis is rejected. Common significance levels are 0.05 and 0.01. If the 0.05 level is
used, then the null hypothesis is rejected if the probability value is less than or equal to 0.05.
Significance Testing
Significance Testing is a statistical procedure that tests the viability of the null hypothesis. If data
(or more extreme data) are very unlikely given that the null hypothesis is true, then the null
hypothesis is rejected. If the data or more extreme data are not unlikely, then the null hypothesis
is not rejected.
If the null hypothesis is rejected, then the result of the test is said to be significant. A statistically
significant effect does not mean the effect is important.
Null Hypothesis
A null hypothesis is a hypothesis tested in significance testing. It is typically the hypothesis that a
parameter is zero or that a difference between parameters is zero. For example, the null hypothesis
might be that the difference between population means is zero. Experimenters typically design
experiments to allow the null hypothesis to be rejected.
Alternative Hypothesis
In hypothesis testing, the null hypothesis and an alternative hypothesis are put forward. If the data
are sufficiently strong to reject the null hypothesis, then the null hypothesis is rejected in favour
of an alternative hypothesis. For instance, if the null hypothesis is 1 = 2 then the alternative
hypothesis (for a two-tailed test) is 1 2 .
205
One Tailed
The last step in significance testing involves calculating the probability that a statistic would differ
as much or more from the parameter specified in the null hypothesis as does the statistics obtained
in the experiment.
A probability computed considering differences in only one direction, such as the statistic is larger
than the parameter, is called a one-tailed probability. For example, if a parameter is 0 and the
statistic is 12, a one-tailed probability (the positive tail) would be the probability of a statistic being
≥ to 12. Compare with the two-tailed probability which would be the probability of being either ≤
-12 or ≥12.
Two Tailed
The last step in significance testing involves calculating the probability that a statistic would differ
as much or more from the parameter specified in the null hypothesis as does the statistics obtained
in the experiment.
A probability computed considering differences in both direction (statistic either larger or smaller
than the parameter) is called two-tailed probability. For example, if a parameter is 0 and the statistic
is 12, a two-tailed probability would be the he probability of being either −12 or to 12 .
Compare with the one-tailed probability which would be the probability of a statistic being 12
if that were the direction specified in advance.
Type I Error
Type I Error in significance testing, is the error of rejecting a true null hypothesis.
Type II Error
Type II Error in significance testing, is the failure to reject a false null hypothesis.
The p-value is computed assuming the null hypothesis is true. The lower the probability value, the
stronger the evidence that the null hypothesis is false. Therefore, the null hypothesis is rejected if
the p-value is below 0.05.
The observed significance or p-value of a specific test of hypotheses is the probability, on the
supposition that H0 is true, of obtaining a result at least as contrary to H0 and in favour of Ha as the
result actually observed in the sample data.
206
The observed significance of a test of hypotheses is the area of the tail of the distribution cut off
by the test statistic (times two in the case of a two-tailed test).
In P-Value Approach;
• Identify the null and alternative hypotheses.
• Identify the relevant test statistic and its distribution.
• Compute from the data the value of the test statistic.
• Compute the p-value of the test.
• Compare the value computed in Step 4 to significance level α and make a decision: reject
H0 if p ≤ α and do not reject H0 if p > α.
• Formulate the decision in the context of the problem, if applicable.
Example
1. Make the decision in each test, based on the information provided.
a. Testing H0: μ = 82.9 vs. Ha: μ < 82.9 at α = 0.05, observed significance p = 0.038.
b. Testing H0: μ = 213.5 vs. Ha: μ ≠ 213.5 at α = 0.01, observed significance p = 0.038.
Solution
a. reject H0
b. do not reject H0
2. State the null and alternative hypotheses for each of the following situations (i.e., identify the
correct number μ0 and write H0: μ = μ0 and the appropriate analogous expression for Ha.)
a. The average July temperature in a region has been 74.5°F. Perhaps it is higher now.
b. The average weight of a female airline passenger with luggage was 145 pounds ten years ago.
The FDB believes it to be higher now.
c. The average stipend for doctoral students in a particular discipline in the University is ₵14,756.
The Dean of Graduate studies believes that the national average is higher.
d. The average room rate in hostels in a certain region is ₵82.53. A travel agent believes that the
average in a particular resort area is different.
e. The average farm size in a predominately rural state was 69.4 acres. The Minister of Agriculture
asserts that it is less today.
Solution
a. H0: μ = 74.5 vs. Ha: μ > 74.5
b. H0: μ = 145 vs. Ha: μ > 145
c. H0: μ = 14756 vs. Ha: μ > 14756
d. H0: μ = 82.53 vs. Ha: μ ≠ 82.53
e. H0: μ = 69.4 vs. Ha: μ < 69.4
207
3. Describe the two types of errors that can be made in a test of hypotheses.
Solution
A Type I error is made when a true H0 is rejected. A Type II error is made when
a false H0 is not rejected.
Activity
1. The intention was to make a study of the number of siblings of students in the JHS three of a
basic school. For this, a survey was carried out to which 60 students answered. Indicate:
2. The data are the number of machines in a gym. You sample five gyms. One gym has 12
machines, one gym has 15 machines, one gym has ten machines, one gym has 22 machines, and
the other gym has 20 machines. What type of data is this?
3. The data are the areas of lawns in square feet. You sample five houses. The areas of the lawns
are 144 sq. feet, 160 sq. feet, 190 sq. feet, 180 sq. feet, and 210 sq. feet. What type of data is this?
Summary
• Statistics is a study of data: describing properties of data (descriptive statistics) and drawing
conclusions about a population based on information in a sample (inferential statistics).
• Descriptive statistics make use of quantitative data (continuous and discrete), qualitative
(nominal and rank) and qualitative data.
• The distinction between a population together with its parameters and a sample together
with its statistics is a fundamental concept in inferential statistics.
• Information in a sample is used to make inferences about the population from which the
sample was drawn.
• Inferential statistics make use of significance level, significance testing, null hypothesis,
alternative hypothesis, one tailed, two tailed, type i error, type ii error and probability value
or p-value.
Scope of Statistics
These are mainly concerned with the following:
208
(i) It often becomes necessary to examine how two paired data sets are related. For example, we
may have data on the sales of a product and the expenditure incurred on its advertisement for a
specified number of years. Given that sales and advertisement expenditure are related to each other,
it is useful to examine the nature of relationship between the two and quantify the degree of that
relationship. As this requires use of appropriate statistical methods, these falls under the purview
of what we call regression and correlation analysis.
(ii) Situations occur quite often when we require averaging (or totalling) of data on prices and/or
quantities expressed in different units of measurement. For example, price of cloth may be quoted
per meter of length and that of wheat per kilogram of weight. Since ordinary methods of totalling
and averaging do not apply to such price/quantity data, special techniques needed for the purpose
are developed under index numbers.
(iii) Many a time, it becomes necessary to examine the past performance of an activity with a view
to determining its future behaviour. For example, when engaged in the production of a commodity,
monthly product sales are an important measure of evaluating performance. This requires
compilation and analysis of relevant sales data over time. The more complex the activity, the more
varied the data requirements. For profit maximising and future sales planning, forecast of likely
sales growth rate is crucial. This needs careful collection and analysis of past sales data. All such
concerns are taken care of under time series analysis.
(iv). Obtaining the most likely future estimates on any aspect(s) relating to a business or economic
activity has indeed been engaging the minds of all concerned. This is particularly important when
it relates to product sales and demand, which serve the necessary basis of production scheduling
and planning. The regression, correlation, and time series analyses together help develop the basic
methodology to do the needful. Thus, the study of methods and techniques of obtaining the likely
estimates on business/economic variables comprises the scope of what we do under business
forecasting.
(v). Keeping in view the importance of inferential statistics, the scope of statistics may finally be
restated as consisting of statistical methods which facilitate decision--making under conditions of
uncertainty. While the term statistical methods is often used to cover the subject of statistics as a
whole, in particular it refers to methods by which statistical data are analysed, interpreted, and the
inferences drawn for decision-making. Though generic in nature and versatile in their applications,
statistical methods have come to be widely used, especially in all matters concerning business and
economics.
(vi). These are also being increasingly used in biology, medicine, agriculture, psychology, and
education. The scope of application of these methods has started opening and expanding in a
number of social science disciplines as well. Even a political scientist finds them of increasing
209
relevance for examining the political behaviour and it is no surprise to find historians statistical
data, for history is essentially past data presented in certain actual format.
Example 1
Write a note on the scope and limitations of Statistics.
Importance of Statistics
There are three major functions in any business enterprise in which the statistical methods are
useful. These are as follows:
(i) The planning of operations: This may relate to either special projects or to the recurring
activities of a firm over a specified period.
(ii) The setting up of standards: This may relate to the size of employment, volume of sales,
fixation of quality norms for the manufactured product, norms for the daily output, and so forth.
(iii) The function of control: This involves comparison of actual production achieved against the
norm or target set earlier. In case the production has fallen short of the target, it gives remedial
measures so that such a deficiency does not occur again.
A point worth noting is that although these three functions-planning of operations, setting
standards, and control-are separate, but in practice they are very much interrelated.
Irwing W. Burr, dealing with the place of statistics in an industrial organisation, specifies a number
of areas where statistics is extremely useful.
• Customer wants and market research
• Development design and specification
• Purchasing, production, inspection, packaging and shipping
• Sales and complaints
• Inventory and maintenance
• Costs, management control, industrial engineering and research.
In the sphere of production, statistics can be useful in various ways:
210
• Statistical quality control methods are used to ensure the production of quality goods.
• Identifying and rejecting defective or substandard goods achieve this.
• The sale targets can be fixed on the basis of sale forecasts, which are done by using varying
methods of forecasting.
Analysis of sales affected against the targets set earlier would indicate the deficiency in
achievement, which may be on account of several causes:
(i) targets were too high and unrealistic
(ii) salesmen's performance has been poor
(iii) emergence of increase in competition
(iv) poor quality of company's product.
Another sphere in business where statistical methods can be used is personnel management. Here,
one is concerned with the fixation of wage rates, incentive norms and performance appraisal of
individual employee. The concept of productivity is very relevant here.
On the basis of measurement of productivity, the productivity bonus is awarded to the workers.
Comparisons of wages and productivity are undertaken in order to ensure increases in industrial
productivity.
Statistical methods could also be used to ascertain the efficacy of a certain product, say, medicine.
For example, a pharmaceutical company has developed a new medicine in the treatment of
bronchial asthma. Before launching it on commercial basis, it wants to ascertain the effectiveness
of this medicine. It undertakes an experimentation involving the formation of two comparable
groups of asthma patients.
One group is given this new medicine for a specified period and the other one is treated with the
usual medicines. Records are maintained for the two groups for the specified period. This record
is then analysed to ascertain if there is any significant difference in the recovery of the two groups.
If the difference is really significant statistically, the new medicine is commercially launched.
Example 2
Write a note on the scope of Statistics in education.
Limitations of Statistics
Statistics has a number of limitations, pertinent among them are as follows:
(i). There are certain phenomena or concepts where statistics cannot be used. This is because these
phenomena or concepts are not amenable to measurement. For example, beauty, intelligence,
211
courage cannot be quantified. Statistics has no place in all such cases where quantification is not
possible.
(ii). Statistics reveal the average behaviour, the normal or the general trend. An application of the
'average' concept if applied to an individual or a particular situation may lead to a wrong conclusion
and sometimes may be disastrous. For example, one may be misguided when told that the average
depth of a river from one bank to the other is four feet, when there may be some points in between
where its depth is far more than four feet. On this understanding, one may enter those points having
greater depth, which may be hazardous.
(iii). Since statistics are collected for a particular purpose, such data may not be relevant or useful
in other situations or cases. For example, secondary data (i.e., data originally collected by someone
else) may not be useful for the other person.
(iv). Statistics are not 100 per cent precise as is Mathematics or Accountancy. Those who use
statistics should be aware of this limitation.
(v). In statistical surveys, sampling is generally used as it is not physically possible to cover all the
units or elements comprising the universe. The results may not be appropriate as far as the universe
is concerned. Moreover, different surveys based on the same size of sample but different sample
units may yield different results.
(vi). At times, association or relationship between two or more variables is studied in statistics, but
such a relationship does not indicate cause and effect' relationship. It simply shows the similarity
or dissimilarity in the movement of the two variables. In such cases, it is the user who has to
interpret the results carefully, pointing out the type of relationship obtained.
(vii). A major limitation of statistics is that it does not reveal all pertaining to a certain
phenomenon. There is some background information that statistics does not cover. Similarly, there
are some other aspects related to the problem on hand, which are also not covered. The user of
Statistics has to be well informed and should interpret Statistics keeping in mind all other aspects
having relevance on the given problem.
Example 3
Write a note on the use of Statistics in education.
Misuses of statistics
Apart from the limitations of statistics mentioned above, there are misuses of it. Many people,
knowingly or unknowingly, use statistical data in wrong manner. Let us see what the main misuses
of statistics are so that the same could be avoided when one has to use statistical data.
212
The misuse of Statistics may take several forms some of which are explained below.
(i). Sources of data not given: At times, the source of data is not given. In the absence of the source,
the reader does not know how far the data are reliable. Further, if he wants to refer to the original
source, he is unable to do so.
(ii). Defective data: Another misuse is that sometimes one gives defective data. This may be done
knowingly in order to defend one's position or to prove a particular point. This apart, the definition
used to denote a certain phenomenon may be defective. For example, in case of data relating to
unemployed persons, the definition may include even those who are employed, though partially.
The question here is how far it is justified to include partially employed persons amongst
unemployed ones.
(iii). Unrepresentative sample: In statistics, several times one has to conduct a survey, which
necessitates to choose a sample from the given population or universe. The sample may turn out
to be unrepresentative of the universe. One may choose a sample just on the basis of convenience.
He may collect the desired information from either his friends or nearby respondents in his
neighbourhood even though such respondents do not constitute a representative sample.
(iv). Inadequate sample: Earlier, we have seen that a sample that is unrepresentative of the universe
is a major misuse of statistics. This apart, at times one may conduct a survey based on an extremely
inadequate sample. For example, in a city we may find that there are 1, 00,000 households.
When we have to conduct a household survey, we may take a sample of merely 100 households
comprising only 0.1 per cent of the universe. A survey based on such a small sample may not yield
right information.
(v). Unfair Comparisons: An important misuse of statistics is making unfair comparisons from the
data collected.
One way of unfair comparison is to construct an index of production choosing the base year where
the production was much less. Then he may compare the subsequent year's production from this
low base. Such a comparison will undoubtedly give a rosy picture of the production though in
reality it is not so.
Again, a source of unfair comparisons could be when one makes absolute comparisons instead of
relative ones. An absolute comparison of two figures, say, of production or export, may show a
good increase, but in relative terms it may turnout to be very negligible.
Also, an unfair comparison is when the population in two cities is different, but a comparison of
overall death rates and deaths by a particular disease is attempted. Such a comparison is wrong.
213
In addition, when data are not properly classified or when changes in the composition of population
in the two years are not taken into consideration, comparisons of such data would be unfair as they
would lead to misleading conclusions.
Again, sometimes one may not be sure about the changes in business environment in the near
future. In such a case, one may use an assumption that may turn out to be wrong.
In addition, source of unwarranted conclusion may be the use of wrong average. Suppose in a
series there are extreme values, one is too high while the other is too low, such as 800 and 50.
Lastly, the use of an arithmetic average in such a case may give a wrong idea. Instead, harmonic
mean would be proper in such a case.
(vii). Confusion of correlation and causation: In statistics, several times one has to examine the
relationship between two variables. A close relationship between the two variables may not
establish a cause-and-effect-relationship in the sense that one variable is the cause and the other is
the effect. It should be taken as something that measures degree of association rather than try to
find out causal relationship.
Example 4
What are the major misuses of Statistics in Ghana?
Activity 4.1
1. Explain the importance of statistics to education, trade, commerce and business.
2. What are the major limitations of Statistics? Explain with suitable examples.
Summary
We have learned that:
• The scope of statistics are pairing two data sets, averaging (or totalling) of data, examining
the past performance of an activity with a view to determining its future behaviour,
obtaining the most likely future estimates on any aspect(s), keeping in view the importance
214
of inferential statistics, facilitating decision--making under conditions of uncertainty, and
being increasingly used in biology, medicine, agriculture, psychology and education.
• There are three major functions of statistics, namely planning of operations, setting up of
standards, and function of control.
• The other limitations are sampling is generally used as it is not physically possible to cover
all the units or elements comprising the universe, association or relationship between two
or more variables does not indicate cause and effect' relationship, and statistics does not
reveal all pertaining to a certain phenomenon.
• The misuse of statistics are sources of data not given, defective data, unrepresentative
sample, inadequate sample, unwanted conclusions, and confusion of correlation and
causation.
INTRODUCTION
In the previous section we saw that there are two ways to describe data. This section defines the
three most common measures of central tendency, namely the mean, the median, and the mode.
The relationships among these measures of central tendency are paramount to this section.
215
This section also gives the basic definitions of the mean, median and mode. A further discussion
of the relative merits and proper applications of these statistics is presented. Another way to
summarize the data in a quantitative variable, plus tables and graphs, is to present them in the form
of the numeric values, called measures of central tendency. These measures, when calculated from
population data, are called parameters and, when calculated from sample data are called estimators
or statistics.
Learning indicators
1. Identify situations in which knowing the centre of a distribution would be valuable.
2. Give different ways the centre of a distribution can be defined.
3. Recognize, describe, and calculate the measures of the centre of data.
4. Recognize, describe, and calculate the measures of location of data: quartiles and percentiles.
5. The present lesson imparts understanding of the calculations and main properties of measures
of central tendency, including mean, mode, median, quartiles, and percentiles.
The measures of central tendency are required to enable us to compare two or more distributions
pertaining to the same time period or within the same distribution over time. For example, we
216
require the central tendency to measure the average consumption of rice in two different regions
at the same period or in one region at two different years.
The ‘centre’ of a data set is a way of describing location of the data. The three most widely used
measures of the ‘centre’ of the data are the mean (average), the mode and the median. To calculate
the mean weight of 50 people, add the 50 weights together and divide by 50. To find the mode of
50 people’s weight, locate the entry with the highest occurrence of the weight. To find the median
weight of the 50 people, order the data and find the number that splits the data into two equal parts.
The centre of a distribution could be defined three ways; namely the point on which a distribution
would balance, the value whose average absolute deviation from all the other values is minimized,
and the value whose squared difference from all the other values is minimized. The mean is the
point on which a distribution would balance, the median is the value that minimizes the sum of
absolute deviations, and the mean is the value that minimizes the sum of the squared deviations.
The median is generally a better measure of the centre when there are extreme values or outliers
because it is not affected by the precise numerical values of the outliers. The mean is the most
common measure of the centre. any of three are required to measure the central location of the
data.
Generally, descriptive measures are measures of position, of central tendency, of dispersion, and
of skewness and kurtosis. While describing data statistically or verbally, one requires the central
tendency to describe the data.
In everyday life, the word “average” is used in a variety of ways – batting averages, average life
expectancies, and so on but the meaning is similar, usually the centre of a distribution. In the
mathematical world, where everything must be precise, we require the central tendency to define
several ways of finding the centre of a set of data.
The first measure of central location is the usual “average” that is familiar to everyone. In the
formula, we normally introduce the standard summation notation Σ, where Σ is the capital Greek
217
letter sigma. In general, the notation Σ followed by a second mathematical symbol means to add
up all the values that the second symbol can take in the context of the problem.
The mean is the sum of all the values in a set, divided by the number of values. The mean of a
whole population is usually denoted by , while the mean of a sample is usually denoted by x .
The mean is sensitive to any change in value, unlike the median and mode, where a change to an
extreme (in the case of a median) or uncommon (in the case of a mode) value usually has no effect.
The mean is mostly used measure of central tendency, and is defined as the sum of the expected
values of all observations (observation is an element of a sample) divided by the number of
observations.
The Greek letter μ (pronounced ‘mew’) represents the population mean. One of the requirements
for the sample mean to be a good estimate of the population mean is for the sample taken to be
truly random.
The arithmetic mean is the most common measure of central tendency. It is simply the sum of the
numbers divided by the number of numbers. The symbol “M” is used for the mean of a sample,
Note that although the arithmetic mean is not the only “mean”. We have the geometric mean, the
harmonic mean, the trimean and the trimmed mean just to mention a few. However, it is by far the
most commonly used mean. However, if the term “mean” is used without specifying, it is assumed
to refer to the arithmetic mean.
218
Symbolically, the arithmetic mean, also called simply mean is x = x / n , where x is the data set,
n is the sample size and x is simple mean.
Examples 1
1. TB data indicating the number of months a patient with TB lives after taking a new antibody
drug are as follows (smallest to largest): 3; 4; 8; 8; 10; 11; 12; 13; 14; 15; 15; 16; 16; 17; 17; 18;
21; 22; 22; 24; 24; 25; 26; 26; 27; 27; 29; 29; 31; 32; 33; 33; 34; 34; 35; 37; 40; 44; 44; 47; calculate
the mean.
Solution
x=
x = 3 + 4 + 8 + ... + 44 + 47 = 23.6 .
n 40
2. The number of touchdown (TD) passes thrown by each of the 31 teams in the National Football
League in the 2010 season was shown below: 37, 33, 33, 32, 29, 28, 28, 23, 22, 22, 22, 21, 21, 21,
20, 20, 19, 19, 18, 18, 18, 18, 16, 15, 14, 14, 14, 12, 12, 9, 6. Calculate the mean number of
touchdown passes thrown.
Solution
x=
x = 37 + 37 + 33 + ... + 9 + 6 = 20.4516
n 31
3. A random sample of ten students was taken from the student body of a class and their GPAs
were recorded as follows: 1.90, 3.00, 2.53, 3.71, 2.12, 1.76, 2.71, 1.39, 4.00, 3.33. Find the sample
mean.
Solution
x=
x = 1.90 + 3.00 + 2.53 + ... + 4.00 + 3.33 = 2.65
n 10
Discrete frequency table or weighted arithmetic mean
When each value in the data set is not unique, the mean can be calculated by multiplying each
distinct value by its frequency and then dividing the sum by the total number of data values.
Symbolically, the mean is x = fx / n , where x is the data set, f is the frequency and x is simple
mean.
219
Example
1. A random sample of 19 women beyond child-bearing age gave the following data, where x is
the number of children and f is the frequency of that value, the number of times it occurred in the
data set. Use the table to find the sample mean.
x f
0 3
1 6
2 6
3 3
4 1
Solution
• The data are presented by means of a data frequency table.
• Each number in the first line of the table is a number that appears in the data set
• Each number below it is how many times it occurs. Thus the value 0 is observed three
times, that is, three of the measurements in the data set are 0, the value 1 is observed six
times, and so on.
• In the context of the problem, three women in the sample have had no children, six have
had exactly one child, and so on.
• The explicit list of all the observations in this data set is therefore 0,0,0,1,1,1,
1,1,2,2,2,2,2,2,3,3,3,4.
0 + 0 + 0 + 1 + ... + 3 + 3 + 4
• The sample mean could have been x = x / n = = 1.6316
19
However, because the sample size comes directly from a table, without first listing the entire data
set, the sum of the frequencies n = 3 + 6 + 6 + 3 + 1 = 19 .
The sample mean can be computed directly from the table as well:
0 3 + 1 6 + 2 6 + 3 3 + 4 1
x = fx / n = = 1.6316
19
2440
x = fx / n = = 61
40
3. Suppose Aku has secured Mid-term test (30), Laboratory (25) and Final exam (20) in three tests.
Assuming that the weights assigned to the three tests are 2 points, 3 points and 5 points in Mid-
term test, Laboratory and Final exam respectively, calculate her mean score.
Solution
Type of test Relative (w) weight Mark wx
Mid term 2 30 60
Laboratory 3 25 75
Final exam 5 20 100
Totals 10 235
x=
xw = 60 + 75 + 100 = 23.5 .
n 2+3+5
4. An investor is fond of investing in equity shares. During a period of falling prices in the stock
exchange, a stock is sold at ₵120 per share on one day, ₵105 on the next and ₵90 on the third day.
The investor has purchased 50 shares on the first day, 80 shares on the second day and 100 shares
on the third' day. What average price per share did the investor pay?
Solution
Day Price per share (x) Number of shares (w) Amount paid (wx)
1 120 50 6000
2 105 80 8400
3 90 100 9000
Total 230 23,400
221
x=
xw = 6000 + 8400 + 9000 = 101.70
n 50 + 80 + 100
5. The mean of the following frequency distribution was found to be 1.46. use the table to calculate
the missing frequencies.
Solution
Here we are given the total number of frequencies and the arithmetic mean. We have to determine
the two frequencies that are missing. Let us assume that the frequency against 1 accident is x and
against 2 accidents is y. If we can establish two simultaneous equations, then we can easily find
the values of X and Y.
x=
fx = 0.46 + (1.x) + (2. y) + (3.25) + (4.10) + (5.5) =
n 200
(1.x) + (2. y ) + 140
1.46 = =
200
1.46(200) = 1.x + 2. y + 140 or 1.46(200) = x + 2. y = 152 ……………..(1)
x + y = 200 − (46 + 25 + 10 + 5) or x + y = 114 ………………………..(2)
Now subtracting equation (ii) from equation (i), we get y = 38
Substituting the value of y = 38 in equation (2), we get x = 76
Hence, the missing frequencies are:
Against accident 1: 76
Against accident 2: 38
222
A frequency table is a data representation in which grouped data is displayed along with the
corresponding frequencies.
To calculate the mean from a grouped frequency table we can apply the basic definition of mean:
mean = data sum number o f data values. We simply need to modify the definition to fit within the
restrictions of a frequency table.
Since we do not know the individual data values we can instead find the midpoint of each interval.
lowerboundary + upperboundary
The midpoint is x = .
2
We can now modify the mean definition to be Mean o f Frequency Table is x = fx / f ,
where f is the frequency of the interval and x is the midpoint of the interval.
When each value in the data set is not unique to the interval data, the mean can be calculated by
finding the midpoints of the class intervals, multiplying each distinct frequency by its midpoint
and then dividing the sum of the products by the total number of data frequencies (Stats3).
Symbolically, the mean is x = fx / f , where x is the data set, f is the frequency, f is the
sum of the frequencies and x is simple mean. Note that f is the same as n.
223
86.5–92.5 4
92.5–98.5 1
Solution
• Find the midpoints for all intervals
• Calculate the sum of the product of each interval frequency and midpoint.
• Calculate the mean by dividing the sum of the product of each interval frequency by the
midpoint: x = fx / f = 1460.25 / 19 = 76.86 .
2. The following table gives the marks of 58 students in Statistics. Calculate the average marks of
this group.
Marks Number of students
0-10 4
10-20 8
20-30 11
30-40 15
40-50 12
50-60 6
60-70 2
224
Solution
Marks Midpoint (x) Number of students (f) fx
0-10 5 4 20
10-20 15 8 120
20-30 25 11 275
30-40 35 15 525
40-50 45 12 540
50-60 55 6 330
60-70 65 2 130
Total 58 1940
x = A+
fd = A + fd where A is the arbitrary or assumed mean, f is the frequency, and d
n f ,
is the deviation from the arbitrary or assumed mean.
When the values are extremely large and/or in fractions, the use of the direct method would be
very cumbersome. In such cases, the short-cut method is preferable. This is because the calculation
work in the short-cut method is considerably reduced particularly for calculation of the product of
values and their respective frequencies.
However, when calculations are not made manually but by a machine calculator, it may not be
necessary to resort to the short-cut method, as the use of the direct method may not pose any
problem.
The second term in the formula ( fd n) is the correction factor for the difference between the
actual mean and the assumed mean.
225
If the assumed mean turns out to be equal to the actual mean, ( fd n) will be zero. The use of
the short-cut method is based on the principle that the total of deviations taken from an actual mean
is equal to zero.
As such, the deviations taken from any other figure will depend on how the assumed mean is
related to the actual mean. While one may choose any value as assumed mean, it would be proper
to avoid extreme values, that is, too small or too high to simplify calculations.
Note that a value apparently close to the arithmetic mean and either usually come from the class
with the highest frequency or class in the middle should be chosen.
Example
The following table gives the marks of 58 students in Statistics. Calculate the average mark of this
group.
Marks Number of students
0-10 4
10-20 8
20-30 11
30-40 15
40-50 12
50-60 6
60-70 2
Solution
Marks Midpoint (x) Number (f) Deviations (d=35) fd
0-10 5 4 -30 -120
10-20 15 8 -20 -160
20-30 25 11 -10 -110
30-40 35 15 0 0
40-50 45 12 10 120
50-60 55 6 20 120
226
60-70 65 2 30 60
Total 58 -90
x = A+
fd = 35 + − 90 = 33.45 or 33 marks approximately.
n 58
(iii) Step-deviation or coded method
In the case of step-deviation, the concepts of arbitrary mean and coded value are followed. The
formula for calculation of the arithmetic mean by the step-deviation method is given below:
x = A+
fu C = A + fu C where A is the arbitrary or assumed mean, f is the frequency,
n f ,
u is the quotient from the assumed mean and C is the class interval of the ‘A’ class. The term
(u = d / C ) is the correction factor for the deviations and the assumed mean.
If the class intervals turn out to be equal, then (C ) will be equal. The use of the step-deviation
method is based on the principle that the interval intervals are equal and that any other class chosen
will not affect the product ( fu ) .
Example
The following table gives the marks of 58 students in Statistics. Calculate the average mark of this
group.
Marks Number of students
0-10 4
10-20 8
20-30 11
30-40 15
40-50 12
50-60 6
60-70 2
Solution
Marks Midpoint (x) Number (f) Deviations (d=35) u=d/c fu
0-10 5 4 -30 -3 -12
10-20 15 8 -20 -2 -16
20-30 25 11 -10 -1 -11
30-40 35 15 0 0 0
40-50 45 12 10 1 12
50-60 55 6 20 2 12
227
60-70 65 2 30 3 6
Total 58 -9
x = A+
fu c = 35 + − 9 10 = 33.45 or 33 marks approximately.
n 58
Comparisons of the three strategies
It will be seen that the answer in each of the three cases is the same. However, the step-deviation
method is the most convenient on account of simplified calculations.
It may also be noted that if we select a different arbitrary assumed mean and recalculate deviations,
we should get the same answer.
Again, the arithmetic mean can be calculated by using any of the three different methods.
Since the sum of the deviations in the positive direction is equal to the sum of the deviations in the
negative direction, the arithmetic mean is regarded as a measure of central tendency.
2. The sum of the squared deviations of the individual items from the arithmetic mean is always
minimum. In other words, the sum of the squared deviations taken from any value other than the
arithmetic mean will be higher.
3. As the arithmetic mean is based on all the items in a series, a change in the value of any item
will lead to a change in the value of the arithmetic mean.
4. In the case of highly skewed distribution, the arithmetic mean may get distorted on account of
a few items with extreme values. In such a case, it may cease to be the representative characteristic
of the distribution.
9. It can be computed even if the detailed distribution is not known but some of the observations
and number of the observations are known.
1. One disadvantage of the mean is that a small number of extreme values can distort its value. For
example, the mean of the set {1, 1, 1, 2, 2, 3, 3, 3, 200} is 24, even though almost all of the
members were very small.
A variation called the trimmed mean, where the smallest and largest quarters of the values are
removed before the mean is taken, can solve this problem.
2. The mean value may not exist among the observations.
3. The interpretations of the mean may be absurd if it is decimal.
4. It can give fallacy conclusions of different data sets.
5. The mean value is meaningless in qualitative data.
6. The mean cannot be estimated from any graph.
7. The mean cannot be computed when class intervals have open ends
Geometric Mean
The geometric mean is computed by multiplying all the numbers together and then taking the nth
root of the product. For example, for the numbers 1, 10, and 100, the product of all the numbers is
1 10 100 = 1,000 .
Since there are three numbers, we take the cubed root of the product (1,000) which is equal to 10.
The formula for the geometric mean is therefore
Therefore, the equation tells us to multiply all the values of X and then raise the result to the 1/nth
power. Raising a value to the 1/nth power is the same as taking the nth root of the value. In this
case, 10001 / 3 is the cube root of 1,000 .
229
The geometric mean has a close relationship with logarithms. The arithmetic mean of three logs is
1. The anti-log of this arithmetic mean of 1 is the geometric mean. The anti-log of 1 is 101 = 10 .
Again, the geometric mean is a measure of central tendency. The geometric mean of n numbers is
obtained by multiplying all of them together, and then taking the nth root of them. For instance,
for the numbers 1, 10, and 100, the product of all the numbers is 1 10 100 = 1000
Since there are three numbers, we take the cubed root of the product (1,000) which is 10.
Alternatively, geometric mean is defined at the nth root of the product of n observations of a
distribution. Symbolically, GM = n x1 x 2 x 3 ... x n
If we have only two observations, x1 and x 2 then GM = x1 x 2 . Similarly, if there are only
three observations, then we have to calculate the cube root of the product of these three
observations such as GM = 3n x1 x 2 x 3 ; and so on.
When the number of items is large, it becomes extremely difficult to multiply the numbers and to
calculate the root. To simplify the calculations, logarithms are used. That is, log GM =
log x
n
log x
and GM = anti log
n .
Note that the geometric mean only makes sense if all the numbers are positive. The geometric
mean is an appropriate measure to use for averaging rates.
Solution
log GM =
log x = log 2 + log 4 + og8
n 3
log 2 + log 4 + og8 0.3010 + 0.6021 + 0.9031
= = 0.60206
3 3
GM = anti log(0.60206) = 4
230
2. Bayor has invested ₵5,000 in the stock market. At the end of the first year the amount has grown
to ₵6,250; she has had a 25 percent profit. If at the end of the second year the principal has grown
to ₵8,750, the rate of increase is 40 percent for the year. What is the average rate of increase of
her investment during the two years?
Solution
GM = x1 x 2 = 1.251.40 = 1.323
The average rate of increase in the value of investment is therefore 1.323−1 = 0.323, which if
multiplied by 100, gives the rate of increase as 32.3% .
Alternatively, we can also derive a compound interest formula from the above set of data.
Now, 1.251.40 = 1.75 . This can be written as 1.75 = (1 + 0.323) .
2
Let P2 = 1.75 , P0 = 1 , and r = 0.323 , then the above equation can be written as P2 = (1 + r ) or
2
P2 = P0 (1 + r ) ,where P2 the value of investment at the end of the second year, P0 is the initial
2
This, in fact, is the familiar compound interest formula, which can be written in the generalize
form as Pn = P0 (1 + r ) .
n
Note that if the arithmetic mean is used, the resultant figure will be wrong. This is because the
25 + 40
average rate for the two years is percent per year, which comes to 32.5.
2
165
Applying this rate, we get Pn = 5,000 = 8,250
100
This is obviously wrong, as the figure should have been ₵8,750.
3. An economy has grown at 5 percent in the first year, 6 percent in the second year, 4.5 percent
in the third year, 3 percent in the fourth year and 7.5 percent in the fifth year. What is the average
rate of growth of the economy during the five years?
Solution
Year Rate Value x Log x
1 5 105 2.02119
231
2 6 106 2.02531
3 4.5 104.5 2.01912
4 3 103 2.01284
5 7.5 107.5 2.03141
Total 10.10987
log x
GM = anti log = anti log 10.10987 = 105.19
n 5
Hence, the average rate of growth during the five-year period is 105.19 −100 = 5.19 percent per
annum.
In case of a simple arithmetic average, the corresponding rate of growth would have been 5.2
percent per annum.
4. Consider a stock portfolio that began with a value of ₵1,000 and had annual returns of 13%,
22%, 12%, -5%, and -13%. Use the table below with the value after each of the five years to
calculate the geometric mean.
Solution
The question is how to compute average annual rate of return. The answer is to compute the
geometric mean of the returns.
Instead of using the percents, each return is represented as a multiplier indicating how much higher
the value is after the year.
This multiplier is 1.13 for a 13% return and 0.95 for a 5% loss.
232
Therefore, the average annual rate of return is 5%. The portfolio gaining 5% a year would end up
with the same value (₵1,276) as shown below.
In view of the limitations mentioned above, the geometric mean is not frequently used.
233
1 n
HM = reciprocal f i = , Where n is the total number of observations. Here,
xi 1
f i x
i
each reciprocal of the original figure is weighted by the corresponding frequency (f).
Example
1. Suppose we have three observations 4, 8 and 16. Calculate the harmonic mean.
Solution
1 1 1
Reciprocals of 4,8 and 16 are , and respectively
4 8 16
3 3
HM = = = 6.857
1 / 4 + 1 / 8 + 1 / 16 0.25 + 0.125 + 0.0625
Solution
Class-interval Mid-value (x) Frequency (f) Frequency Reciprocal of x f *1/x
2-4 3 20 0.3333 6.6660
4-6 5 40 0.2000 8.0000
6-8 7 30 0.1429 4.2870
8-10 9 10 0.1111 1.1111
Total 20.0641
n 100
HM = = = 4.984
1 20.0641
f i
xi
3. In a small company, two typists are employed. Typist A types one page in ten minutes while
typist B takes twenty minutes for the same.
(i). If both are asked to type 10-page document. What is the average time taken for typing one
page?
(ii), If both are asked to type for one hour. What is the average time taken for typing one page?
Solution
Here (i) is on arithmetic mean while (ii) is on harmonic mean.
234
i. Mean =
(10 10) + (20 20) = 15 min utes
(20 2)
60 40
ii. Harmonic Mean = = = 13 min utes 20 sec onds
60 / 10 + 60 / 20 3
4. It takes ship A 10 days to cross the Pacific Ocean; ship B takes 15 days and ship C takes 20
days.
(i). What is the average number of days taken by a ship to cross the Pacific Ocean?
(ii).What is the average number of days taken by a cargo to cross the Pacific Ocean when the ships
are hired for 60 days?
Solution
Here again (i) pertains to simple arithmetic mean while (ii) is a harmonic mean.
10 + 15 + 20
i. Mean = = 15 days
3
60 3
ii. Harmonic Mean = = 13.8 days
60 / 10 + 60 / 15 + 6020
2. When we desire to give greater weight to smaller observations and less weight to the larger
observations, then the use of harmonic mean is more suitable.
Again, it is worth noting that the harmonic mean is always lower than the geometric mean, which
is lower than the arithmetic mean. This is because the harmonic mean assigns lesser importance to
higher values.
In addition, since the harmonic mean is based on reciprocals, it becomes clear that as reciprocals
of higher values are lower than those of lower values, it is a lower average than the arithmetic
mean as well as the geometric mean.
235
Trimean
The trimean is a weighted average of the 25th percentile, the 50th percentile, and the 75th
percentile. Letting P25 be the 25th percentile, P50 be the 50th and P75 be the 75th percentile, the
formula for the trimean is given by:
p 25 + 2 p50 + p 75
Trimean =
4
The trimean is a robust measure of central tendency; it is a weighted average of the 25th, 50th, and
75th percentiles. Alternatively, it is as: Trimean = 0.25 25th + 0.5 50th + 0.25 75th .
Example
Given 37, 33, 33, 32, 29, 28, 28, 23, 22, 22, 22, 21, 21, 21, 20, 20, 19, 19, 18, 18, 18, 18, 16, 15,
14, 14, 14, 12, 12, 9, 6. Find the trimean.
Solution
The 25th percentile is 15, the 50th is 20 and the 75th percentile is 23.
p 25 + 2 p50 + p 75 15 + 2 20 + 23
Trimean = = = 19.5
4 4
Trimmed Mean
To compute a trimmed mean, you remove some of the higher and lower scores and compute the
mean of the remaining scores.
A mean trimmed 10% is a mean computed with 10% of the scores trimmed off: 5% from the
bottom and 5% from the top.
A mean trimmed 50% is computed by trimming the upper 25% of the scores and the lower 25%
of the scores and computing the mean of the remaining scores.
The trimmed mean is similar to the median which, in essence, trims the upper 49+% and the lower
49+% of the scores.
This means trimmed mean is a robust measure of central tendency generally falling between the
mean and the median. And in the computation of the median:
• all observations are ordered.
• Next, the highest and lowest alpha percent of the data are removed, where alpha ranges
from 0 to 50.
• Finally, the mean of the remaining observations is taken.
236
The trimmed mean has advantages over both the mean and media, and therefore a hybrid of the
mean and the median, but is analytically more intractable.
Example
Find a 20% trimmed mean of the following data: 37, 33, 33, 32, 29, 28, 28, 23, 22, 22, 22, 21, 21,
21, 20, 20, 19, 19, 18, 18, 18, 18, 16, 15, 14, 14, 14, 12, 12, 9, 6.
Solution
To compute the mean trimmed 20% for the touchdown pass data above, we first remove the lower
10% of the scores (6, 9, and 12) as well as the upper 10% of the scores (33, 33, and 37).
Activity 5.1
1. The mean age of 5 persons in a room is 30 years. A 36-year-old person walks in. What is the
mean age of the persons in the room now?
2. Out of 100 numbers, 20 were 4s, 40 were 5s, 30 were 6s and the remainder were 7s. Find the
arithmetic mean of the numbers.
3. The following set of raw data shows the length, in millimeters, measured to the nearest mm, of
each of 40 leaves taken from plants of a certain species. Use the table to find the mean.
Length (mm Frequency (f)
25-29 2
30-34 4
35-39 7
40-44 10
45-49 8
50-54 6
55-59 3
4. The following set of raw data shows the height, in meters, measured to the nearest m, of each
of 20 trees taken from plants of a certain species. Use the table to find the mean.
Length (mm Frequency (f)
500-600 3
600-700 6
700-800 5
800-900 5
900-1000 0
237
1000-1100 1
Frequency 2 4 9 11 12 6 4 2
6. A set of numbers is transformed by taking the log base 10 of each number. The mean of the
transformed data is 1.65. What is the geometric mean of the untransformed data?
7. Which measure of central tendency is most often used for returns on investment?
THE MEDIAN
Introduction
The mean and the median can be calculated to help you find the ‘centre’ of a data set. As the mean
is the best estimate for the actual data set, the median is the best measurement when a data set
contains several outliers or extreme values, and the mode will tell you the most frequently
occurring datum (or data) in the data set.
The three are extremely helpful when you need to analyze your data, but if your data set consists
of ranges which lack specific values, the mean may seem impossible to calculate. However, the
mean can only be approximated if you add the lower boundary with the upper boundary and divide
by two to find the midpoint of each interval, and proceed as outlined in the grouped data.
The ‘centre’ of a data set is also a way of describing location. The two most widely used measures
of the ‘centre’ of the data are the mean (average) and the median. We all know that to calculate
the mean weight of 50 people, add the 50 weights together and divide by 50. But to find the median
weight of the 50 people, we have to order the data and find the number that splits the data into two
equal parts.
This makes the median a better measure of the centre when there are extreme values or outliers.
This is because it is not affected by the precise numerical values of the outliers.
238
Various conceptions of the median
The median is the middle number of a set of numbers arranged in numerical order. If the number
of values in a set is even, then the median is the sum of the two middle values, divided by 2. The
median is not affected by the magnitude of the extreme (smallest or largest) values.
Thus, it is useful because it is not affected by one or two abnormally small or large values, and
because it is very simple to calculate. For example, to obtain a relatively accurate average life of a
particular type of light bulb, you could measure the median life by installing several bulbs and
measuring how much time passed before half of them died. Alternatives would probably involve
measuring the life of each bulb.
Median is defined as the value of the middle item (or the mean of the values of the two middle
items) when the data are arranged in an ascending or descending order of magnitude. Thus, in an
ungrouped frequency distribution if the n values are arranged in ascending or descending order of
magnitude, the median is the middle value if n is odd. When n is even, the median is the mean of
the two middle values.
The median is the measure used to indicate the centre of a distribution. Ordered elements in the
sample, the median is the value (or not the sample belongs) which divides in half, i.e. 50% of the
sample elements are less than or equal to the median and 50% are greater than or equal to the
median.
The median is a number that separates ordered data into halves so that half the values are the same
number or smaller than the median and half the values are the same number or larger than the
median. The median may or may not be part of the data at all. When there is an odd number of
numbers, the median is simply the middle number, such as the median of 2, 4, and 7 is 4. But when
there is an even number of numbers, the median is the mean of the two middle numbers.
The median is a frequently used measure of central tendency. The median is the midpoint of a
distribution: the same number of scores is above the median as below it. For the data 37, 33, 33,
32, 29, 28, 28, 23, 22, 22, 22, 21, 21, 21, 20, 20, 19, 19, 18, 18, 18, 18, 16, 15, 14, 14, 14, 12, 12,
9, 6, there are 31 scores. The 16th highest score (which equals 20) is the median because there are
15 scores below the 16th score and 15 scores above the 16th score. The median can also be thought
of as the 50th percentile.
Median is defined as the value of the middle item (or the mean of the values of the two middle
items) when the data are arranged in an ascending or descending order of magnitude. Thus, in an
ungrouped frequency distribution if the n values are arranged in ascending or descending order of
magnitude, the median is the middle value if n is odd.
239
When n is even, the median is the mean of the two middle values. Suppose we have the series: 15,
19,21,7, 10,33,25,18 and 5, we have to first arrange it in either ascending or descending order as:
5,7,10,15,18,19,21,25,33. Because the series consists of odd number of items, we use the formula
n +1
to obtain 18.
2
n +1
It may be noted that the formula itself is not the formula for the median; it merely indicates
2
the position of the median, namely, the number of items we have to count until we arrive at the
item whose value is the median. In the case of the even number of items in the series, we identify
the two items whose values have to be averaged to obtain the median.
The sample median ˜x of a set of discrete sample data for which there are an even number of
measurements is the mean of the two middle measurements when the data are arranged in
numerical order. The median is a value that divides the observations in a data set so that 50% of
the data are on its left and the other 50% on its right.
Example 1
1. Suppose we have the following series: 15, 19,21,7, 10,33,25,18 and 5. Calculate the median.
Solution
We have to first arrange it in either ascending or descending order. These figures are arranged in
an ascending order as follows: 5,7,10,15,18,19,21,25,33.
Now as the series consists of odd number of items, to find out the value of the middle item, we use
n +1 9 +1
the formula . This gives us = 5 , that is, the size of the 5th item is the median. This
2 2
happens to be 18.
2. Suppose the series consists of 5, 7, 10, 15, 18, 19, and 21,23,25,33. Calculate the median.
Solution
n +1
Applying the formula , the median is the size of 5.5th item. Here, we have to take the average
2
of the values of 5th and 6th item.
This means an average of 18 and 19, which gives the median as 18.5.
240
3. Given the variable x = {1, 3, 0, 2.4}, calculate the average median of a data set.
Solution
i). order the set such as x= {0,1,2,3,4}
ii). Verify that there is an odd or even number of values in the set; in the above example: 5
iii). If it is odd the median is the value that occupies the central position, and, if even will be the
average of the two central positions.
n +1 5 +1
i). Because it is odd, the median is th = = 3rd ,which is 2.
2 2
Solution
The data has already been ordered
The two middle numbers are 4 and 7
4+7
Therefore, the median is = 5.5 !
2
Median of an ungrouped data
• Rearrange the data in either ascending or descending.
• Find the cumulative frequency of the frequencies.
n +1 n
• Find the cumulative frequency whose sum meets or exceeds the if odd or if even
2 2
total number.
n +1 n
• Find the median from the or
2 2
Example
1. The following table shows the number of times drivers experience traffic jams in a city. Use the
table to find the median.
Driving Frequency
Every day 751
Few days a week 217
Few days a month 82
Few times a year 37
Never 457
Number 1,544
Solution
241
• Find the cumulative frequency of the table.
Driving Frequency Cumulative frequency
Every day 751 751
Few days a week 217 968
Few days a month 82 1,050
Few times a year 37 1,087
Never 457 1,544
Number 1,544 ----
Example
1. The table shows the wages of workers in a company. Use the table to find the median salary.
Monthly wage Number of workers
800-1,000 18
242
1,000-1,200 25
1,200-1,400 30
1,400-1,600 34
1,600-1,800 26
1,800-2,000 10
Total 143
Solution
In order to calculate median, we have to first provide cumulative frequency as shown below:
Monthly wage Number of workers Cumulative frequency
800-1,000 18 18
1,000-1,200 25 43
1,200-1,400 30 73
1,400-1,600 34 107
1,600-1,800 26 133
1,800-2,000 10 143
Total 143
n +1 143 + 1
We calculate the value as = 72 . So, the median class is 1,200-1,400.
2 2
L − L1
We substitute our values into M = L1 + 2 (m − c ) as
f
1,400 − 1,200
M = 1,200 + (72 − 43) = 1,393.30
30
Solution
Since the data have two open-end classes-one in the beginning (below 50) and the other at the end
(200 and above), median should be the right choice as a measure of central tendency.
So, we find the cumulative frequency table as below:
Size Frequency Cumulative frequency
Below 50 15 15
50-100 20 35
243
100-150 36 71
150-200 40 111
200 and above 10 121
Total 121
n + 1 121+ 1
Calculate the median class of = = 61 , which lies between 100-150.
2 2
L − L1
Calculate the value of M = L1 + 2 (m − c ) = 100 + 150 − 100 (61− 35) = 136.11
f 36
Questions
1. The data in numerical order are 1.39; 1.76; 1.90; 2.12; 2.53; 2.71; 3.00; 3.33; 3.71; 4.00.
Calculate the median.
Solution
The number of observations is ten, which is even, so there are two middle measurements, the fifth
2.53 + 2.71
and sixth, which are 2.53 and 2.71. Therefore the median of these data is M = = 2.62
2
.
2. Suppose that in a small town of 50 people, one person earns ₵5,000,000 per year and the other
49 each earn ₵30,000. Which is the better measure of the ‘centre’: the mean or the median?
Solution
5,000,000 + 49(330,000)
x= = 129,400 and M = 30,000
50
There are 49 people who earn ₵30,000 and one person who earns ₵5,000,000. The median is a
better measure of the ‘centre’ than the mean because 49 of the values are 30,000 and one is
5,000,000. The 5,000,000 is an outlier.
3. The playbill for the Alley Theatre in Houston wants to appeal to advertisers. They reported the
mean household income and the median age of theatre goers. What might have guided their choice
of the mean or median?
Solution
It is likely that they wanted to emphasize that theatre goers had high income but de-emphasize
how old they are. The distributions of income and age of theatre goers probably have positive
244
skew. Therefore the mean is probably higher than the median, which results in higher income and
lower age than if the median household income and mean age had been presented!
• Simplicity: It is very simple measure of the central tendency of the series. I the case of
simple statistical series, just a glance at the data is enough to locate the median value.
• Free from the effect of extreme values: Unlike arithmetic mean, median value is not
destroyed by the extreme values of the series.
• Certainty: Certainty is another merits is the median. Median values are always a certain
specific value in the series.
• Real value: Median value is real value and is a better representative value of the series
compared to arithmetic mean average, the value of which may not exist in the series at all.
• Graphic presentation: Besides algebraic approach, the median value can be estimated also
through the graphic presentation of data.
• Possible even when data is incomplete: Median can be estimated even in the case of certain
incomplete series. It is enough if one knows the number of items and the middle item of
the series.
• . Open end intervals: As taking any value of the intervals, value of the median remains the
same.
• Other statistical devices: The median is also used for other statistical devices such as mean
deviation and skewness.
• It can be used for the Quantities. It is possible to arrange in any order and to locate the
middle valve. For such cases it is the best measure.
• It can easily located by inspection
• It is appropriate for a qualitative data
Exercise 5.2
1. Design three data sets on your own that have 5 numbers in each set so that:
(a). the same mean but different standard deviations.
(b). the same mean but different medians.
(c). the same median but different means.
2. Compare the mean, median, trimean in terms of their sensitivity to extreme scores.
3. If the mean time to respond to a stimulus is much higher than the median time to respond, what
can you say about the shape of the distribution of response times?
4. Find the mean and the median for the LDL cholesterol level in a sample of ten heart patients of
132, 139, 162, 147, 133, 160, 145, 150, 148, 153.
Answers
4. x = 146.9 M = 147.5
5. Find the mean and the median, for the LDL cholesterol level in a sample of ten heart patients
on a special diet of 127, 113, 152, 131, 138, 148, 110, 135, 152, 158
246
6. A man tosses a coin repeatedly until it lands heads and records the number of tosses required.
(For example, if it lands heads on the first toss he records a 1; if it lands tails on the first two tosses
and heads on the third he records a 3.) The data are shown.
X 1 2 3 4 5 6 7 8 9 10
f 384 208 98 56 28 12 8 2 3 1
Answers
6. x = 2.05 , Me = 2 , Mo = 1
8. In a sample of 60 households, one house is worth ₵2,500,000. Half of the rest are worth
₵280,000, and all the others are worth ₵315,000. Which is the better measure of the ‘centre’: the
mean or the median? (Stats1pp1107).
Summary
How do the various measures of central tendency compare with each other? For symmetric
distributions, the mean, median, trimean, and trimmed mean are equal, as is the mode except in
bimodal distributions. Differences among the measures occur with skewed distributions.
In the distribution, 37, 33, 33, 32, 29, 28, 28, 23, 22, 22, 22, 21, 21, 21, 20, 20, 19, 19, 18, 18, 18,
18, 16, 15, 14, 14, 14, 12, 12, 9, 6, we can notice they do not differ greatly, with the exception that
the mode is considerably lower than the other measures.
When distributions have a positive skew, the mean is typically higher than the median, although it
may not be in bimodal distributions. For these data, the mean of 91.58 is higher than the median
of 90.
Typically, the trimean and trimmed mean will fall between the median and the mean, although in
this case, the trimmed mean is slightly lower than the median. The geometric mean is lower than
all measures except the mode (Stats5).
247
There is no need to summarize a distribution with a single number. When the various measures
differ, our opinion is that you should report the mean, median, and either the trimean or the mean
trimmed 50%. Sometimes it is worth reporting the mode as well.
In the media, the median is usually reported to summarize the centre of skewed distributions. You
will hear about median salaries and median prices of houses sold, and so on. This is better than
reporting only the mean, but it would be informative to hear more statistics (Stats5).
THE MODE
Introduction
Another measure of the centre is the mode. The mode is the most frequent value. There can be
more than one mode in a data set as long as those values have the same frequency and that
frequency is the highest.
A data set with one mode is called unimodal, with two modes is called bimodal and with three
modes is called trimodal. Any other number of modes beyond three is called multimodal.
The mode is also a measure of central location since most real-life data sets have more observations
near the centre of the data range and fewer observations on the lower and upper ends. The value
with the highest frequency is often in the middle of the data range.
The mode can be calculated for qualitative data as well as for quantitative data. For instance, in a
qualitative data such as red, red, red, green, green, yellow, purple, black, blue, the mode is red.
248
And in a quantitative data such as 5, 3, 6, 5, 4, 5, 2, 8, 6, 5, 4, 8, 3, 4, 5, 4, 8, 2, 5, and 4, the mode
is 5.
The mode is the most frequent value in a distribution: the mode of 3, 4, 4, 5, 5, 5, 8 is 5. Note that
the mode may be very different from the mean and the median.
The sample mode of a set of sample data is the most frequently occurring value. On a relative
frequency histogram, the highest point of the histogram corresponds to the mode of the data set.
Example 1
1. Statistics exam scores for 20 students are as follows:
50; 53; 59; 59; 63; 63; 72; 72; 72; 72; 72; 76; 78; 81; 83; 84; 84; 84; 90; 93. Find the mode.
Solution
The most frequent score is 72, which occurs five times.
Therefore, the mode is 72.
2. The number of books checked out from the library from 25 students are as follows:
0; 0; 0; 1; 2; 3; 3; 4; 4; 5; 5; 7; 7; 7; 7; 8; 8; 8; 9; 10; 10; 11; 11; 12; 12. Find the mode.
Solution
The mode is 7
3. Five real estate exam scores are 430, 430, 480, 480, and 495. Find the mode.
Solution
The data set is bimodal because the scores 430 and 480 each occur twice.
Therefore, the modes are 430 and 480.
4. Five credit scores are 680, 680, 700, 720, and 720. Find the mode.
Solution
The data set is bimodal because the scores 680 and 720 each occur twice.
Therefore, the modes are 680 and 720.
5. Create a sample data set of size n = 4 for which the mean, the median, and the mode are all
identical.
249
Solution
{0, 1, 1, 2}
With continuous data, such as response time measured to many decimals, the frequency of each
value is one since no two scores will be exactly the same. Therefore, the mode of continuous data
is normally computed from a grouped frequency distribution.
Statistical method
• Find the modal class---the class interval that has the largest frequency.
• Find the difference of frequency between the modal class to its upper class (a).
• Find the difference of frequency between the modal class to its lower class (b).
a
• Add lb to the to products by c.
a+b
a
• Therefore, Mo = Lo + C where Lo is the lower limit of the modal class
a+b
b is frequency of the modal class - frequency of previous class to the modal class
a is frequency of the modal class - frequency of posterior class to the modal class
C is amplitude of the modal class
Example 2
Find the mode from the table below:
250
Interval 30-40 40-50 50-60 60-70 70-80 80-90 90-100
Frequency 4 6 8 12 9 7 4
Solution
Interval 30-40 40-50 50-60 60-70 70-80 80-90 90-100
Frequency 4 6 8 12 9 7 4
We can see from Column (2) of the table that the maximum frequency of 12 lies in the class-
interval of 60-70. This suggests that the mode lies in this class interval.
Applying the second formula given earlier, we get:
f1 − f 0 12 − 8
Mo = L1 + C = 60 + 10
( f1 − f 0 ) + ( f1 − f 2 ) (12 − 8) + (12 − 9)
12 − 8 4
Mo = 60 + 10 = 60 + 10 = 65.7
(12 − 8) + (12 − 9) 4+3
Algebraic method
If we have a bi-modal or multimodal series, which shows that some classes have occurred the same
number of times, we can determine mode indirectly by applying the following formula:
Mo = 3median − 2mean = 3M − 2x
Example 3
Find the mode from the table below:
Solution
Size Frequency cf x d u=d/10 fu
10-20 10 10 15 -20 -2 -20
20-30 18 28 25 -10 -1 -18
30-40 25 53 35 0 0 0
40-50 26 79 45 10 1 26
50-60 17 96 55 20 2 34
60-70 4 100 65 30 3 12
251
Total 100 34
40 − 30
Median = 30 + (50.5 − 28) = 39
25
34
Mean = 35 + 10 = 38.4
100
Mode = 3(39) − 2(38.4) = 40.2
The mean, the median, and the mode are each seven for these data. In a perfectly symmetrical
distribution, the mean and the median are the same.
This example has one mode (unimodal), and the mode is the same as the mean and median. In a
symmetrical distribution that has two modes (bimodal) or more modes (multimodal), the modes
are normally different from the mean and median.
252
We can see that the mean is 6.3, the median is 6.5, and the mode is 7. Notice that the mean is less
than the median, and they are both less than the mode. The mean and the median both reflect the
skewing, but the mean reflects it more so.
Therefore, when a distribution is skewed to the left, then mod e median mean . This is because
the mean is pulled down below the median by extremely low values.
253
In this figure, the mean is 7.7, the median is 7.5, and the mode is 7. Of the three statistics, the mean
is the largest, while the mode is the smallest. Again, the mean reflects the skewing the most.
If the distribution of data is skewed to the right, the mode is often less than the median, which is
less than the mean. That is, mean median mod e . In such a case, the mean is pulled up by the
extreme high values.
When mean median, it is skewed to the right; when median mean, it is skewed to the left. It
may be noted that the median is always in the middle between mean and mode.
The Best Measure of Central Tendency
The question commonly asked is: which of the three measures of central tendency is the best?
There is no simple answer to this question. This is because the three measures are based upon
different concepts:
• The arithmetic mean is the sum of the values divided by the total number of observations
in the series.
• The median is the value of the middle observation that divides the series into two equal
parts.
• Mode is the value around which the observations tend to concentrate.
As such, the use of a particular measure will largely depend on the purpose of the study and the
nature of the data;
• When we are interested in knowing the consumers preferences for different brands of
television sets or different kinds of advertising, the choice should go in favour of mode.
The use of mean and median would not be proper.
• Median can sometimes be used in the case of qualitative data when such data can be
arranged in an ascending or descending order.
• Mean is most appropriate in the case of quantitative data when such data can be arranged
in an interval or a ratio.
Example 4
1. Find the mean, the median, and the mode for the number of vehicles owned in a survey of 52
households in the following table:
x 0 1 2 3 4 5 6 7
f 2 12 15 11 6 3 1 2
Solution
Mean=2.6, median=2, mode=2
254
2. Begin with the following set of Data Set I: 5, −2, 6, 14, −3, 0, 1, 4, 3, 2, 5.
a. Compute the mean, median, and mode.
b. Form a new data set, Data Set II, by adding 3 to each number in Data Set I.
Calculate the mean, median, and mode of Data Set II.
c. Form a new data set, Data Set III, by subtracting 6 from each number in
Data Set I. Calculate the mean, median, and mode of Data Set III.
d. Comparing the answers to parts (a), (b), and (c), can you guess the pattern?
State the general principle that you expect to be true.
Answers
2. a. x = 3.18 , Me = 3 , Mo = 5
b. x = 6.18 , Me = 6 , Mo = 8
c. x = −2.18 , Me = −3 , Mo = −1
d. If a number is added to every measurement in a data set, then the mean, median, and mode all
change by that number.
3. Suppose we invite applications for a certain vacancy in our company. A large number of
candidates apply for that post. What measure should we use to determine the age or age group with
the largest concentration of applicants?
Solution
Here, obviously the mode will be the most appropriate choice.
The arithmetic mean may not be appropriate as it may be influenced by some extreme values.
However, the mean happens to be the most commonly used measure of central tendency as will be
evident from other data.
Exercise
1. Calculate the mean, median and mode from the following data:
a.
Class 10-19 20-29 30-39 40-49 50-59 60-69 70-79 80-89
Frequency 2 4 9 11 12 6 4 2
b.
Height (inc) 62-63 63-64 64-65 65-66 66-67 67-68 68-69
255
Frequency 2 6 14 16 8 3 1
2. What are the requirements of a good average? Compare the mean, the median and the mode in
the light of these requirements? Why averages are called measures of central tendency?
Summary
We have learned that the mean, the median, and the mode each answer the question “Where is the
centre of the data set?” The nature of the data set determines which one gives the best answer. It
is left with the learner to determine the best method to calculate the central value.
The median is an example that specifies a location such that half the data fall below it and half fall
above it. The range uses two other measures of position, the maximum value and the minimum
value. The z-score tells us how far an observation falls from a particular point, such as the number
of standard deviations an observation falls from the mean.
Another more common use of the term quantiles is a general term for partitioning ranked data into
equal parts. For example, quartiles partition the data into 4 equal parts. Percentiles partition the
data into 100 equal parts. Thus, the k-th q-tile is the value in the data for which k/q of the values
are below the given value. This naturally leads to some rounding issues which lead to a large
variety of small differences in the definition of quantiles.
A more common use of the term quantiles is a general term for partitioning ranked data into equal
parts. For example, quartiles partition the data into 4 equal parts. Percentiles partition the data into
100 equal parts. Thus, the k-th q-tile is the value in the data for which k/q of the values are below
the given value. This naturally leads to some rounding issues which lead to a large variety of small
differences in the definition of quantiles.
The common measures of location are quartiles and percentiles. Quartiles are special percentiles.
The first quartile, Q1, is the same as the 25th percentile, and the third quartile, Q3, is the same as
the 75th percentile. The median, M, is called both the second quartile and the 50th percentile.
256
To calculate quartiles and percentiles, the data must be ordered from smallest to largest. Quartiles
divide ordered data into quarters. Percentiles divide ordered data into hundredths. To score in the
90th percentile of an exam does not mean, necessarily, that you received 90% on a test. It means
that 90% of test scores are the same or less than your score and 10% of the test scores are the same
or greater than your test score.
Locations of percentiles
The values that divide a rank-ordered set of data into 100 equal parts are called percentiles.
Percentiles are used to compare and interpret data. For example, an observation at the 50th
percentile would be greater than 50 percent of the other observations in the set.
Percentiles are useful for comparing values. For this reason, universities and colleges use
percentiles extensively. One instance in which colleges and universities use percentiles is when
exam results are used to determine a minimum testing score that will be used as an acceptance
factor. For example, suppose UEW accepts exam scores at or above the 75th percentile for
postgraduate studies. That translates into a score of at least 3.50.
Percentiles are mostly used with very large populations. Therefore, if you were to say that 90% of
the test scores are less (and not the same or less) than your score, it would be acceptable because
removing one particular data value is not statistically significant.
The Pth percentile cuts the data set in two so that approximately P% of the data lie below it and
(100 − P)% of the data lie above it. In particular, the three percentiles that cut the data into fourths
are called the quartiles.
There is no universally accepted definition of a percentile. Using the 65th percentile as an example:
1. The 65th percentile can be defined as the lowest score that is greater than 65% of the scores.
2. The 65th percentile can also be defined as the smallest score that is greater than or equal to 65%
of the scores.
3. The 65th percentile can also be defined as a weighted average of the percentiles computed
according to the first two definitions.
Unfortunately, the first two definitions can lead to dramatically different results, especially when
there is relatively little data.
Moreover, neither of these definitions is explicit about how to handle rounding. For instance, what
rank is required to be higher than 65% of the scores when the total number of scores is 50?
257
This is tricky because 65% of 50 is 32.5. How do we find the lowest number that is higher than
32.5% of the scores? The third definition handles rounding more gracefully than the other two and
has the advantage that it allows the median to be defined conveniently as the 50th percentile.
k
In all cases, the position of the percentile is i = (n + 1) , where:
100
k = the kth percentile. It may or may not be part of the data.
i = the index (ranking or position of a data value)
n = the total number of data
258
.Example
1. Find the 25th percentile for the 8 numbers in the Table below.
.
Number 3 5 7 8 9 11 13 15
Rank 1 2 3 4 5 6 7 8
Solution
The first step is to compute the rank (R) of the 25th percentile. This is done using the formula:
(n + 1) , where P is the desired percentile (25 in this case) and n is the number of numbers
p
R=
100
(8 in this case).
(n + 1) = (8 + 1) = 2.25 .
p 25
Therefore, R =
100 100
If R is an integer, the Pth percentile is the number with rank R.
When R is not an integer, we compute the Pth percentile by interpolation as follows:
1. Define IR as the integer portion of R (the number to the left of the decimal point i.e. IR=2).
2. Define FR as the fractional portion of R (the number to the right of the decimal point i.e.
IR=0.25).
3. Find the scores with Rank IR and with Rank IR + 1. This means the score with Rank 2 and the
score with Rank 3 are 5 and 7 respectively.
4. Interpolate by multiplying the difference between the scores by FR and add the result to the
lower score. That is, (0.25) (7 − 5) + 5 = 5.5 .
Therefore, the 25th percentile is 5.5.
• If we had used the first definition (the smallest score greater than 25% of the scores), the
25th percentile would have been 7.
• If we had used the smallest score greater than or equal to 25% of the scores, the 25th
percentile would have been 5.
• But because we used the weighted average of the percentiles computed according to the
first two definitions,, the 25th percentile is 5.5.
2. The table below represents the cores of 20 pupils in a test. Use the table to find the 25th and 85th
percentiles.
Score 4 4 5 5 5 5 6 6 6 7 7 7 8 8 9 9 9 10 10 10
Rank 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Solution
a). For the 25th:
259
(n + 1) = (25 + 1) = 5.25
p 25
R=
100 100
IR = 5 and FR = 0.25.
Since the score with a rank of IR (which is 5) and the score with a rank of IR + 1 (which is 6) are
both equal to 5, the 25th percentile is 5.
In terms of the formula: R25 = (0.25) (5 − 5) + 5 = 5 .
The score with a rank of 17 is 9 and the score with a rank of 18 is 10. Therefore, the 85th percentile
is: R85 = (0.85) (10 − 9) + 9 = 9.85
Solution
(n + 1) = (4 + 1) = 2.5
p 50
R=
100 100
IR = 2 and FR = 0.5.
The score with a rank of IR is 3 and the score with a rank of IR + 1 is 5.
4. Listed are 29 ages for Academy Award winning best actors in order from smallest to largest.
18; 21; 22; 25; 26; 27; 29; 30; 31; 33; 36; 37; 41; 42; 47; 52; 55; 57; 58; 62; 64; 67; 69; 71; 72; 73;
74; 76; 77
260
a. Find the 70th percentile.
b. Find the 83rd percentile.
Solution
a. k = 70
i = the index
n = 29
k
(n + 1) = (29 + 1) = 21
70
i =
100 100
5. Twenty-one is an integer, and the data value in the 21st position in the ordered data set is 64.
The 70th percentile is 64 years.
b. k = 83rd percentile
i = the index
n = 29
i=k
k
(n + 1) = (29 + 1) = 24.9 , which is NOT an integer. Round it down to 24 and up to 25.
83
i =
100 100
The age in the 24th position is 71 and the age in the 25th position is 72. Average of 71 and 72 is
71.5 years.
Example 2
1. Listed are 29 ages for Academy Award winning best actors in order from smallest to largest.
18; 21; 22; 25; 26; 27; 29; 30; 31; 33; 36; 37; 41; 42; 47; 52; 55; 57; 58; 62; 64; 67; 69; 71; 72; 73;
74; 76; 77
a. Find the percentile for 58.
b. Find the percentile for 25.
261
Solution
a. Counting from the bottom of the list, there are 18 data values less than 58. There is one value of
58.
x + 0.5 y 18 + 0.5(1)
x = 18 and y = 1. (100) = (100) = 63.80 . 58 is the 64th percentile.
n 29
b. Counting from the bottom of the list, there are three data values less than 25. There is one value
of 25.
x + 0.5 y 3 + 0.5(1)
x = 3 and y = 1. (100) = (100) = 12.07 . Twenty-five is the 12th percentile.
n 29
In the same manner, we can calculate deciles (where the series is divided into 10 parts) and
percentiles (where the series is divided into 100 parts) (Stats10pp32).
Locations of Deciles or Decision
These are values that divide the ordered data set (list) within ten (10) equal parts.
First Decile (D1) - set value so the data series that 10% of the observations are smaller than him
and 90% are greater.
Second Decile (D2) - set value so the data series that 20% of the observations are smaller than him
and 80% are greater.
Ninth Decile (D9) - set value so the data series that 90% of the observations are smaller than him
and 10% are greater.
Steps taken to calculate the First decile:
Compare the value of (n / 10) with the cumulative values, starting from the first class (the top!)
and asking the question: “This cumulative is greater than or equal to (n / 10)?” If the answer is
NO, the cumulative goes to the next class. When the answer is YES, we will stop and try the
corresponding class!
262
Finally, we will apply the formula of Q3, extracting the data Q1 of the class, and use the formula.
Example
For the set below, determine the value of the first decile.
Step 1) we will find and calculate (n / 10): Hence, we find that n = 24 and therefore (n / 10) = 2.4
We think, therefore, that the corresponding class (10-20) will be our Class of First Decile!
To the E: D1 = 10.8
263
A safer procedure is to obtain a type of range that uses observations that are not the largest and
smallest values. The interquartile range (IQR) is one that is commonly used. The interquartile
range is defined as IQR = Q3 - Q1.
Three quartiles divide the distribution into four equal parts, with 25% of the distribution in each
part. We have already introduced one of the quartiles, Q2, which is the median. The quartile Q1
divides the lower half of the distribution into halves; Q3 divides the upper half of the distribution
into halves. Quartiles are computed by first ordering the data, and the location of Q1 is .25(n+l),
Q2 is .50(n+l), and Q3 is .75(n+1).
The interquartile range is available in many statistical programs. The Q1 and Q3 quartiles are not
easy measures to compute by hand, as they often require interpolation. (Interpolation is a method
of estimating an unknown value by using its position among a series of known values. Since the
quartiles are not sensitive to the numerical values of extreme observations, they are considered
measures of location resistant to the effect of outliers.
Note that the numerical value of the difference between the median and Q1 does not have to equal
the difference between Q3 and the median. If the distribution is skewed to the right, then Q3 minus
the median usually is larger than the median minus Q1. But the proportion of observations is the
same.
For small samples, fourths are simpler measures to compute. If n is even, we simply compute the
median of the lower and upper halves of the ordered observations and call them the lower and
upper fourths, respectively. If n is odd, we consider the middle measurement, or median, to be part
264
of the lower half of measurements and compute the median of these measurements, Q1. Then
assign the median to the upper half of the measurements and compute the median of the upper half,
Q3.
Quartiles are numbers that separate the data into quarters. Quartiles may or may not be part of the
data. To find the quartiles, first find the median or second quartile. The first quartile, Q1, is the
middle value of the lower half of the data, and the third quartile, Q3, is the middle value, or median,
of the upper half of the data. To get the idea, consider the same data set:
1; 1; 2; 2; 4; 6; 6.8; 7.2; 8; 8.3; 9; 10; 10; 11.5.
The number two, which is part of the data, is the first quartile. One-fourth of the entire sets of
values are the same as or less than two and three-fourths of the values are more than two. The
upper half of the data is 7.2, 8, 8.3, 9, 10, 10, 11.5. The middle value of the upper half is nine.
Quartiles divide data into quarters. The first quartile (Q1) is the 25th percentile, the second quartile
(Q2 or median) is 50th percentile, and the third quartile (Q3) is the 75th percentile. The
interquartile range, or IQR, is the range of the middle 50 percent of the data values. The IQR is
found by subtracting Q1 from Q3, and can help determine outliers by using the following two
expressions.
• Q3 + IQR(1.5)
• Q1 – IQR(1.5)
k
i = (n + 1) , where i = the ranking or position of a data value, k = the kth percentile, n = total
100
number of data.
x + 0.5 y
Expression for finding the percentile of a data value: (100) , where x = the number of
n
values counting from the bottom of the data list up to but not including the data value for which
you want to find the percentile, y = the number of data values equal to the data value for which
you want to find the percentile.
To understand quartile and decile, we should first know that the median belongs to a general class
of statistical descriptions called fractiles. A fractile is a value below that lays a given fraction of a
set of data. In the case of the median, this fraction is one-half (1/2).
Likewise, a quartile has a fraction one-fourth (1/4). The three quartiles Q1, Q2 and Q3 are such
that 25 percent of the data fall below Q1, 25 percent fall between Q1 and Q2, 25 percent fall
265
between Q2 and Q3 and 25 percent fall above Q3. It will be seen that Q2 is the median. We can
use the above formula for the calculation of quartiles as well.
The third quartile, Q3, is nine. Three-fourths (75%) of the ordered data set are less than nine. One-
fourth (25%) of the ordered data set are greater than nine. The third quartile is part of the data set
in this example.
The IQR can help to determine potential outliers. A value is suspected to be a potential outlier if it
is less than (1.5)(IQR) below the first quartile or more than (1.5)(IQR) above the third quartile.
Potential outliers always require further investigation.
The series are equally separated. These measures are the quartiles, deciles and percentiles. We call
quartiles the values of a series that fall into four (4) equal parts. Three quartiles are therefore
identified (Q1, Q2 and Q3) to divide the series into four equal parts.
Note: The quartile 2 (Q2) will always be equal to the median of the series.
Example
1. The first step to be taken is the sort (ascending or descending) of the values:
3. The value that divides the above series into two equal parts is greater than 9, then the
Median = Q2 = 9 that will be.
4. The steps for determining the Q1 of a set are as follows: Determine on the (adding column
fi);
• Calculate the value of (n / 4) (whether n is even or odd!);
• To build the college column;
• To compare the value of (n / 4) with the college’s values, starting from the first college
class (the top!) And asking the question: “This college is greater than or equal to (n / 4)? “
266
If the answer is NO, the college spent the next class. When the answer is YES, we will stop
and try the corresponding class! This will be our Class of First Quartile.
Finally, we will apply the formula for Q1, extracting the data Q1 of this class, we
just found! Again the formula:
Example
For the set below, determine the value of the third quartile!
Xi 0-10 10-20 20-30 30-40 40-50 n
fi 2 5 8 6 3 24
Solution
Step 1) We will find and calculate n (3n / 4):
Step 4) we will apply the formula of Q3, using data from the Class of Q3, just identified!
Therefore, Q3 is 35
267
Locations of medians
The median is a number that measures the ‘centre’ of the data. You can think of the median as the
"middle value," but it does not actually have to be one of the observed values. It is a number that
separates ordered data into halves. Half the values are the same number or smaller than the median,
and half the values are the same number or larger. For example, consider the following data: 1;
11.5; 6; 7.2; 4; 8; 9; 10; 6.8; 8.3; 2; 2; 10; 1.
The median is seven. Half of the values are smaller than seven and half of the values are larger
than seven.
The median or second quartile is seven. The lower half of the data are 1, 1, 2, 2, 4, 6, 6.8. The
middle value of the lower half is 2: 1; 1; 2; 2; 4; 6; 6.8.
Given an observed value x in a data set, x is the Pth percentile of the data if the percentage of the
data that are less than or equal to x is P. The number P is the percentile rank of x.
In addition to the three quartiles, the two extreme values, the minimum xmin and the maximum
max are also useful in describing the entire data set. Together these five numbers are called the
five-number summary of the data set: {xmin , Q1 , Q2 , Q3 , xmax}
The box plot
The five-number summary is used to construct a box plot. Each of the five numbers is represented
by a vertical line segment, a box is formed using the line segments at Q1 and Q3 as its two vertical
sides, and two horizontal line segments are extended from the vertical segments marking Q1 and
Q3 to the adjacent extreme values.
The two horizontal line segments are referred to as “whiskers,” and the diagram is sometimes
called a ‘box and whisker plot.’
We caution that there are other types of box plots that differ somewhat from the ones we are
constructing, although all are based on the three quartiles.
268
Example
Construct a box plot and find the IQR for the data 1.39, 1.76, 1.90, 2.12, 2.53, 2.71, 3.00, 3.33,
3.71, 3.88, and 4.00
Solution
xmin = 1.39, Q1 = 1.90, Q2 = 2.62, Q3 = 3.33, xmax = 4.00
Example
What percentile is the value 3.33?
Solution
The data written in increasing order are: 1.39; 1.76; 1.90; 2.12; 2.53; 2.71; 3.00; 3.33; 3.71; 4.00.
The only data value that is less than or equal to 1.39 is 1.39 itself.
Since 1 is 1∕10 = .10 or 10% of 10, the value 1.39 is the 10th percentile.
Eight data values are less than or equal to 3.33. Since 8 is 8∕10 = .80 or 80% of 10, the value 3.33
is the 80th percentile.
Examples
269
1. Find the quartiles of the data set of GPAs of1.39; 1.76; 1.90; 2.12; 2.53; 2.71; 3.00; 3.33; 3.71;
4.00.
Solution
This data set has n = 10 observations. Since 10 is an even number, the median is the mean of the
two middle observations:
x ˜ = (2.53 + 2.71) / 2 = 2.62. Thus the second quartile is Q2 = 2.62.
2. Adjoin the observation 3.88 to the data set of the previous example and find
the quartiles of the new set of data.
Solution:
As in the previous example we first list the data in numerical order:
1.39 1.76 1.90 2.12 2.53 2.71 3.00 3.33 3.71 3.88 4.00
This data set has 11 observations. The second quartile is its median, the middle value 2.71. Thus
Q2 = 2.71.
The lower set L has median the middle value 1.90, so Q1 = 1.90. The upper set has median the
middle value 3.71, so Q3 = 3.71.
It may be noted that unlike arithmetic mean, median is not affected at all by extreme values, as it
is a positional average. As such, median is particularly very useful when a distribution happens to
be skewed.
Another point that goes in favour of median is that it can be computed when a distribution has
open-end classes.
Yet, another merit of median is that when a distribution contains qualitative data, it is the only
average that can be used. No other average is suitable in case of such a distribution.
270
Interpreting Percentiles, Quartiles, and Median
A percentile indicates the relative standing of a data value when data are sorted into numerical
order from smallest to largest.
Percentages of data values are less than or equal to the pth percentile. For example, 15% of data
values are less than or equal to the 15th percentile.
• Low percentiles always correspond to lower data values.
• High percentiles always correspond to higher data values.
A percentile may or may not correspond to a value judgment about whether it is "good" or "bad."
The interpretation of whether a certain percentile is "good" or "bad" depends on the context of the
situation to which the data applies.
In some situations, a low percentile would be considered "good;" in other contexts a high percentile
might be considered "good". In many situations, there is no value judgment that applies.
Understanding how to interpret percentiles properly is important not only when describing data,
but also when calculating probabilities.
When writing the interpretation of a percentile in the context of the given data, the sentence should
contain the following information.
• Information about the context of the situation being considered
• The data value (value of the variable) that represents the percentile
• The percent of individuals or items with data values below the percentile
• The percent of individuals or items with data values above the percentile.
Activities 1
1. Following are the published weights (in kg) of all of the team members of UEW basketball from
the previous year.
177; 205; 210; 210; 232; 205; 185; 185; 178; 210; 206; 212; 184; 174; 185; 242; 188; 212; 215;
247; 241; 223; 220; 260; 245; 259; 278; 270; 280; 295; 275; 285; 290; 272; 273; 280; 285; 286;
200; 215; 185; 230; 250; 241; 190; 260; 250; 302; 265; 290; 276; 228; 265.
Answer
271
a). 174; 177; 178; 184; 185; 185; 185; 185; 188; 190; 200; 205; 205; 206; 210; 210; 210; 212; 212;
215; 215; 220; 223; 228; 230; 232; 241; 241; 242; 245; 247; 250; 250; 259; 260; 260; 265; 265;
270; 272; 273; 275; 276; 278; 280; 280; 285; 285; 286; 290; 290; 295; 302
b). 241 c). 205.5 d). 272.5
2. For the following 13 real estate prices in Ghana, calculate the IQR and determine if any prices
are potential outliers. Prices are in Ghana cedis.
389,950; 230,500; 158,000; 479,000; 639,000; 114,950; 5,500,000; 387,000; 659,000; 529,000;
575,000; 488,800; 1,095,000.
Solution
Order the data from smallest to largest.
114,950; 158,000; 230,500; 387,000; 389,950; 479,000; 488,800; 529,000; 575,000; 639,000;
659,000; 1,095,000; 5,500,000
M = 488,800
3. For the two data sets in the test scores below, find:
a. The interquartile range. Compare the two interquartile ranges.
b. Any outliers in either set.
Min Q1 Median Q3 Max
Day 32 56 74.5 82.5 99
Night 25.5 78 81 89 98
Solution
272
a. The IQR for the day group is Q3 – Q1 = 82.5 – 56 = 26.5
The IQR for the night group is Q3 – Q1 = 89 – 78 = 11
The interquartile range (the spread or variability) for the day class is larger than the night class
IQR. This suggests more variation will be found in the day class’s class test scores.
b. Day class outliers are found using the IQR times 1.5 rule. So,
Q1 - IQR(1.5) = 56 – 26.5(1.5) = 16.25
Q3 + IQR(1.5) = 82.5 + 26.5(1.5) = 122.25
Since the minimum and maximum values for the day class are greater than 16.25 and less than
122.25, there are no outliers.
For this class, any test score less than 61.5 is an outlier. Therefore, the scores of 45 and 25.5 are
outliers. Since no test score is greater than 105.5, there is no upper end outlier.
4. Fifty statistics students in UEW, Winneba were asked how much sleep they get per school night
(rounded to the nearest hour) and the results were collated as follows.
Sleep per school Frequency
4 2
5 5
6 7
7 12
8 14
9 7
10 3
273
D. The 80th percentile
E. The 90th percentile
F. The 25th percentile
Solution
Compute the relative frequencies and the cumulative frequencies
B. The median
• Look again at the ‘cumulative relative frequency’ column and find 0.52.
• The median is the 50th percentile or the second quartile. 50% of 50 is 25.
• There are 25 values less than the median, namely the two 4s, the five 5s, the seven 6s, and
eleven of the 7s.
• The median or 50th percentile is between the 25th, or seven, and 26th, or seven, values.
• Therefore, the median is seven
274
• Therefore, the 75th percentile, then, must be an eight (8)
Alternatively
• Another way to look at the problem is to find 75% of 50, which is 37.5, and round up to
38.
• The third quartile, Q3, is the 38th value, which is an eight.
• We can check this answer by counting the values.
• There are 37 values below the third quartile and 12 values above.)
5. On a timed math test, the first quartile for time it took to finish the exam was 35 minutes.
Interpret the first quartile in the context of this situation.
Solution
• Twenty-five percent of students finished the exam in 35 minutes or less.
• Seventy-five percent of students finished the exam in 35 minutes or more.
• A low percentile could be considered good, as finishing more quickly on a timed exam is
desirable. (If you take too long, you might not be able to finish.)
6. On a 20 question math test, the 70th percentile for number of correct answers was 16. Interpret
the 70th percentile in the context of this situation.
Solution
• Seventy percent of students answered 16 or fewer questions correctly.
• Thirty percent of students answered 16 or more questions correctly.
275
• A higher percentile could be considered good, as answering more questions correctly is
desirable.
1. At a UEW, Winneba distance campus, it was found that the 30th percentile of credit units
that students are enrolled for is seven units. Interpret the 30th percentile in the context of
this situation.
Solution
• Thirty percent of students are enrolled in seven or fewer credit units.
• Seventy percent of students are enrolled in seven or more credit units.
• In this example, there is no ‘good’ or ‘bad’ value judgment associated with a higher or
lower percentile.
• Therefore, students attend the campus for varied reasons and needs, and their course load
varies according to their needs.
8. HPERS Department of UEW, Winneba is applying for a grant that will be used to add fitness
equipment to the gym. The Head of Department surveyed 15 anonymous students to determine
how many minutes a day the students spend exercising in the gym.
The results from the 15 anonymous students are shown as follows:
a. 0 minutes; 40 minutes; 60 minutes; 30 minutes; 60 minutes
b. 10 minutes; 45 minutes; 30 minutes; 300 minutes; 90 minutes;
c. 30 minutes; 120 minutes; 60 minutes; 0 minutes; 20 minutes
Use the information to determine the following five values:
a. The minimum time spent
b. The first quartile
c. The median time
d. The third quartile
e. The maximum time spent
f. If you were the HOD of HPERS, how would you be justified in purchasing the new fitness
equipment?
Solutions
a. Min = 0
b. Q1 = 20
c. Med = 40
d. Q3 = 60
e. Max = 300
f. Since 75% of the students exercise for 60 minutes or less daily, and since the IQR is 40
minutes (60 – 20 = 40), we know that half of the students surveyed exercise between 20
minutes and 60 minutes daily.
276
This seems a reasonable amount of time spent exercising, so the principal would be justified in
purchasing the new equipment.
However, the HOD needs to be careful. The value 300 appears to be a potential outlier.
Q3 + 1.5(IQR) = 60 + (1.5)(40) = 120.
The value 300 is greater than 120 so it is a potential outlier. If we delete it and calculate the five
values, we get the following values:
• Min = 0
• Q1 = 20
• Q3 = 60
• Max = 120
• We still have 75% of the students exercising for 60 minutes or less daily and half of the
students exercising between 20 and 60 minutes a day.
• However, 15 students is a small sample and the HOD should survey more students to be
sure of his survey results.
It may be that frequency distributions differ in three ways: Average value, Variability or
dispersion, and Shape. Here, our main spotlight will be on the shape of frequency distribution.
Generally, there are two comparable characteristics called skewness and kurtosis that help us to
understand a distribution.
Two distributions may have the same mean and standard deviation but may differ widely in their
overall appearance as can be seen from the following:
277
In both these distributions the value of mean and standard deviation is the same ( X = 15, σ = 5).
But it does not imply that the distributions are alike in nature.
The distribution on the left-hand side is a symmetrical one whereas the distribution on the right-
hand side is symmetrical or skewed. Measures of skewness help us to distinguish between different
types of distributions.
1. When a series is not symmetrical it is said to be asymmetrical or skewed (Croxton & Cowden).
2. Skewness refers to the asymmetry or lack of symmetry in the shape of a frequency distribution
(Morris Hamburg).
3. Measures of skewness tell us the direction and the extent of skewness. In symmetrical
distribution the mean, median and mode are identical. The more the mean moves away from the
mode, the larger the asymmetry or skewness (Simpson & Kalka).
4. A distribution is said to be ‘skewed’ when the mean and the median fall at different points in
the distribution, and the balance (or centre of gravity) is shifted to one side or the other-to left or
right (Garrett).
The above definitions show that the term ‘skewness’ refers to lack of symmetry i.e., when a
distribution is not symmetrical (or is asymmetrical) it is called a skewed distribution.
The concept of skewness will be clear from the following three diagrams showing a symmetrical
distribution. a positively skewed distribution and a negatively skewed distribution.
278
1. Symmetrical Distribution
It is clear from the diagram that in a symmetrical distribution the values of mean, median and mode
coincide. The spread of the frequencies is the same on both sides of the centre point of the curve.
2. Asymmetrical Distribution
A distribution, which is not symmetrical, is called a skewed distribution and such a distribution
could either be positively skewed or negatively skewed as would be clear from the diagrams (b)
and (c).
It should be noted that in moderately symmetrical distributions the interval between the mean and
the median is approximately one-third of the interval between the mean and the mode. It is this
relationship, which provides a means of measuring the degree of skewness.
Tests of Skewness
In order to ascertain whether a distribution is skewed or not the following tests may be applied.
Skewness is present if:
1. The values of mean, median and mode do not coincide.
2. When the data are plotted on a graph they do not give the normal bell-shaped form i.e. when cut
along a vertical line through the centre the two halves are not equal.
3. The sum of the positive deviations from the median is not equal to the sum of the negative
deviations.
4. Quartiles are not equidistant from the median.
5. Frequencies are not equally distributed at points of equal deviation from the mode.
On the contrary, when skewness is absent, i.e. in case of a symmetrical distribution, the following
conditions are satisfied:
1. The values of mean, median and mode coincide.
2. Data when plotted on a graph give the normal bell-shaped form.
3. Sum of the positive deviations from the median is equal to the sum of the negative deviations.
4. Quartiles are equidistant from the median.
5. Frequencies are equally distributed at points of equal deviations from the mode.
Measures of Skewness
It is an indicator of the shape of the data distribution
1. AS = 0 → distribution is symmetrical;
280
2. AS> 0 → positive distribution is asymmetric;
There are four measures of skewness, each divided into absolute and relative measures. The
relative measure is known as the coefficient of skewness and is more frequently used than the
absolute measure of skewness.
Further, when a comparison between two or more distributions is involved, it is the relative
measure of skewness, which is used. The relative measures of skewness are:
281
(i) Karl Pearson's measure
(ii) Bowley’s measure
(iii) Kelly’s measure
(iv) Moment’s measure.
Karl Pearson’s measure of skewness
mean − mod e
Coefficient of skewness is sk =
3( x − M )
sk =
s
Thus, the direction of skewness is determined by ascertaining whether the mean is greater than the
mode or less than the mode. If it is greater than the mode, then skewness is positive. But when the
mean is less than the mode, it is negative.
The difference between the mean and mode indicates the extent of departure from symmetry. It is
measured in standard deviation units, which provide a measure independent of the unit of
measurement.
The value of coefficient of skewness is zero, when the distribution is symmetrical. Normally, this
coefficient of skewness lies between +1.
If the mean is greater than the mode, then the coefficient of skewness will be positive, otherwise
negative.
282
Example
1. Given the following data, calculate the Karl Pearson's coefficient of skewness: Σx = 452; Σx2=
24270; Mode = 43.7 and N = 10.
Solution
3( x − M )
Pearson’s coefficient is
s
x=
x = 452 = 45.2
n 10
x x
2 2 2
Applying the values of mean, mode and standard deviation in the above formula,
sk =
(x − M ) = 45.2 − 43.7 = 0.08
s 19.59
This shows that there is a positive skewness though the extent of skewness is marginal.
2. From the following data, calculate the measure of skewness using the mean, median and
standard deviation:
f 18 30 40 55 38 20 16
Solution
Using A = Assumed mean = 45, cf = Cumulative frequency, d = Deviation from assumed mean,
and i = 10 to find the mean:
interval f x d =x-a fd fd2 cf
283
30-40 40 35 -1 -40 40 88
50-60 38 55 1 38 38 181
60-70 20 65 2 40 80 201
x = A+
fd c = 45 + − 28 10 = 43.71
n 217
l 2 − l1
M = l1 + (m − c ) = 40 + 50 − 40 (109 − 88) = 43.82
f1 55
s=
fd 2
fd
− c = 584
−
− 28
2
10 = 16.4
n n 217 217
skewness − 0.33
Coefficient of skewness = = = −0.02
s 16.4
The result shows that the distribution is negatively skewed, but the extent of skewness is extremely
negligible.
In a symmetrical distribution, skewness is zero. This means that Q3 and Q1 are positioned
equidistantly from Q2 that is, the median.
284
In symbols, Q3 - Q2 = Q2 – Q1'
In contrast, when the distribution is skewed, then Q3 - Q2 will be different from Q2 – Q1' When
Q3- Q2 exceeds Q2 – Q1' then skewness is positive.
Bowley’s measure of skewness can- be written as: Skewness = (Q3 - Q2) - (Q2 – Q1 or Q3 - Q2 -
Q2 + Q1 or Q3 + Q1 - 2Q2 (2Q2 is 2M).
Solution
It should be noted that the series given in the question is an open-ended series. As such, Bowley's
coefficient of skewness, which is based on quartiles, would be the most appropriate measure of
skewness in this case.
285
In order to calculate the quartiles and the median, we have to use the cumulative frequency.
Example
Use Kelly's measure to calculate skewness from the table below:
Intervals 10 - 20 20-30 30-40 40-50 550-60 60-70 70-80
f 18 30 40 55 38 20 16
286
cf 18 48 88 143 181 201 217
Solution
Now we have to calculate P10, P50 and P90.
p (n + 1)th
mp = =
100
10(n + 1)th 217 + 1
m10 = = = 21.8th , which lies in the 20 - 30 class.
100 10
30 − 20
Therefore, P10 = 20 + (21.8 − 18) = 21.27
30
50(n + 1)th 217 + 1
m 50 = = = 109th , which lies in the class 40 – 50.
100 2
50 − 40
Therefore, P50 = 40 + (109 − 88) = 43.82
55
90(n + 1)th 217 + 1
m 90 = = = 196.2th , which lies in the class 60 - 70.
100 2
70 − 60
Therefore, P90 = 60 + (196.20 − 181) = 67.60
20
P − 2 P50 + P10 67.60 − (2 43.82) + 21.27
Hence, Kelley's skewness, Skk is Skk = 90 = 0.027
P90 − P10 67 − 60 − 21.27
This shows that the series is positively skewed though the extent of skewness is extremely
negligible. It may be recalled that if there is a perfectly symmetrical distribution, then the skewness
will be zero. One can see that the above answer is very close to zero.
In mechanics, the term moment is used to denote the rotating effect of a force. In Statistics, it is
used to indicate peculiarities of a frequency distribution. The utility of moments lies in the sense
that they indicate different aspects of a given distribution.
287
Thus, by using moments, we can measure the central tendency of a series, dispersion or variability,
skewness and the peakedness of the curve. The moments about the actual arithmetic mean are
denoted by μ.
In statistical theory, location and variability are referred to as the first and second moments of a
distribution. The third and fourth moments are called skewness and kurtosis. Skewness refers to
whether the data is skewed to larger or smaller values and kurtosis indicates the propensity of the
data to have extreme values. Generally, metrics are not used to measure skewness and kurtosis;
instead, these are discovered through visual displays.
The kurtosis index measures the extent to which the peak of a unimodal frequency distribution
departs from the shape of normal distribution. A value of zero corresponds to a normal distribution;
positive values indicate a distribution that is more pointed than a normal distribution and a negative
value a flatter distribution.
The first four moments about zero in relation to individual items are as follows:
First moment is x1 = ( x i )
1
n
Second moment is x 2 = ( x i )
1 2
Third moment is x 3 = ( x i )
1 3
Fourth moment is x 4 = ( x i )
1 4
The first four moments about mean or central moments in relation to individual items are:
First moment is 1 = ( x1 − x )
1
n
Second moment is 2 = ( x1 − x )
1 2
Third moment is 3 = ( x1 − x )
1 3
Fourth moment is 4 = ( x1 − x )
1 4
n
The first four moments about mean or central moments in the case of a frequency distribution are
as follows:
First moment is 1 = f i ( x1 − x )
1
n
Second moment is 2 = f i ( x1 − x )
1 2
n
288
f i ( x1 − x )
1
Third moment is 3 =
3
Fourth moment is 4 = f i ( x1 − x )
1 4
n
It may be noted that:
• the first central moment is zero, that is, 1 = 0
• The second central moment is 2 = , indicating the variance.
• The third central moment 3 is used to measure skewness.
• The fourth central moment 4 gives an idea about the Kurtosis.
32
Karl Pearson measure of skewness is based on the third and second central moments as 1 = 3
2
Example
Find the first, second, third and fourth moments about zero for the set of numbers 2,3,4,5 and 6.
Solution
a). First moment is x1 =
1
(xi ) = 2 + 3 + 4 + 5 + 6 = 4
n 5
4 + 9 + 16 + 25 + 36
b). Second moment is x 2 = ( x i ) =
1 2
= 18
n 5
8 + 27 + 64 + 125 + 216
c). Third moment is x 3 = (x i ) =
1 3
= 88
n 5
16 + 81 + 256 + 625 + 1296
d). Fourth moment is x 4 = (x i ) =
1 4
= 454.80
n 5
2. Using the five figures 2,3,4,5 and 6, find the first, second, third and fourth moments about the
mean.
Solutions
a). First moment is 1 =
1
(x − x ) = (2 − 4) + (3 − 4) + (4 − 4) + (5 − 4) + (6 − 4) = 0
n 5
b). Second moment is
(2 − 4) 2 + (3 − 4) 2 + (4 − 4) 2 + (5 − 4) 2 + (6 − 4) 2
2 =
1
( x − x ) 2
= =2
n 5
(2 − 4) 3 + (3 − 4) 3 + (4 − 4) 3 + (5 − 4) 3 + (6 − 4) 3
c). Third moment is 3 =
1
( x − x )3
= =0
n 5
d). Fourth moment is
289
(2 − 4) 4 + (3 − 4) 4 + (4 − 4) 4 + (5 − 4) 4 + (6 − 4) 4
4 =
1
( x − x ) 4
= = 6.8
n 5
3. Calculate the first four central moments from the following data:
Frequency 5 12 20 7 6
Solution
70-80 20 A = 75 0 0 0 0 0 0
80-90 7 85 10 1 7 7 7 7
90-100 6 95 20 2 12 24 48 96
Total 50 0 0 -3 63 -4 195
1 − 3 10
11 =
n
fu c =
50
= −0.6
1 63 10
21 =
n
fu 2 c =
50
= 12.60
1 − 4 10
31 =
n
fu 3 c =
50
= −0 . 8
1 195 10
41 =
n
fu 4 c =
50
= 19
290
μ3= μ3’ - 3 μ2’μ’1+2 μ1’3=-0.8-3(12.6)(-0.6)+2(-0.6)3
= -0.8 + 22.68 + 0.432 = 22.312
Kurtosis measures how fat or thin the tails of a distribution are relative to a normal distribution.
Distributions with long tails are called leptokurtic, and distributions with short tails are called
platykurtic. Normal distributions have zero kurtosis.
Distributions also differ from each other in terms of how large or “fat” their tails. If the upper
distribution has relatively more scores in its tails; its shape is called leptokurtic, and if the lower
distribution has relatively fewer scores in its tails; its shape is called platykurtic.
As distributions differ in kurtosis, the top distribution having long tails is called “leptokurtic’’, and
the bottom distribution has short tails is called “platykurtic”.
Kurtosis is another measure of the shape of a frequency curve. It is a Greek word, which means
bulginess. While skewness signifies the extent of asymmetry, kurtosis measures the degree of
peakedness of a frequency distribution. Karl Pearson classified curves into three types on the basis
of the shape of their peaks. These are mesokurtic, leptokurtic and platykurtic.
291
It can be seen that mesokurtic curve is neither too much flattened nor too much peaked. In fact,
this is the frequency curve of a normal distribution. Leptokurtic curve is a more peaked than the
normal curve. In contrast, platykurtic is a relatively flat curve.
4
The coefficient of kurtosis as given by Karl Pearson is 2 = . In case of a normal distribution,
22
that is, mesokurtic curve, the value of 2 = 3 . If 2 turn out to be 2 3 , the curve is called a
leptokurtic curve and is more peaked than the normal curve. Again, when 2 3 , the curve is
called a platykurtic curve and is less peaked than the normal curve.
Another measure of kurtosis is based on both quartiles and percentiles and is given by the following
Using percentiles and quartiles, the degree of flattening of the distribution, is an indicator of the
(Q3 − Q1 )
shape of this distribution. The coefficient of kurtosis. is calculated as C = .
2(P90 − P10 )
The fourth moment of the standard normal random variable is 3, and the kurtosis compares the
x−
4
292
The measure of kurtosis is very helpful in the selection of an appropriate average. For example,
for normal distribution, mean is most appropriate; for a leptokurtic distribution, median is most
appropriate; and for platykurtic distribution, the quartile range is most appropriate.
Example
From the data given below, calculate the percentile coefficient of kurtosis.
Wage 50-60 60-70 70-80 80-90 90-100 100-110 110-120
Workers 10 14 18 24 16 12 6
Solution
Let us compute the cumulative frequencies of the data set as follows:
Wage 50-60 60-70 70-80 80-90 90-100 100-110 110-120
Workers 10 14 18 24 16 12 6
cf 10 24 42 66 82 94 100
1(n + 1) 1(100 + 1)
m1 = th = th = 25.25th , which lies in the 70-80 class
4 4
L − L1
Q1 = L1 + 2 (m − c ) = 70 + 80 − 70 (25.25 − 24) = 70.69
f1 18
3(n + 1) 3(100 + 1)
m3 = th = th = 75.75th , which falls in 90 - 100 class interval
4 4
L − L1
Q3 = L1 + 2 (m − c ) = 90 + 100 − 90 (75.75 − 66) = 96.09
f1 16
10(n + 1)th (100 + 1)
P10 = = th = 10.10th . This falls in 60 - 70 class interval
100 10
L − L1
P10 = L1 + 2 (m − c ) = 60 + 70 − 60 (10.10 − 10) = 60.07
f1 14
90(n + 1)th 9(100 + 1)
P90 = = th = 90.90th This falls in 100 - 110 class interval
100 10
L − L1
P90 = L1 + 2 (m − c ) = 100 + 110 − 100 (90.90 − 82) = 107.41
f1 12
(Q3 − Q1 ) (96.09 − 70.69) = 0.268
Now, Ku = =
2(P90 − P10 ) 2(1007.41 − 60.07)
It will be seen that the distribution is very close to normal distribution as the value of K is 0.268,
which is extremely close to 0.263.
293
Exercises
1. Forty randomly selected students were asked the number of pairs of sneakers they owned. Let
X = the number of pairs of sneakers owned. The results are as follows:
x 1 2 3 4 5 6 7
f 2 5 8 12 12 0 1
a. Find the first quartile.
b. Find the median.
c. Find the third quartile.
d. Find the 40th percentile.
e. Find the 90th percentile.
2. The mean score on a standardized mathematics examination is 49.6 and the standard deviation
is 1.35. Dede is told that the z-score of his exam score is −1.19.
a. Is Dede’s score above average or below average?
b. What was Dede’s actual score on the examination?
4. Calculate the first four moments about the mean from the following data. Also calculate the
values of β1 and β2
Summary
The average value cannot adequately describe a set of observations, unless all the observations are
the same. It is necessary to describe the variability or dispersion of the observations. Again, in two
or more distributions the central value may be the same but still there can be wide disparities in
the formation of distribution.
Furthermore, two distributions may have the same mean and standard deviation but may differ
widely in their overall appearance in terms of symmetry and skewness. To distinguish between
different types of distributions, we may use:
• Measures of skewness and kurtosis
• The percentile rank and z-score of a measurements
• The three quartiles
• The five-number summary in the box plot
We therefore need the measures of dispersion to determine the differences and similarities of data
sets and make the necessary judgements.
294
UNIT 5 SECTION 6 TEACHING DISPERSION, SPREAD AND VARIABILITY
Introduction
The terms variability, spread, and dispersion are synonyms, and refer to how spread out a
distribution is. There are four frequently used measures of variability, namely range, interquartile
range, variance, and standard deviation.
An important characteristic of any set of data is the variation in the data. In some data sets, the
data values are concentrated closely near the mean; in other data sets, the data values are more
widely spread out from the mean. The most common measure of variation, or spread, is the
standard deviation. The standard deviation is a number that measures how far data values are from
their mean.
The various measures of central value give us one single figure that represents the entire data. But
the average alone cannot adequately describe a set of observations, unless all the observations are
the same. It is necessary to describe the variability or dispersion of the observations. In two or
more distributions the central value may be the same but still there can be wide disparities in the
formation of distribution. Measures of dispersion help us in studying this important characteristic
of a distribution.
Our learning indicators are:
1. Determine the relative variability of two distributions
2. Compute the range
3. Compute the inter-quartile range
4. Compute the variance in the population
5. Estimate the variance from a sample
6. Compute the standard deviation from the variance
295
Some important definitions of dispersion
1. Dispersion is the measure of the variation of the items (A.L. Bowley)
2. The degree to which numerical data tend to spread about an average value is called the variation
of dispersion of the data (Spiegel)
3. Dispersion or spread is the degree of the scatter or variation of the variable about a central value
(Brooks & Dick)
4. The measurement of the scatterness of the mass of figures in a series about an average is called
measure of variation or dispersion (Simpson & Kajka)
It is clear that dispersion (also known as scatter, spread or variation) measures the extent to which
the items vary from some central value. Since measures of dispersion give an average of the
differences of various items from an average, they are also called averages of the second order. An
average is more meaningful when it is examined in the light of dispersion.
For example, if the average wage of the workers of factory A is Gh₵3885 and that of factory B
Gh₵3900, we cannot necessarily conclude that the workers of factory B are better off because in
factory B there may be much greater dispersion in the distribution of wages.
2. Another purpose of measuring dispersion is to determine nature and cause of variation in order
to control the variation itself. In matters of health variations in body temperature, pulse beat and
blood pressure are the basic guides to diagnosis. Prescribed treatment is designed to control their
variation. In industrial production efficient operation requires control of quality variation the
causes of which are sought through inspection is basic to the control of causes of variation. In
social sciences a special problem requiring the measurement of variability is the measurement of
‘inequality’ of the distribution of income or wealth.
3. Measures of dispersion enable a comparison to be made of two or more series with regard to
their variability. The study of variation may also be looked upon as a means of determining
uniformity of consistency. A high degree of variation would mean little uniformity or consistency
whereas a low degree of variation would mean great uniformity or consistency.
296
4. Many powerful analytical tools in statistics such as correlation analysis, the testing of
hypothesis, analysis of variance, the statistical quality control, regression analysis is based on
measures of variation of one kind or another.
The Range
The range is the simplest measure of variability to calculate, and one you have probably
encountered many times in your life. The range is simply the highest score minus the lowest score.
The range is a measure of variability because it indicates the size of the interval over which the
data points are distributed. A smaller range indicates less variability (less dispersion) among the
data, whereas a larger range indicates the opposite.
The range of a data set is the number R defined by the formula, R = x max − x min , where xmax is the
largest measurement in the data set and xmin is the smallest.
Note that the range is the only measure of dispersion that has on average the reference point.
• When the data are not grouped the total amplitude is a difference between the largest and
the smallest observed value: R = Maximum X - X min.
• When data are grouped without class intervals still have R = maximum X - X min.
• With class intervals the total amplitude is the difference between the upper boundary of the
last class and the lower boundary of the first class. Then Range is Lmax–Lmin.
The full range is inconvenient and only considers the two extreme values of the series, neglecting
the set of intermediate values. We makes use of the full range when we want to determine the
amplitude of the temperature in a day, quality control or as a quick calculation measure without
much accuracy.
Example
297
1. What is the range of the following group of numbers: 10, 2, 5, 6, 7, 3, 4?
Solution
The highest number is 10, and the lowest number is 2
So 10 - 2 = 8
The range is 8.
2. Here is a dataset with 10 numbers: 99, 45, 23, 67, 45, 91, 82, 78, 62, 51.
What is the range?
Solution
The highest number is 99 and the lowest number is 23, so 99 -
23 equals 76; the range is 76.
Solution
For Data Set I the maximum is 43 and the minimum is 38, so the range is R = 43 − 38 = 5.
For Data Set II the maximum is 47 and the minimum is 33, so the range is R = 47 − 33 = 14.
4. Find the range for the values 40, 45, 48, 62 and 70.
Solution
R = 70-40 = 30
5. Find the range for the following three sets of data:
Set 1: 05, 15, 15, 05, 15, 05, 15, 15, 15, 15
Set 2: 8, 7, 15, 11, 12, 5, 13, 11, 15, 9
Set 3: 5, 5, 5, 5, 5, 5, 5, 5, 5, 5
Solution
In each of these three sets, the highest number is 15 and the lowest number is 5.
Since the range is the difference between the maximum value and the minimum value of the data,
it is 10 in each case.
But the range fails to give any idea about the dispersal or spread of the series between the highest
and the lowest value.
This becomes evident from the above data.
Solution
Here, the upper limit of the highest class is 120 and the lower limit of the lowest class is 20. Hence,
the range is 120 - 20 = 100.
Note that the range is not influenced by the frequencies. Symbolically, the range is calculated b
the formula L - S, where L is the largest value and S is the smallest value in a distribution.
9. Calculate the coefficient of range separately for the two sets of data given below:
Set 1: 8, 10, 20, 9, 15, 10, 13, 28
Set 2: 30, 35, 42, 50, 32, 49, 39, 33
Solution
It can be seen that the range in both the sets of data is the same:
Set 1 28 - 8 = 20
Set 2 50 - 30 = 20
28 − 8
Coefficient of range in Set 1 is: CR = = 0.55
28 + 8
50 − 30
Coefficient of range in set 2 is: CR = = 0.25
50 + 30
Despite these limitations of the range, it is mainly used in situations where one wants to quickly
have some idea of the variability or' a set of data.
When the sample size is very small, the range is considered quite adequate measure of the
variability. Thus, it is widely used in quality control where a continuous check on the variability
of raw materials or finished products is needed.
The range is also a suitable measure in weather forecast. The meteorological department uses the
range by giving the maximum and the minimum temperatures. This information is quite useful to
the common man, as he can know the extent of possible variation in the temperature on a particular
day.
Many times the interquartile range is reduced in the form of semi-interquartile range or quartile
deviation, expressed as (Q3 – Ql)/2.
When quartile deviation is small, it means that there is a small deviation in the central 50 percent
items. In contrast, if the quartile deviation is high, it shows that the central 50 percent items have
a large variation. It may be noted that in a symmetrical distribution, the two quartiles, that is, Q3
and QI are equidistant from the median or M-QI = Q3-M.
However, this is seldom the case as most of the education, business and economic data are
asymmetrical. But, one can assume that approximately 50 percent of the observations are contained
in the interquartile range. It may be noted that interquartile range or the quartile deviation is an
absolute measure of dispersion. It can be changed into a relative measure of dispersion as Q3 –
Q1/Q3 +Q1.
The computation of a quartile deviation is very simple, involving the computation of upper and
lower quartiles. As the computation of the two quartiles has already been explained in the
preceding chapter, it is not attempted here.
300
Merits of quartile deviation
The following merits are entertained by quartile deviation:
1. As compared to range, it is considered a superior measure of dispersion.
2. In the case of open-ended distribution, it is quite suitable.
3. Since it is not influenced by the extreme values in a distribution, it is particularly suitable in
highly skewed or erratic distributions.
In view of the above-mentioned limitations, the interquartile range or the quartile deviation has a
limited practical utility.
Symbolically, MD =
| d | for discrete data and MD =
f | d | for grouped data, where MD is
n n
the mean deviation, f is the frequency, |x| is the deviation of an item from the mean ignoring
positive and negative signs, n = the total number of observations.
Example
Size 2-4 4-6 6-8 8-10
Frequency 20 40 30 10
Solution
x=
fx
=
560
= 5.6
n 100
301
4-6 40 5 200 -0.6 24
6-8 30 7 210 1.4 42
8-10 10 9 90 -3.4 34
Total 100 560 152
MD =
f | d | = 152 = 1.52
n 100
Merits of the mean deviation
1. A major advantage of mean deviation is that it is simple to understand and easy to calculate.
2. It takes into consideration each and every item in the distribution. As a result, a change in the
value of any item will have its effect on the magnitude of mean deviation.
3. The values of extreme items have less effect on the value of the mean deviation.
4. As deviations are taken from a central value, it is possible to have meaningful comparisons of
the formation of different distributions.
In view of these limitations, it is seldom used in business studies. A better measure known as the
standard deviation is more frequently used.
The Variance
Variability can also be defined in terms of how close the scores in the distribution are to the middle
of the distribution. Using the mean as the measure of the middle of the distribution, the variance
is defined as the average squared difference of the scores from the mean.
This mean of the squared deviations is known as the variance. It may be noted that this variance is
described by different terms that are used interchangeably, namely the variance of the distribution
X the variance of X or the variance of the distribution.
To calculate the standard deviation, we need to calculate the variance first. The variance is the
average of the squares of the deviations (the x − x values for a sample, or the x − values for a
population).
302
The symbol 2 represents the population variance and the population standard deviation σ is the
square root of the population variance. The symbol s 2 represents the sample variance, and the
sample standard deviation s is the square root of the sample variance.
If the numbers come from a census of the entire population and not a sample, when we calculate
the average of the squared deviations to find the variance, we divide by N, the number of items in
the population.
If the data are from a sample rather than a population, when we calculate the average of the squared
deviations, we divide by n − 1 , one less than the number of items in the sample.
Note: Sample variance has different units from data. For example, if the units in a data set are
inches, the new units would be inches squared, or square inches. It is thus primarily of theoretical
importance and not considered for further statistical analysis.
s2 =
n −1 n −1
(
fx 2 )
f (x − x ) , or s 2 = fx − n , where the denominator is n − 1 is the sample size
2
2
s2 =
n −1 n −1
minus 1.
Although the first sets of formulas in each case look less complicated than the second, the latter
are easier to use in hand computations, and are called a shortcut formula.
2
= or 2
= , where the denominator is N is the number of items in
N N
the population and f represents the frequency with which a value appears.
Note that the denominator in the fraction is the full number of observations, not that number
reduced by one, as is the case with the sample standard deviation. Since most data sets are samples,
we will always work with the sample standard deviation and variance.
303
We determine the best estimate of the measures of centre by finding the mean of the grouped data
fx fx
fx fx
2 2
2
Note that in many real-life situations the most important statistical issues have to do with
comparing the means and standard deviations of data sets.
Example
1. Find the variance of the population data 20, 15, 19, 24, 16, 14.
Solution
=
x = 108 = 18
N 6
x (x-μ) (x-μ)2
20 2 2
15 -3 9
19 1 1
24 6 36
16 -2 4
14 -4 16
108 70
(x − )
2
70
2
= = 11.67 =
N 6
2. Assume the scores 1, 2, 4, and 5 were sampled from a larger population. Estimate the variance
in the population.
Solution
x=
x = 12 = 3
n 4
s2 =
(x − x )2 = (1 − 3)2 + (2 − 3)2 + (4 − 3)2 + (5 − 3)2
n −1 4 −1
s2
=
(1 − 3)
+ (2 − 3) + (4 − 3) + (5 − 3)
2 2
=
4 +1+1+ 4
2 2
4 −1 3
4 + 1 + 1 + 4 10
s2 = = = 3.333
3 3
304
3. Estimate the variance of 46, 37, 40, 33, 42, 36, 40, 47, 34, 45
Solution
x=
x = 400 = 40
n 10
s 2
=
( x − x)
2
=
(46 − 40) + (37 − 40) + (40 − 40) + (36 − 40) + ... + (45 − 40)
2 2 2 2 2
n −1 10 − 1
s 2
=
(x − x ) 2
=
(6) 2
+ (− 3) + (0) + (− 4) + ... + (5)
2 2 2 2
n −1 10 − 1
s2 =
( x − x )2 =
36 + 9 + 0 + 36 + ... + 25 224
= = 24. 8
n −1 9 9
4. Use the short-cut formula to estimate the variance of 1.90, 3.00, 2.53, 3.71, 2.12, 1.76, 2.71,
1.39, 4.00, and 3.33.
Solution
x = 1.90 + 3.00 + 2.53 + 3.71+ 2.12 + 1.76 + 2.71+ 1.39 + 4.00 + 3.33 = 26.45
x 2
= 1.90 2 + 3.00 2 + 2.532 + 3.712 + 2.12 2 + 1.76 2 + 2.712 + 1.39 2 + 4.00 2 + 3.332 = 76.7321
( x ) 2
26.452
x 2
−
n
76.7321−
10
s2 = =
n −1 10 − 1
26.45 2
76.7321 −
s2 = 10 = 6.77185 = 7.52427
10 − 1 9
Although the variance is a measure of dispersion, the unit of its measurement is (points). If a
distribution relates to income of families then the variance is (Gh₵)2 and not cedis.
Similarly, if another distribution pertains to marks of students, then the unit of variance is (marks)2.
To overcome this inadequacy, the square root of variance is taken, which yields a better measure
of dispersion known as the standard deviation.
305
based on deviations around the mean and its basic formula can be translated as the square root of
the arithmetic mean of the squares of deviations.
The standard deviation provides a measure of the overall variation in a data set
The standard deviation is always positive or zero. The standard deviation is small when the data
are all concentrated close to the mean, exhibiting little variation or spread. The standard deviation
is larger when the data values are more spread out from the mean, exhibiting more variation.
Suppose that we are studying the amount of time customers wait in line at the checkout at
supermarket A and supermarket B and the average wait time at both supermarkets is five minutes.
At supermarket A, the standard deviation for the wait time is two minutes; at supermarket B the
standard deviation for the wait time is four minutes.
Because supermarket B has a higher standard deviation, we know that there is more variation in
the wait times at supermarket B. Overall, wait times at supermarket B are more spread out from
the average; wait times at supermarket A are more concentrated near the average.
The standard deviation can be used to determine whether a data value is close to or far from the
mean
Suppose that Rosa and Bintu both shop at supermarket A. Rosa waits at the checkout counter for
seven minutes and Bintu waits for one minute.
At supermarket A, the mean waiting time is five minutes and the standard deviation is two minutes.
The standard deviation can be used to determine whether a data value is close to or far from the
mean.
306
(i) About 68 percent of the values will fall within: +1 standard deviation from the mean.
(ii) About 95 percent of the values will fall within +2 standard deviations from the mean.
(iii) About 99 percent of the values will fall within + 3 standard deviations from the mean.
If the numbers belong to a population, in symbols a deviation is x − . For sample data, in symbols
a deviation is x − x .
The procedure to calculate the standard deviation depends on whether the numbers are the entire
population or are data from a sample. The calculations are similar, but not identical. Therefore the
symbol used to represent the standard deviation depends on whether it is calculated from a
population or a sample.
The lower case letter ‘s’ represents the sample standard deviation and the Greek letter σ (sigma,
lower case) represents the population standard deviation. If the sample has the same characteristics
as the population, then ‘s’ should be a good estimate of σ.
The standard deviation, s or σ, is either zero or larger than zero. When the standard deviation is
zero, there is no spread; that is, all the data values are equal to each other.
The standard deviation is small when the data are all concentrated close to the mean, and is larger
when the data values show more variation from the mean.
When the standard deviation is a lot larger than zero, the data values are very spread out about the
mean; outliers can make s or σ very large.
The standard deviation, when first presented, can seem unclear. By graphing your data, we can get
a better ‘feel’ for the deviations and the standard deviation.
We will find that in symmetrical distributions, the standard deviation can be very helpful but in
skewed distributions, the standard deviation may not be much help. The reason is that the two sides
of a skewed distribution have different spreads.
307
In a skewed distribution, it is better to look at the first quartile, the median, the third quartile, the
smallest value, and the largest value. Because numbers can be confusing, always graph your data
in a histogram or a box plot.
(x − x ) f (x − x )
2 2
(x − ) f (x − )
2 2
Example
1. In a class five, the teacher was interested in the average age and the sample standard deviation
of the ages of her students. The following data are the ages for a SAMPLE of 20 class five pupils.
The ages are rounded to the nearest half year: 9; 9.5; 9.5; 10; 10; 10; 10; 10.5; 10.5; 10.5; 10.5;
11; 11; 11; 11; 11; 11; 11.5; 11.5; 11.5.
Calculate the average age and the sample standard deviation of the ages of her pupils.
Solution
x=
x = 9 + 9.5(2) + 10(4) + 10.5(4) + 11(6) + 11.5(4) = 10.525
n 20
x f x-mean (x-mean)2 f(x-mean)2
9 1 9 – 10.525 = –1.525 (–1.525)2 = 2.325625 1 × 2.325625 = 2.325625
9.5 2 9.5 – 10.525 = –1.025 (–1.025)2 = 1.050625 2 × 1.050625 = 2.101250
10 4 10 – 10.525 = –0.525 (–0.525)2 = 0.275625 4 × 0.275625 = 1.1025
10.5 4 10.5 – 10.525 = –0.025 (–0.025)2 = 0.000625 4 × 0.000625 = 0.0025
11 6 11 – 10.525 = 0.475 (0.475)2 = 0.225625 0.225625 = 1.35375
11.5 3 11.5 – 10.525 = 0.975 (0.975)2 = 0.950625 3 3 × 0.950625 = 2.851875
Total 20 9.7375
(x − x )
2
9.7375
s= = = 0.5125
n −1 20 − 1
2. Find the standard deviation of the population data 20, 15, 19, 24, 16, 14.
Solution
308
=
x = 108 = 18
N 6
x (x-μ) (x-μ)2
20 2 2
15 -3 9
19 1 1
24 6 36
16 -2 4
14 -4 16
108 70
(x − )
2
70
2
= = = 11.67
N 6
( x − )2 70
= = 2
=
= 11.67 = 3.42
N 6
Explanation of the standard deviation calculation shown in the table
The deviations show how spread out the data are about the mean. The data value 11.5 is farther
from the mean than is the data value 11 which is indicated by the deviations 0.97 and 0.47.
A positive deviation occurs when the data value is greater than the mean, whereas a negative
deviation occurs when the data value is less than the mean. The deviation is –1.525 for the data
value nine.
If you add the deviations, the sum is always zero. So you cannot simply add the deviations to get
the spread of the data. By squaring the deviations, you make them positive numbers, and the sum
will also be positive. The variance, then, is the average squared deviation.
The variance is a squared measure and does not have the same units as the data. Taking the square
root solves the problem. The standard deviation measures the spread in the same units as the data.
When it becomes clear that the actual mean would turn out to be in fraction, calculating deviations
from the mean would be too cumbersome. In such cases, an assumed mean is used and the
deviations from it are calculated. In doing so, while midpoint of any class can be taken as an
assumed mean, it is advisable to choose the mid-point of that class that would make calculations
least cumbersome.
Also, just as we could not find the exact mean, neither can we find the exact standard deviation.
Remember that standard deviation describes numerically the expected deviation a data value has
from the mean. In simple English, the standard deviation allows us to compare how “unusual”
individual data is compared to the mean.
Example
1. Find the standard deviation for the data in Table below
Solution
Class f x x2 fx fx2
0-2 1 1 1 1 1
3-5 6 4 16 24 96
6-8 10 7 49 70 490
9-11 7 10 100 70 700
12-14 0 13 169 0 0
15-17 2 16 256 32 512
Total 26 197 1,799
x=
fx = 197 = 7.577
f 26
310
fx fx
2 2 2
1,799 197
s= − = −
f f
26 26
2
1,799 197
s= − = 69.192 − (7.577)
2
26 26
s = 69.192 − (7.577) = 69.192 − 42.257 = 26.935 = 3.5
2
For this data set, we have the mean, x = 7.577 and the standard deviation, s = 3.5 . This means
that a randomly selected data value would be expected to be 3.5 units from the mean. If we look
at the first class, we see that the class midpoint is equal to one. This is almost two full standard
deviations from the mean since 7.58 − 3.5 − 3.5 = 0.58 .
Solution
Marks Frequency f Midpoints x d=Deviation/c = d/10 fd fd2
0-10 1 5 -5 -5 25
10-20 3 15 -4 -12 48
20-30 6 25 -3 -18 54
30-40 10 35 -2 -20 40
40-50 12 45 -1 -12 12
50-60 11 55 0 0 0
60-70 6 65 1 6 6
70-80 3 75 2 6 12
80-90 2 85 3 6 18
90-100 1 95 4 4 16
Total 55 -45 231
fd fd
2 2
= − = 231 − − 45 = 18.8
N N 55 55
311
The standard deviation is an absolute measure of dispersion as it measures variation in the same
units as the original data. As such, it cannot be a suitable measure while comparing two or more
distributions.
For this purpose, we should use a relative measure of dispersion. One such measure of relative
dispersion is the coefficient of variation, which relates the standard deviation and the mean such
that the standard deviation is expressed as a percentage of mean. Thus, the specific unit in which
the standard deviation is measured is done away with and the new unit becomes percent.
Example
In a small business firm, two typists are employed-typist A and typist B. Typist A types out, on an
average, 30 pages per day with a standard deviation of 6. Typist B, on an average, types out 45
pages with a standard deviation of 10. Which typist shows greater consistency in his output?
Solution
6
Coefficient of variation for A = 100 = 100 = 20%
30
10
Coefficient of variation for B = 100 = 100 = 22.2%
45
Comment
These calculations clearly indicate that although typist B types out more pages, there is a greater
variation in his output as compared to that of typist A.
Alternatively, we can say though typist A’s daily output is much less, he is more consistent than
typist B.
This usefulness of the coefficient of variation becomes clear in comparing two groups of data
having different means.
312
The empirical rule tells us that for a bell-shaped distribution, it is unusual for an observation to fall
more than 3 standard deviations from the mean. An alternative criterion for identifying potential
outliers uses the standard deviation. An observation in a bell-shaped distribution is regarded as a
potential outlier if it falls more than 3 standard deviations from the mean.
The z -score allows us to quickly tell how surprising or extreme an observation is. The z -score
converts an observation (regardless of the observation’s unit of measurement) to a common scale
of measurement, which allows comparisons.
The z-score for an observation is the number of standard deviations that it falls from the mean. A
positive z-score indicates the observation is above the mean. A negative z-score indicates the
observation is below the mean.
Another way to locate a particular observation x in a data set is to compute its distance from the
mean in units of standard deviation. The z-score of an observation x is the number z given by the
computational formula
x−x X −
z= or z = , according to whether the data set is a sample or is the entire population.
s
The formulas in the definition allow us to compute the z-score when x is known. If the z-score is
known then x can be recovered using the corresponding inverse formulas
x = x − sz or X = + z .
The z-score indicates how many standard deviations an individual observation x is from the centre
of the data set, its mean. If z is negative then x is below average. If z is 0 then x is equal to the
average. If z is positive then x is above average.
313
The variable Z = (x - x )/s or (x - μ)/μ, which measures the deviation from the mean in units of the
standard deviation, is called a standardised variable. Since both the numerator and the denominator
are in the same units, a standardised variable is independent of units used.
If deviations from the mean are given in units of the standard deviation, they are said to be
expressed in standard units or standard scores. Through this concept of standardised variable,
proper comparisons can be made between individual observations belonging to two different
distributions whose compositions differ.
The standard deviation is useful when comparing data values that come from different data sets.
If the data sets have different means and standard deviations, then comparing the data values
directly can be misleading. For each data value, calculate how many standard deviations away
from its mean the value is.
Example
1. Find the z-scores for all ten observations in the GPA sample data 1.90 3.00 2.53 3.71 2.12 1.76
2.71 1.39 4.00 3.33.
Solution
For these data x = 2.645 and s = 8.8674. The first observation x = 1.9 in the data set has z-score
x − x 1.9 − 2.645
z= = = −0.8589, which means that x = 1.90 is 0.8589 standard deviations
s 0.8674
below the sample mean.
x − x 3.00 − 2.645
z= = = 0.4093, which means that x = 3.00 is 0.4093 standard deviations
s 0.8674
above the sample mean.
Repeating the process for the remaining observations gives the full set of z-scores −0.86; 0.41;
−0.13; 1.23; −0.61; −1.02; 0.07; −1.45; 1.56; 0.79.
2. Suppose the mean and standard deviation of the GPAs of all currently registered students at a
college are = 2.74 and = 0.50 .
The z-scores of the GPAs of two students, Antonio and Beatrice, are z = −0.62 and z = 1.28 ,
respectively. What are their GPAs?
Solution
314
Using the formula x = x _ sz or X = + z of z-scores we compute the GPAs as follows:
Antonion: x = + z = 2.70 + (−0.62)(0.50) = 2.39
Beatrice : x = + z = 2.70 + (1.28)(0.50) = 3.34
3. A student has scored 68 marks in Statistics for which the average marks were 60 and the standard
deviation was 10. In the paper on Mathematics, she scored 74 marks for which the average marks
were 68 and the standard deviation was 15. In which paper, Statistics or Mathematics, was she
relative standing higher?
Solution
The standardised variable Z = (x - x ) ÷ s measures the deviation of x from the mean x in terms of
standard deviation s.
For Statistics, Z = (68 - 60) ÷ 10 = 0.8
For Mathematics, Z = (74 - 68) ÷ 15 = 0.4
Since the standard score is 0.8 in Statistics as compared to 0.4 in Mathematics, her relative standing
was higher in Statistics.
x=
x = 40 = 8 =
(x − x ) 2
= 2.61
n 5 and n
x − x 6−8 x − x 7 −8
z6 = = = −0.77 z7 = = = −0.38
s 2.61 s 2.61
x − x 5−8 x − x 6−8 x − x 12 − 8
z5 = = = −1.15 z10 = = = 0.77 z12 = = = 1.53
s 2.61 s 2.61 s 2.61
Thus the standard scores for 6, 7, 5, 10 and 12 are -0.77, -0.38, -1.15, 0.77 and 1.53, respectively.
5. Let’s consider air pollution data for the European Union (EU). The Energy-EU data file 8 on
the text CD contains data on per capita carbon dioxide (CO2) emissions, in metric tons, for the 27
nations in the EU. The mean was 8.3 and the standard deviation was 3.6.
a. How many standard deviations from the mean was the CO 2 value of 21.3 for Luxembourg?
b. The CO2 value for the United States was 18.9. According to the three-standard-deviation
criterion, is the United States an outlier on carbon dioxide emissions relative to the EU?
Solution
a). Since the mean is 8.3 and s = 3.6 inches , the z -score for the observation of
315
x − x 21.3 − 8.3
21.3 is z = = = 3.6
s 3.6
The carbon dioxide emission (per capita) for Luxembourg is 3.6 standard deviations above the
mean. By the 3 standard deviation criterion, this is a potential outlier. Since it is well removed
from the rest of the data, we would regard it as an actual outlier. However, Luxembourg has only
350,000 people, so in terms of the amount of pollution it is not a major polluter in the EU.
x − x 18.9 − 8.3
b). The z -score for the CO 2 value of the United States is z = = = 2.9
s 3.6
Although the 3 standard deviation rule fails to flag the United States as an outlier relative to EU
nations, the value of 2.9 is close enough to 3 to garner some attention. Furthermore, because of the
relatively large size of the U.S. population, a z -score this close to 3 indicates that the U.S. is a
significant contributor to overall CO2 emission.
Insight
The z -scores of 3.6 and 2.9 are positive. This indicates that the observations are above the mean,
because an observation above the mean has a positive z -score. In fact, these large positive z -
scores tell us that Luxembourg and the United States have very high CO2 emissions compared to
the other nations. The z -score is negative when the observation is below the mean. For instance,
France has a CO2 value of 5.7, which is below the mean of 8.3 and has a z -score of -0.7.
Example
Two students, John and Ali, from different high schools, wanted to find out who had the highest
GPA when compared to their schools. Which student had the highest GPA when compared to his
school?
Student School GPA Mean School SD
John 2.85 3.00 0.7
Ali 77 80 10
Solution
For each student, determine how many standard deviations of his GPA is away from the average,
for his school. Pay careful attention to signs when comparing and interpreting the answer.
x − 2.85 − 3.00
s John = = = −0.21
0.7
x − 77 − 80
s Ali = = = −0.30
10
John has the better GPA when compared to his school because his GPA is 0.21 standard deviations
below his school's mean while Ali's GPA is 0.3 standard deviations below his school's mean.
316
John's z-score of –0.21 is higher than Ali's z-score of –0.3. For GPA, higher values are better, so
we conclude that John has the better GPA when compared to his school.
Activities 2
1. Find the z-score of each measurement in the following sample data set−5 6 2 −1 0.
Answer
−1.3, 1.39, 0.4, −0.35, −0.11
2. For the population 0, 0, 2, 2, compute each of the following:
a. The population mean μ.
b. The population variance σ2.
c. The population standard deviation σ.
d. The z-score for every value in the population data set.
Answer
a. 1.
b. 1.
c. 1.
d. z = −1 for x = 0, z = 1 for x = 2.
Answer
x = 16
4. A measurement x in a population with mean μ = 2.3 and standard deviation σ =
1.3 has z-score z = 2. Find x.
Answer
x = 4.9
5. The weekly sales for the last 20 weeks in a kitchen appliance store for an electric automatic rice
cooker are 20, 15, 15, 19, 15, 19, 17, 15, 14, 12, 16, 15, 14, 13, 16, 16, 18, 9, 18, 15.
a. Find the percentile rank of 15.
b. If the sample accurately reflects the population, then what percentage of weeks would an
inventory of 15 rice cookers be adequate?
Answer
a. 55.
b. 55.
317
Activities
1. Find the range, the variance, and the standard deviation for the sample of ten IQ scores randomly
selected from a school for academically gifted students: 132, 139, 162, 147, 133, 160, 145, 150,
148 and 153
Answers
R = 30, s2 = 103.2, s = 10.2.
The interquartile range
Another name for fourths is hinges. The difference between the upper and lower fourths (called
the fourth-spread) should be close but not necessarily equal to the interquartile range since the
quartiles are not necessarily equal to the fourths. Quartiles or fourths are often used when the
distribution is skewed or outliers are expected.
The interquartile range is a number that indicates the spread of the middle half or the middle 50%
of the data. It is the difference between the third quartile (Q3) and the first quartile (Q1).
IQR = Q3 – Q1.
1. The quartile deviation has the advantage the fact that it is an easy measure to calculate and
interpret. Besides, is not affected by extreme, large or small values and is recommended, therefore,
when between the data contained extreme values are not considered representative.
2. The quartile deviation should be used preferably when the measure of central tendency is the
median.
3. It is a measure insensitive to the distribution of smaller items that Q1, between Q1 and Q3 and
higher than Q3.
Example
For the values 40, 45, 48, 62 and 70, find the quartile deviation.
Solution
Q1 = (45 + 40) / 2 = 42.5 and Q3 = (70 + 62) / 2 = 66
SQR = (66 to 42.5) / 2 = 11.75
318
How much the statistic varies from one sample to another is known as the sampling variability of
a statistic. We typically measure the sampling variability of a statistic by its standard error.
The standard error of the mean or the standard error is a special standard deviation and is known
as the standard deviation of the sampling distribution of the mean.
The notation for the standard error of the mean is s = , where σ is the standard deviation of
n
the population and n is the size of the sample.
Lorenz Curve
This measure of dispersion is graphical. It is known as the Lorenz curve named after Dr. Max
Lorenz. It is generally used to show the extent of concentration of income and wealth.
The Lorenz curve is a simple graphical device to show the disparities of distribution in any
phenomenon. It is, used in business and economics to represent inequalities in income, wealth,
production, savings, and so on.
319
The straight line AB is a line of equal distribution, whereas AEB shows complete inequality. Curve
ACB and curve ADB are the Lorenz curves. As curve ACB is nearer to the line of equal
distribution, it has more equitable distribution of income than curve ADB.
Assuming that these two curves are for the same company, this may be interpreted in a different
manner. Prior to taxation, the curve ADB showed greater inequality in the income of its employees.
After the taxation, the company’s data resulted into ACB curve, which is closer to the line of equal
distribution. In other words, as a result of taxation, the inequality has reduced.
If the Lorenz curve is close to the line of equal distribution, then it implies that the dispersion is
much less. If, on the contrary, the Lorenz curve is farther away from the line of equal distribution,
it implies that the dispersion is considerably wide.
The first encounters one has to data are through graphical displays and numerical summaries. A
well-known adage is that “a picture is worth a thousand words.” This saying proves true when it
comes to presenting statistical information in a data set. There are many effective ways to present
data graphically. The graphical tools are among the most commonly used and are relevant to the
subsequent presentations o statistics.
The goal is to find an elegant method for this presentation that is at the same time both objective
and informative---making clear with a few lines or a few numbers the salient features of the data.
In this sense, data presentation is at the same time an art, a science, and an obligation to
impartiality. In the section, we will describe some of the standard graphical presentations of data.
Bar Chart
Because the human eye is good at judging linear measures and poor at judging relative areas, a bar
chart or bar graph is often preferable to pie charts as a way to display categorical data.
Two-way Tables
Relationships between two categorical variables can be shown through a two-way table (also
known as a contingency table, cross tabulation table or a cross classifying table).
Examples
1. In 1964, a Surgeon, General Dr. Luther Leonidas Terry published a landmark report saying that
smoking may be hazardous to health. This led to many influential reports on the topic, including
the study of the smoking habits of 5,375 high school children in Tucson in 1967. Here is a two-
way table summarizing some of the results.
Student smokes Student does not smoke Total
2 parents smoke 400 1380 1780
1 parent smokes 416 1823 2239
0 parents smoke 188 1168 1356
Total 4004 4371 5375
A two-way table with r rows and c columns is often called an r by c table (written r x c).
The totals along each of the rows and columns give the marginal distributions.
Stem-and-leaf diagrams
321
One way is to construct a stem and leaf diagram. The numbers in the tens place, start from 2
through 9, are the “stems,” and are arranged in numerical order from top to bottom to the left of a
vertical line.
The number in the units place in each measurement is a “leaf,” and is placed in a row to the right
of the corresponding stem. Thus the three leaves 5, 8 and 9 in the row headed with the stem 6
correspond to the three exam scores in the 60s, 65 (in the first row of data), 68 (in the third row),
and 69 (also in the third row).
The display is made even more useful for some purposes by rearranging the leaves in numerical
order. Either way, with the data reorganized certain information of interest becomes apparent
immediately.
There are two perfect scores; three students made scores under 60; most students scored in the 70s,
80s and 90s; and the overall average is probably in the high 70s or low 80s.
Example
Suppose 30 students in a statistics class took a test and made the following scores: 86, 90, 40, 80,
83, 58, 25, 70, 68, 77, 73, 69, 73, 73, 100, 76, 70, 78, 100, 90, 87, 90, 83, 97, 69, 71, 92, 93, 95,
74.
Solutions
How did the class do on the test?
A quick glance at the set of 30 numbers does not immediately give a clear answer. However, the
data set may be reorganized and rewritten to make relevant information more visible.
2 5
3
4 0
5 8
6 9 8 9
7 7 3 6 0 3 3 0 1 8 4
8 6 0 3 3 7
9 0 3 0 0 5 7 2
10 0 0
In this example the scores have a natural stem (the tens place) and leaf (the ones place). One could
spread the diagram out by splitting each tens place number into lower and upper categories. For
example, all the scores in the 80s may be represented on two separate stems, lower 80s and upper
80s:
322
8 0 3 3
8 6 7
The definitions of stems and leaves are flexible in practice. The general purpose of a stem and leaf
diagram is to provide a quick display of how the data are distributed across the range of their
values; some improvisation could be necessary to obtain a diagram that best meets that goal.
Note that all of the original data can be recovered from the stem and leaf diagram. This will not be
true of other types of graphical displays.
Frequency histograms
The stem and leaf diagram is not practical for large data sets, so we need a different, purely
graphical way to represent data.
The direction of the skewness is the direction of the longer of the two tails (left or right) of the
distribution.
No one choice for the number of bins is considered best. One possible choice for larger data sets
is Sturges’ formula to choose 1 + log n2 bins.
Example
323
Suppose 30 students in a statistics class took a test and made the following scores:
86, 90, 40, 80, 83, 58, 25, 70, 68, 77, 73, 69, 73, 73, 100, 76, 70, 78, 100, 90, 87, 90, 83, 97, 69,
71, 92, 93, 95, 74.
Solutions
For the 30 scores on the exam, it is natural to group the scores on the standard ten-point scale, and
count the number of scores in each group. Thus there are two 100s, seven scores in the 90s, six in
the 80s, and so on.
We then construct the frequency histogram by drawing for each group, or class, a vertical bar
whose length is the number of observations in that group.
For example, the bar labelled 100 is 2 units long, the bar labelled 90 is 7 units long, and so on.
While the individual data values are lost, we know the number in each class. This number is called
the frequency of the class, hence the name frequency histogram as follows:
The same procedure can be applied to any collection of numerical data. Observations are grouped
into several classes and the frequency (the number of observations) of each class is noted.
These classes are arranged and indicated in order on the horizontal axis (called the x-axis), and for
each group a vertical bar, whose length is the number of observations in that group, is drawn.
324
The resulting display is a frequency histogram for the data. Frequency Histogram is apparent,
particularly if you imagine turning the stem and leaf diagram on its side by rotating it a quarter
turn counter-clockwise.
In general, the definition of the classes in the frequency histogram is flexible. The general purpose
of a frequency histogram is very much the same as that of a stem and leaf diagram, to provide a
graphical display that gives a sense of data distribution across the range of values that appear.
Relative frequency histograms
In our previous example of the exam scores in a statistics class, five students scored in the 80s.
The number 5 is the frequency of the group labelled “80s.” Since there are 30 students in the entire
statistics class, the proportion who scored in the 80s is 5/30.
The number 5/30, which could also be expressed as 0.1 6 = 0.1667 0.16, or as 16.67% , is the
relative frequency of the group labelled “80s.”
Every group (the 70s, the 80s, and so on) has a relative frequency. We can thus construct a diagram
by drawing for each group, or class, a vertical bar whose length is the relative frequency of that
group.
For example, the bar for the 80s will have length 5/30 unit, not 5 units. The diagram is a relative
frequency histogram for the data. It is exactly the same as the frequency histogram except that the
vertical axis in the relative frequency histogram is not frequency but relative frequency.
325
The same procedure can be applied to any collection of numerical data. Classes are selected, the
relative frequency of each class is noted, the classes are arranged and indicated in order on the
horizontal axis, and for each class a vertical bar, whose length is the relative frequency of the class,
is drawn. The resulting display is a relative frequency histogram for the data. A key point is that
now if each vertical bar has width 1 unit, then the total area of all the bars is 1 or 100%.
Although the histogram and the Frequency Histogram have the same appearance, the relative
frequency histogram is more important for us. It is the relative frequency histogram that is used
repeatedly to represent data.
To see why this is so, reflects on what it is that you are actually seeing in the diagrams that quickly
and effectively communicates information to you about the data. It is the relative sizes of the bars.
The bar labelled “70s” takes up 1/3 of the total area of all the bars, and although we may not think
of this consciously, we perceive the proportion 1/3 in the figures, indicating that a third of the
grades were in the 70s.
326
The relative frequency histogram is important because the labelling on the vertical axis reflects
what is important visually: the relative sizes of the bars.
When the size n of a sample is small only a few classes can be used in constructing a relative
frequency histogram. Such a histogram might look something like the one in panel (a) of sample
size and relative frequency histograms.
If the sample size n were increased, then more classes could be used in constructing a relative
frequency histogram and the vertical bars of the resulting histogram would be finer, as indicated
in panel (b) of Sample Size and Relative Frequency Histograms.
For a very large sample the relative frequency histogram would look very fine, like the one in (c)
of sample size and relative frequency histograms.
If the sample size were to increase indefinitely then the corresponding relative frequency histogram
would be so fine that it would look like a smooth curve, such as the one in panel (d) of sample size
and relative frequency histograms.
327
It is common in statistics to represent a population or a very large data set by a smooth curve. It is
good to keep in mind that such a curve is actually just a very fine relative frequency histogram in
which the exceedingly narrow vertical bars have disappeared.
Because the area of each such vertical bar is the proportion of the data that lies in the interval of
numbers over which that bar stands, this means that for any two numbers a and b, the proportion
of the data that lies between the two numbers a and b is the area under the curve that is above the
interval (a, b) in the horizontal axis.
This is the area shown as ‘A Very Fine Relative Frequency Histogram’. In particular, the total area
under the curve is 1, or 100%.
Scatterplots
We now consider two dimensional data. The values of the first variable x1; x2; : : : ; xn are assumed
known and in an experiment and are often set by the experimenter. This variable is called the
independent, explanatory, predictor, descriptor or input variables and in a two dimensional
scatterplot of the data display its values on the horizontal axis.
The values y1; y2,…,yn, taken from observations with input x1; x2;….; xn are called the dependent,
response or target variable and its values are displayed on the vertical axis.
Activity 5
1. Describe one difference between a frequency histogram and a relative frequency histogram.
Solution
The vertical scale on the frequency histogram is the frequencies and on the relative frequency
histogram is the relative frequencies.
2. Construct a stem and leaf diagram, a frequency histogram, and a relative frequency histogram
for the following data set. For the histograms use classes 51–60, 61–70, and so on: 69, 70, 93, 53,
92, 85, 75, 70, 68, 88, 76, 70, 77, 85, 82, 82, 80, 96, 100, 85.
Solution
5 3
6 8 9
7 0 0 0 5 6 7
8 0 2 3 5 5 5 8
9 2 3 6
10 0
329
UNIT 6 BASIC PROBABILITY: LEARNING, TEACHING AND APPLYING
UNIT INTRODUCTION
Dear Students,
In the last unit, we learned basic concepts of statistics. In this unit we are going to learn the basic
concepts of probability, namely:
1. Fundamental counting principles
2. Basic concepts of probability
3. Dependent events with replacements and without replacements
4. Dependent events with conditional probabilities and tree diagrams
5. Conditional probability and Bayes theorem
6. Applications of permutations and combinations in probability
331
The fundamental counting principle states that if there are n( A) outcomes in
event A and n(B ) outcomes in event B, then there are n( A) n(B ) outcomes in event A and
event B combined.
Example
Given A = 1,2,3,4,5,8 and B = 5,7,9
This can further be generalised to k events, where k is the number of events. The total number of
outcomes for k events is:
n1 n 2 n3 ... n k .
Examples
1. What is the total number of possible outcomes when a die is rolled and then a coin is tossed?
Solution
The roll of a die has six possible outcomes (1;2;3;4;5; 6) and the toss of a coin, 2 outcomes (heads
or tails).
The sample space (total possible outcomes) can be represented as follows:
S={(1;H); (1;T); (2;H); (2;T);(3;H);(3;T);(4;H);(4;T);(5;H);(5;T);(6;H);(6;T)}
2. A restaurant has a 4-piece lunch special which consists of a sandwich, soup, dessert and drink
for 50.00 Ghana cedis. They offer the following choices for:
Sandwich: chicken mayonnaise, cheese and tomato, tuna mayonnaise, ham and lettuce
Solution
Number of choices
4 3 2 5
Using the fundamental counting principle, the possible different meals are:
n1 n 2 n3 n 4 = 4 3 2 5 = 120
3. If a coin is flipped three times, what is the total number of different results?
Solution
Each time a coin is flipped, there are two possible outcomes, namely heads or tails. The coin is
flipped 3 times. We that there is a total of 8 different possible outcomes.
NB: Drawing a tree diagram is possible to draw for three different coin flips, but as soon as the
number of events increases, the total number of possible outcomes increases to the point where
drawing a tree diagram is impractical.
Solution
In this case, using the fundamental counting principle is a far easier option.
We know that each time a coin is flipped that there are two possible outcomes.
333
So if we flip a coin six times, the total number of possible outcomes is equivalent to multiplying 2
by itself six times.
Therefore, 2×2×2×2×2×2=26=64
5. Diana packed 2 skirts, 4 blouses, and 1 sweater to go to school. She will need to choose a skirt
and a blouse for each outfit and decide whether to wear the sweater depending on the weather
conditions. Find the total number of possible outfits.
Solution
To find the total number of outfits, we need to find the product of the number of skirt options, the
number of blouse options, and the number of sweater options as shown below:
2 4 2
Hence, n1 n 2 n3 = 4 2 2 = 16
This is because for the first letter of the pattern, we can choose any of the four available letters, for
the second letter of the pattern, we can choose any of the four letters, and for the final letter of the
pattern, we can choose any of the four letters.
Multiplying the number of available choices for each letter in the pattern gives the total available
arrangements of letters: 4×4×4=43=64
This allows us to formulate the following: when you have n objects to choose from and you choose
from them r times, then the total number of possibilities is n n n ... n(r times) = n r .
Examples
334
1. A school plays a series of 6 soccer matches. For each match there are 3 possibilities: a win, a
draw or a loss. How many possible results are there for the series?
Solution
Step 1: We will determine how many outcomes to choose from for each event:
There are 3 outcomes for each match: win, draw or lose (n).
There are 3 possible outcomes for each of the 6 events. Therefore, the total number of possible
outcomes for the series of matches is nr =3×3×3×3×3×3=36=729
Our daily applications are arranging letters into words and digits into numbers, lining up for
photographs, and decorating rooms. An ordering of such objects is called a permutation.
To solve permutation problems, it is often helpful to draw line segments for each option to enables
us determine the number of each option to multiply. For instance, suppose we have four paintings,
and we want to find the number of ways we can hang three of the paintings in order on the wall.
We can draw three lines to represent the three places on the wall.
Examples
a. How many ways can they place first, second, and third?
335
b. How many ways can they place first, second, and third if a swimmer named Aba wins first
place? (Assume there is only one contestant named Aba.)
c. How many ways can all nine swimmers line up for a photo?
Solution
There are 9 options for first place. Once someone has won first place, there are 8 remaining options
for second place. Once first and second place have been won, there are 7 remaining options for
third place.
Therefore, 9 8 7 = 504
Multiply to find that there are 504 ways for the swimmers to place.
Options for first place x options for second place x options for third place
We know Aba must win first place, so there is only 1 option for first place. There are 8 remaining
options for second place, and then 7 remaining options for third place.
1 x 8 x 7 = 56
Multiply to find that tere are 56 ways for the swimmers to place if Ariel wins first.
There are 362,880 possible permutations for the swimmers to line up.
336
There are two common notations for this permutation, namely P(n, r ) or n Pr , where n is the set
of objects and we want to choose r objects from the n set in order.
n!
The general formula is as n Pr =
(n − r )!
Procedures
• We begin by finding n!, the number of ways to line up all n objects.
• We then divide by (n−r)!
• We cancel out the (n−r)! items that we do not wish to line up.
• We then obtain the actual number we wish to line up
Example
Imagine that six students has formed a study group, and they need to elect a president, a vice
president, and a treasurer. How many possible ways can they do this exercise fairly to everyone in
the group.
Solution
Any of the six students could be elected president, any one of the five remaining students could be
elected vice president, and any of the remaining four students could be elected treasurer.
6! 6 5 4 3!
However, in the using factorials, we get the same result as: n Pr = = = 120
3! 3!
NB: The formula stills works if we are choosing all n objects and placing them in order. In that
case we would be dividing by (n − n )! or 0 ! , which is 1. So the number of permutations
of n objects taken n at a time is n!.
Example
A mathematics professor is creating an exam of 9 questions from a test bank of 12 questions. How
many ways can she select and arrange the questions?
Solution
Substitute n=12 and r=9 into the permutation formula and simplify.
337
12! 12!
n Pr = = = 79,833,600
(12 − 9)! 3!
So far, we have looked at problems asking us to put objects in order. There are many problems in
which we want to select a few objects from a group of objects, but we do not care about the order.
When we are selecting objects and the order does not matter, we are dealing with combinations.
A selection of r objects from a set of n objects where the order does not matter can be written
as C(n, r ) or nCr , just as with permutations,
n!
In this case, the general formula is C (n, r ) = .
(n − r )! r!
• Identify n from the given information.
• Identify r from the given information.
• Replace n and r in the formula with the given values.
• Evaluate the formula
Examples
1. How many ways can a painter select 3 colours out of 4 for a building if order is not considered?
Solution
If we do not care about the order, we would expect a smaller number because selecting
paintings 1,2,3 would be the same as selecting paintings 2,3,1.
To find the number of ways to select 3 of the 4 paintings, disregarding the order of the paintings,
divide the number of permutations by the number of ways to order 3 paintings.
There are 3!=3×2×1= 6 ways to order 3 paintings. There are 24 or 4 ways to select 3 of
the 4 paintings. This is because every time we are selecting 3 paintings, we
are not selecting 1 painting.
There are 4 paintings we could choose not to select, so there are 4 ways to select 3 of
the 4 paintings.
2. UEW food restaurant offers five side dish options. Your meal comes with two side dishes.
a). How many ways can you select your side dishes?
b). How many ways can you select 33 side dishes?
Solution
a). We want to choose 2 side dishes from 5 options.
338
n!
C (n, r ) =
5!
= = 10
(n − r )! r! (5 − 2)!2 !
Examples
1. A pizza restaurant offers 5 toppings. Any number of toppings can be ordered. How many
different pizzas are possible?
Solution
Here, we need to consider pizzas with any number of toppings.
Note that there is C(5,0)=1 way to order a pizza with no toppings. There are C(5,1)=5 ways to
order a pizza with exactly one topping.
We are presented with a sequence of choices. For each of the n objects we have two choices:
include it in the subset or not. So for the whole subset we have made n choices, each with two
options. So there are a total of 2×2×2×…×2 possible resulting subsets.
Thus we start all the way from the empty subset, which we obtain when we say “no” each time, to
the original set itself, which we obtain when we say “yes” each time.
2. A restaurant offers butter, cheese, chives, and sour cream as toppings for a baked potato. How
many different ways are there to order a potato?
Solution
We are looking for the number of subsets of a set with 4 objects. Substitute n=4 into nr.
We have 2n = 24 =16
There are 16 possible ways to order a potato.
339
Finding the Number of Permutations of n Non-Distinct Objects
For example, suppose there is a sheet of 12 stickers. If all of the stickers were distinct, there would
be 12! ways to order the stickers. However, if 4 of the stickers are identical stars, and 3 are
identical moons, then all the objects are not distinct, and many of the 12! Permutations may be
duplicated.
n!
Therefore, the general formula for this situation is P(n, r ) =
r1 ! r2 !...rk !
In this example, we need to divide by the number of ways to order the 4 stars and the ways to order
the 3 moons to find the number of unique permutations of the stickers.
There are 4! ways to order the stars and 3! ways to order the moon.
n!
P(n, r ) =
12!
= = 3,326,400
r1 ! r2 !...rk ! 4!3!
Example
Find the number of rearrangements of the letters in the word DISTINCT.
Solution
There are 8 letters. Both I and T have repeated 2 times.
n!
Substitute n=8, r1 =2, and r2 =2 into the formula P(n, r ) =
8!
= = 10,080
r1 ! r2 !...rk ! 2!2!
Summary
Experiment
An experiment is a process by which an outcome is obtained. Therefore, a random experiment is a
process or action whose outcome is not determined.
Examples
1. rolling a die
2. tossing a coin
3. selecting a card
341
4. selecting a colour
5. throwing a ball
6. measuring ages or height or weights of pupils
7. choosing a number
8. writing an English vowel
9. drawing a shape
Outcome
An outcome is a single result from a measurement of an experiment.
Examples
1. one of head or tail from a coin
2. one of 1,2,3,4,5 or 6 from a die
3. one of heart, spade, diamond or culet of a desk of cards
4. one of ‘a’, ‘e’, ‘I’, ‘o’, or ‘u’ of the English vowels
5. one 2,3,5,7,11,13,17,or 19 of prime numbers less than 20
Sample Space
In probability, the set of all possible outcomes is called the Sample Space. We use S to represent
the sample space.
In terms of the language of sets, a sample space is a universal set and an outcome is an element of
the universal set.
NB: The number of all possible outcomes may be finite, infinite or continuous.
Examples
1. The sample space for the experiment of tossing a coin once is S = {H, T} because there are only
two possible outcomes, Heads or Tails.
342
2. The sample space for the experiment of tossing a standard die is S = {1, 2, 3, 4, 5, 6} because
these are the only six possible outcomes.
2. The sample space for the experiment of selecting a playing from a standard deck of cards is S =
{1, 2, 3, …52} because these are the only 52 possible outcomes.
3. The set of prime numbers S = {2, 3, 5, 7, 11, 13, 17, 19…}
An Event
It is a particular result or set of results amongst the possibilities in the sample space.
Examples
1. obtaining odd numbers from tossing a die
2. obtaining head from throwing a coin
3. obtaining a heart from selecting cards
The concept of probability
Probability is the likelihood that an event will occur. It is written as a fraction with the number of
favourable outcomes as the numerator and the total number of outcomes as the denominator.
Favourable just means that a particular outcome is what you are curious about, not that it is
necessarily positive.
Probability can be used to determine many things, from the likelihood that you will win the jackpot
in the lottery to the likelihood that a baby will be born with a certain birth defect and anything in
between. Probability is used extensively in the sciences, investing, weather reporting and many
other areas.
A. Classical definition
If there are a finite number of possible outcomes of an experiment, all equally likely and mutually
exclusive, then the probability of an event (A) is the number of outcomes favourable to the event,
divided by the total number of possible outcomes.
n( A)
That is, P ( A) =
n (S )
Examples
1. A fair die is rolled once. What is the probability that the outcome is more than 4>
Solution
343
S = 1,2,3,4,5,6 and A = 5,6
Thus n( S ) = 6 and n( A) = 2
n( A) 2 1
Therefore ( A) = = =
n(S ) 6 3
The probability of an event denotes the relative frequency of occurrence of that event in the long
run. For instance, the probability of a newborn infant being female is estimated to be about .51 in
Ghana. This is also called the frequentist definition and is the one in common use.
But it is not a fully satisfactory definition. What does in the long run mean? And what about
situations in which the experiment cannot be repeated indefinitely under identical conditions, even
in principle?
In other words, it is a type of probability that has a set of axioms (rules) attached to it. For example,
you could have a rule that the probability must be greater than 0%, that one event must happen,
and that one event cannot happen if another event happens.
It could also states that a probability is a number between zero and one, but nothing is specified
about how to assign it. Assignment may be based on a model or on experimental data.
Developments are valid if they follow from the axioms, independent of any correspondence to
phenomena of the physical world. The following statements represent the axioms of probability.
Axioms of probability
Let S be a finite sample space, A an event in S. We define P(A), the probability of A, to be the value
of an additive set that satisfies the following three conditions
Axiom 1: 0 ≤𝑃𝐴≤1 for each event A in S (probabilities are real numbers between 0 and 1 inclusive).
344
Axiom 2: (𝑆) =1 (the probability of some event occurring from S is unity).
Axiom 3: If A and B are mutually exclusive events in S, then (𝐴 U 𝐵) = 𝑃𝐴+𝑃(𝐵) (the probability
function is an additive set function).
( ) ( )
Axiom 4: If A is an event in S, then P A = 1 − P( A) , where A is the complement of A.
Compound Events
A compound event is one in which there is more than one possible outcome. Determining the
probability of a compound event involves finding the sum of the probabilities of the individual
events and, if necessary, removing any overlapping probabilities.
1) What is the probability that you will roll a five using a 6-sided die?
The favourable outcome is rolling a five, and that can only occur once using one die. The total
number of outcomes is six, since the die is 6-sided.
2) What is the probability that you will pull a heart out of a standard deck of cards?
The favourable outcome would be pulling a heart and there are 13 of them in a standard deck. The
total number of outcomes is 52 because there are 52 cards in a standard deck.
A compound event is an event with two or more favourable outcomes. There are three types of
compound events and determining the probability for each is different. First, let’s talk about an
exclusive compound event.
An exclusive compound event in one in which the multiple events do not overlap. The method for
determining the probability of this type of compound event is to add together the probabilities of
each event.
345
Compound events that cannot happen at the same time are called mutually exclusive events. For
example, a number cannot be both even and odd or you cannot have picked a single card from a
deck of cards that is both a ten and a jack.
NB: Mutually inclusive events, however, can occur at the same time. For example, a number can
be both less than 5 and even or you can pick a card from a deck of cards that can be a club and a
ten.
When finding the probability of events occurring at the same time, there is a concept known as the
“double counting” feature. It happens when the intersection is counted twice.
But if the events are mutually exclusive events, then P(A and B)=0, because they cannot happen
at the same time. To find the probability of either mutually exclusive event A or B occurring, we
say that P(A or B)=P(A)+P(B)
If the events are mutually inclusive, then the probability of one or the other mutually inclusive
event is to add the individual probabilities and subtract the probability they occur at the same time.
That is, P(A or B)=P(A)+P(B)−P(A and B).
Note that finding the probability of one or the other mutually exclusive event is the same as the
formula for finding the probability of one or the other mutually inclusive event
except P(A and B)=0 in the case of the mutually exclusive events.
Examples
1). What is the probability of rolling either a two or a four using one 10-sided die?
Solution
The probability of rolling a two is 1/10 and the probability of rolling a four is 1/10.
So, the compound probability is: P(C) = 1/10 + 1/10 = 2/10 or 1/5
2). What is the probability of pulling any face card or a three of clubs from a standard deck of
cards?
Solution
The probability of getting a face card is 12/52 and the probability of getting a three of clubs is 1/52
3). What is the probability of drawing a black card or a ten in a deck of cards?
346
Solution
P(black or ten)=4/52+26/52−2/52=30/52−2/52=28/52=7/13
a. P(jack or a king)
b. P(jack or a spade)
Solutions
b. P(jack or a spade)
P(jack)=4/52
P(spade)=13/52
The probability of an independent event in the future is not dependent on its past. For example, if
you toss a coin three times and the head comes up all the three times, then what is the probability
of getting a tail on the fourth try? The answer is simply 1/2.
Example
347
1. Out of the following examples, which represents an independent event?
A). The probability of drawing an Ace from a well-shuffled pack of 52 cards, twice.
B). Probability of drawing a King from a pack of 52 cards and an Ace from another well-shuffled
pack of 52 cards.
C). Two queens which we draw out of a well-shuffled pack of 52 cards.
D). All of the above events are examples of independent events.
Solution
In option A), the two of events are drawing an ace and then drawing another ace. When we draw
the first ace, we have one event in our favour and 52 in total. So the probability is 1/52. For the
second draw, there is 1 less card in the deck, so these events in which we have only one pack of
cards cannot be independent events.
In option B), Probability of drawing a King from a pack of 52 cards and an Ace from another well-
shuffled pack of 52 cards are independent events.
In option C), the two of events are drawing a queen and then drawing another queen. When we
draw the first queen, we have one event in our favour and 52 in total. So the probability is 1/52.
For the second draw, there is 1 less card in the deck, so these events in which we have only one
pack of cards cannot be independent events.
In option D), not true that of the above events are independent events.
The Rule of products is only applicable to the events that are independent of each other. The product
gives the total probability of such events. In other words, the probability of all such events occurring
is what we get from the product of probabilities.
Example
1. Find the probability of the event of getting a total of less than 12 in the experiment of throwing
a die twice.
348
Solution
But P(A)=1/36
Hence P( A ) = 1 − 1 / 36 = 35 / 36
2. In a sample of 55 people, 28 have brown hair and 22 have blue eyes. 5 of them have neither
brown hair nor blue eyes. What is the probability that a random person from the sample has at least
one of these features?
Solution
Our compound events here are “brown hair”, and “blue eyes.” To find the probability that a person
chosen at random from the sample of 55 people has at least one of these features, we can simply
note that since 5 of the 55 have neither feature, all the rest must have at least one, that is,
that 55−5=50 of the 55 have either brown hair or blue eyes or both.
The probability that a person chosen at random has either brown hair, blue eyes, or both is,
therefore,𝑃(∪)=50/55=10/11. Brown hair + Blue eyes
In essence, we have used our total probability rule to calculate to know that, for event 𝐴,
(𝐴)=1−𝑃(𝐴¯) and neither brown hair nor blue eyes 𝑃(∪)=1−𝑃(∪)=1−5/55=50/55=10/11.
Hence, as noted, the probability that a person chosen at random from the sample has at least one
of the features “brown hair” and “blue eyes” is 10/11.
Summary
In this section that we have learned that an experiment is a process by which an outcome is
obtained, an outcome is a single result from a measurement of an experiment and an event is a
particular result or set of results amongst the possibilities in the sample space.
We have also learned that the Axioms of probability are 0 ≤𝑃𝐴≤1 (probabilities are real numbers
between 0 and 1 inclusive), 𝑃(𝑆) =1 (the probability of some event occurring from S is unity), if A
and B are mutually exclusive events in S, then 𝑃(𝐴 U 𝐵) = 𝑃𝐴+𝑃(𝐵) (the probability function is
( ) ( )
an additive set function), and if A is an event in S, then P A = 1 − P( A) , where A is the
complement of A.
349
Again, we have learned that a compound event is an event with two or more favourable outcomes
and there are three types of compound events and determining the probability for each is different.
These are mutually exclusive or mutually inclusive, independent or dependent, and nonmutually
exclusive or complementary events.
Introduction
In this unit 6 section 2, we learned the basic concepts of an experiment, an outcome and an event.
We also learned the axioms of probability, compound events, mutually exclusive or mutually
inclusive, independent or dependent, and nonmutually exclusive or complementary events. In this
section, we are going to learned basic concepts in dependents events with replacements and events
without replacements and the total probability rule.
350
Dependent Events with replacements and events without replacements
Suppose you flip a coin and roll a die at the same time. These are compound events. What is the
probability you will flip a head and roll a four? These events are independent. Rolling a die has no
effect on flipping a coin, and so replacing the dice or not does not affect the probability of the coin.
However, suppose you randomly draw a card from a standard deck and then randomly draw a
second card without replacing the first. The second probability is now different from the first and
depends on the first.
If the second card is replaced, then the probabilities of the two events will be the same. However,
if the second card is not replaced, then the probability of the two will differ since the total deck of
cards will be reduced accordingly.
351
To find the probability of two dependent events, multiply the probability of the first event by the
probability of the second event, after the first event occurs. P(A and B)=P(A)⋅P(B following A)
Example
B: 1st card is a 7
a. P(A or B)
A club or a 7 can be picked at the same time so these are mutually inclusive events. You can use
the formula from above.
P(A or B)=13/52+4/52−1/52
b. P(B or A)
c. A club and a 7 can be picked at the same time so these are mutually inclusive events. You can
use the formula from above.
P(B or A)=4/52+13/52−1/52
d. P(A and C)
Picking a club on the first card and a heart on the second card are dependent events so you need to
multiply the probability of A by the probability of C following A.
2. A bowl contains 12 red marbles, 5 blue marbles and 13 yellow marbles. Find the probability of
drawing a blue marble and then drawing a yellow marble.
352
Solution
Let A=blue marble chosen 1st and B=yellow marble chosen 2nd. The total number of marbles in
the bowl is 12+5+13=30.
3. 𝐴 and 𝐵 are independent events, where (𝐴)=13 and 𝑃(𝐵)=25. What is the probability that
events 𝐴 and 𝐵 both occur?
Solution
Given that events 𝐴 and 𝐵 are independent, the probability that they both occur is
(𝐴∩𝐵)=𝑃(𝐴)×𝑃(𝐵)=1/3×2/5=2/15.
4). A bag contains 22 red balls and 9 green balls. One red ball is removed from the bag and then a
ball is drawn at random. Find the probability that the drawn ball is red.
Solution
To find the probability of drawing a second red ball from the bag, we note first that since there are
22 red balls and 9 green ones, there are 22+9=31 balls in total.
The probability of drawing a red ball from the bag on our first pick is, therefore, 𝑃(R)==22/31.
Since we are keeping the first ball out of the bag, there is one less ball in the bag in total, so there
are now 30 balls in the bag. And of those 30, there is one less red since the ball we took out was
red.
To help us work out the probability of drawing a second red ball from the bag, we can illustrate
the probability of taking a second red ball having not replaced the first is found by multiplying the
probabilities of the “first ball red” and “second ball red”: 𝑃(∩) = 22/31×21/30 = =77/155 ≈ 0.497.
The probability that the second ball drawn is red is therefore 0.497. We can say that there is
approximately a 50% chance of choosing two consecutive red balls (since 0.497×100%=49.7%).
353
Note that we have actually used the formula for compound dependent
events: (𝐴∩𝐵)=𝑃(𝐴∣𝐵)×𝑃(𝐵).
The probability 21/30 is the conditional probability of selecting a red ball given that a red ball has
already been taken from the bag:
5. Two coins are flipped simultaneously. What is the probability of getting heads on either of these
coins?
Solution
First thing that you realise is that these are independent events. Once you do that, move on to find the
probability of each individual event.
Let us call the first coin toss as E and the second coin toss as F. Therefore we can write: P (E) = 1/2
i.e. probability of getting a head on the first coin toss = 1/2.
Similarly, the probability of getting a head on the second coin’s toss = 1/2. In other words, we can
write that P (F) = 1/2.
Now we have to calculate the probability of both these events happening together. Hence we use the
rule of the product. If P is the probability of some event and Q is the probability of another event, then
the probability of both P and Q happening together is P×Q.
Hence the probability that either of the two coins will turn up a head = 1/2 × 1/2 = 1/4
6. A die is cast twice and a coin is tossed twice. What is the probability that the die will turn a 6 each
time and the coin will turn a tail every time?
Solution
Each time the die is cast, it is an independent event. The probability of a getting a 6 is = 1/6. So the
probability of getting a 6 when the die is cast twice = 1/6 × 1/6 = 1/36
Similarly the probability of getting a tail in two flips that follow each other (are independent) =
(1/2)×(1/2) = 1/4
Therefore as the two events i.e. casting the die and tossing the coin are independent, and the
probability of both the events = (1/36)×(1/4) = 1/144.
7. If one has three dice what is the probability of getting three 4s?
Solution
The probability of getting a 4 on one die is 1/6
The probability of getting three 4s is: P(4 and 4 and 4) =1/6×1/6×1/6=1/216
354
When the outcome affects the second outcome, which is what we called dependent events.
8. What is the probability for you to choose two red cards in a deck of cards?
Solution
A deck of cards has 26 black and 26 red cards. The probability of choosing a red card randomly
is: P(red)=26/52=1/2
The probability of choosing a second red card from the deck is now: P(red)=25/51
The probability: P(2red)=1/2×25/51=25/102
Introduction
Dear Students,
In this unit 6 section 3, we learned the basic concepts of dependent and independent events and
their applications to real life situations. In this section, we are going to learn dependent events
using conditional probabilities and tree diagrams.
355
Tree and Venn Diagrams
Sometimes, when the probability problems are complex, it can be helpful to graph the situation.
Tree diagrams and Venn diagrams are two tools that can be used to visualize and solve conditional
probabilities.
A tree diagram is a special type of graph used to determine the outcomes of an experiment. It
consists of ‘branches’ that are labelled with either frequencies or probabilities.
Tree diagrams can make some probability problems easier to visualize and solve. The following
example illustrates how to use a tree diagram.
Example
1. In an urn, there are 11 balls. Three balls are red (R) and eight balls are blue (B). Draw two balls,
one at a time, with replacement. With replacement means that you put the first ball back in the urn
before you select the second ball. The tree diagram using frequencies that show all the possible
outcomes follows.
356
Solution
On the diagram, the first set of branches represents the first draw. The second set of branches
represents the second draw. Each of the outcomes is distinct.
In fact, we can list each red ball as R1, R2, and R3 and each blue ball as B1, B2, B3, B4, B5, B6,
B7, and B8.
Then the nine RR outcomes can be written as: R1R1; R1R2; R1R3; R2R1; R2R2; R2R3; R3R1;
R3R2; R3R3.
The other outcomes are similar.
There are a total of 11 balls in the urn. Draw two balls, one at a time, with replacement. There are
11(11) = 121 outcomes, the size of the sample space.
Solution
a. B1R1; B1R2; B1R3; B2R1; B2R2; B2R3; B3R1; B3R2; B3R3; B4R1; B4R2; B4R3; B5R1; B5R2;
B5R3; B6R1; B6R2; B6R3; B7R1; B7R2; B7R3; B8R1; B8R2; B8R3
Solution
b. P(RR) = (3/11)(3/11)=9/121
Solution
357
P(RB OR BR)=(3/11)(8/11)+(8/11)(3/11)=48/121
d. Using the tree diagram, calculate P(R on 1st draw AND B on 2nd draw).
Solution
d. P(R on 1st draw AND B on 2nd draw) = P(RB) = (3/11)(8/11)=24/121
e. Using the tree diagram, calculate P(R on 2nd draw GIVEN B on 1st draw).
Solution
e. P(R on 2nd draw GIVEN B on 1st draw) = P(R on 2nd|B on 1st) = 24/88=3/11
This problem is a conditional one. The sample space has been reduced to those outcomes that
already have a blue on the first draw. There are 24 + 64 = 88 possible outcomes (24 BR and 64
BB). Twenty-four of the 88 possible outcomes are BR. 24/88 = 3/11 .
f. Using the tree diagram, calculate P(BB).
Solution
f. P(BB) = 64/121
g. Using the tree diagram, calculate P(B on the 2nd draw given R on the first draw).
Solution
g. P(B on 2nd draw|R on 1st draw) = 8/11
There are 9 + 24 outcomes that have R on the first draw (9 RR and 24 RB). The sample space is
then 9 + 24 = 33.
24 of the 33 outcomes have B on the second draw. The probability is then 24/33=8/11 .
358
a. P(RR) = (3/11)(2/10) = 6/110 =2/55
e. P(BB) = (8/11)(7/10)=56/110=28/55
Summary
In this section, we have learned how to use the tree diagrams to solve problems in probability. I
hope we have all enjoyed the two problems we have solved above?
Thank you.
Let us proceed to section 5.
Introduction
Dear Students,
359
In this unit 6 section 4, we learned the basic concepts of conditional probability using the tree
diagrams and their applications to real life situations. In this section, we are going to extend the
tree diagrams to Bayes theorem and solve practical problems.
Say active and we all do the following activities!
Conditional Probability
Dear Students,
Suppose we know that a certain event ‘B’ has occurred. How does this impact the probability of
some other ‘A’. This question is addressed by conditional probabilities. We write it as P(A|B), the
conditional probability of A given B.
P( A B )
The conditional probability of A given B is expressed as P( A / B ) = .
P (B )
It is also useful to think of this formula P(A ∩ B) = P(A|B)P(B).
More generally we can condition on a collection of n events provided they are pairwise disjoint
and add up to all the sample space as P(A) = P(A|B1)P(B1) + P(A|B2)P(B2) + · · · + P(A|Bn)P(Bn)
Bayes’ Theorem
Dear Students,
Bayes Theorem is a theorem in probability theory named for Thomas Bayes (1702-1761). In
epidemiology, it is used to obtain the probability of disease in a group of people with some
characteristic on the basis of the overall rate of that disease and of the likelihoods of that
characteristic in healthy and diseased individuals.
The most familiar application is in clinical decision analysis where it is used for estimating the
probability of a particular diagnosis given the appearance of some symptoms or test result.
Bayes’ theorem is a way to figure out conditional probability. Conditional probability is the
probability of an event happening, given that it has some relationship to one or more other events.
For example, your probability of getting a sitting place in the lecture room is connected to the time
of the day you come to lectures, where you sit, and what conventions are going on at any time.
Bayes’ theorem is slightly more nuanced. In a nutshell, it gives you the actual probability of
an event given information about tests.
P(B / A)P( A)
Mathematically, P( A / B ) =
P (B )
Activities
360
1. In a particular pain clinic, 10% of patients are prescribed narcotic pain killers. Overall, five
percent of the clinic’s patients are addicted to narcotics (including pain killers and illegal
substances). Out of all the people prescribed pain pills, 8% are addicts. If a patient is an addict,
what is the probability that they will be prescribed pain pills?
Solution
Step 1: Figure out what your event “A” is from the question. That information is in the italicized
part of this particular question. The event that happens first (A) is being prescribed pain pills.
That’s given as 10%.
Step 2: Figure out what your event “B” is from the question. That information is also in the
italicized part of this particular question. Event B is being an addict. That’s given as 5%.
Step 3: Figure out what the probability of event B (Step 2) given event A (Step 1). In other words,
find what (B|A) is. We want to know “Given that people are prescribed pain pills, what is the
probability they are an addict?” That is given in the question as 8%, or .8.
Step 4: Insert your answers from Steps 1, 2 and 3 into the formula and solve.
P(A|B) = P(B|A) * P(A) / P(B) = (0.08 * 0.1)/0.05 = 0.16
Therefore, the probability of an addict being prescribed pain pills is 0.16 or 16%.
2. A doctor is called to see a sick child. The doctor has prior information that 90% of sick children
in that neighborhood have the flu, while the other 10% are sick with measles.
Let F stand for an event of a child being sick with flu and M stand for an event of a child being
sick with measles. Assume for simplicity that F ∪ M = Ω, i.e., there are no other maladies in that
neighborhood. A well-known symptom of measles is a rash (the event of having which we denote
R). Assume that the probability of having a rash if one has measles is P(R | M) = 0.95. However,
occasionally children with flu also develop rash, and the probability of having a rash if one has flu
is P(R | F) = 0.08. Upon examining the child, the doctor finds a rash.
What is the probability that the child has measles?
Solution
Solution
361
Let RR, BB, and RB denote, respectively, the events that the chosen cars is the red-red, the black-
black, or the red-black card.
Letting R be the event that the upturned side of the chosen card is red, then we obtain,
P( RR) P( R)
P(RB / R ) =
P( R / RB) P( RM )
=
P( R) P( R / RR) P( RR) + PR / RBP ( RB) + PR / BB) P( BB)
P(RB / R ) =
(1 / 2)(1 / 3) 1
=
(1)(1 / 3) + (1 / 2)(1 / 3) + (0)(1 / 3) 3
Summary
In this section, we have learned that conditional probability of A given B is expressed as
P( A B )
P( A / B ) = .
P (B )
We have also learned that Bayes’ theorem gives you the actual probability of event ‘A’ given event
P(B / A)P( A)
‘B’ as P( A / B ) = .
P (B )
Can you create your own problems and use the two theorems to solve them?
Congratulations!
Dear Students,
In unit 6 section 1, we learned the counting principles. The two main ones were permutations and
combinations.
What did we say about permutations?
What did we say about combinations?
362
Thank you for recalling these two important concepts in probability.
Again, in unit 6 section 5, we learned that conditional probability of A given B is
P( A B )
P( A / B ) = , and the Bayes’ theorem of event ‘A’ given event ‘B’ as
P (B )
P(B / A)P( A)
P( A / B ) = .
P (B )
In this section, we are going to learn how to solve problems in probability using permutations and
combinations.
Take your pens, calculators and jotters and lets go!
363
n!
Also, recall that the permutation of r objects from n is given by n Pr = and combination is
(n − r )!
n!
nCr = .
r !(n − r )!
n! n!
As applied to probability, n Pr = and nCr = serve as the total or sample space
(n − r )! r !(n − r )!
while the selections serve as the numerators.
Dear Students,
Let us go straight to solving some problems and these concepts will be consolidated.
Thank you.
Activities
1. A four-digit PIN is selected. What is the probability that there are no repeated digits?
Solution
There are 10 possible values for each digit of the PIN (0, 1, 2, 3, 4, 5, 6, 7, 8, 9)
To have no repeated digits, all four digits would have to be different, which is selecting without
replacement. We could either compute 10×9×8×7, or notice that this is permutation
10P4 = 5040.
The probability of no repeated digits is the number of 4 digit PINs with no repeated digits divided
by the total number of 4 digit PINs.
10P 4 5040
This probability is = = 0.504
10 4 10000
2. In a certain lottery company, 48 balls numbered 1 through 48 are placed in a machine and six
of them are drawn at random. If the six numbers drawn match the numbers that a player had chosen,
the player wins 1,000,000 Ghana cedis. In this lottery, the order the numbers are drawn in does not
matter.
Compute the probability that you win the million-dollar prize if you purchase a single lottery ticket.
Solution
364
In order to compute the probability, we need to count the total number of ways six numbers can
be drawn, and the number of ways the six numbers on the player’s ticket could match the six
numbers drawn from the machine.
Since there is no stipulation that the numbers be in any particular order, the number of possible
outcomes of the lottery drawing is
48C6 = 12,271,512.
Of these possible outcomes, only one would match all six numbers on the player’s ticket, so the
6C 6 1
probability of winning the grand prize is =
48C 6 12271512
3. In the lottery company from question 2, if five of the six numbers drawn match the numbers that
a player has chosen, the player wins a second prize of Ghc1,000. Compute the probability that she
wins the second prize if she purchases a single lottery ticket.
Solution
In order to win the second prize, five of the six numbers on the ticket must match five of the six
winning numbers
In other words, we must have chosen five of the six winning numbers and one of the 42 losing
numbers. The number of ways to choose 5 out of the 6 winning numbers is given by 6C5 = 6 and
the number of ways to choose 1 out of the 42 losing numbers is given by 42C1 = 42.
Thus the number of favourable outcomes is then given by the 6C5 × 42C1 = 6 × 42 = 252.
4. Compute the probability of randomly drawing five cards from a deck and getting exactly one
Ace.
Solution
In many card games (such as poker) the order in which the cards are drawn is not important (since
the player may rearrange the cards in his hand any way he chooses). So, we will assume that this
365
is the case. Thus we use combinations to compute the possible number of 5-card hands,
52C5.
This number will go in the denominator of our probability formula, since it is the number of
possible outcomes.
For the numerator, we need the number of ways to draw one Ace and four other cards (none of
them Aces) from the deck.
Since there are four Aces and we want exactly one of them, there will be 4C1 ways to select one
Ace
Since there are 48 non-Aces and we want 4 of them, there will be 48C4 ways to select the four non-
Aces.
Now we use the Basic Counting Rule to calculate 4C1 × 48C4 ways to choose one ace and four non-
Aces.
Summary
Dear Students,
You have noticed how Permutation and Combination is a very important topic in this unit. Through
permutations and combinations, we counted various arrangements in unit 6 section 1 that can be made
from a certain group. In this unit 6 section 6, we have used the concepts to solve concepts with a diverse
forms and structure.
Continue to explore more applications of Permutations and Combinations in your everyday life.
Congratulations!
366