TEC 106-Complete Converted Notes-1
TEC 106-Complete Converted Notes-1
6 Quadratic mean
PROBABILITY AND STATISTICS 4. Measures of dispersion or spread ………… 13
4.1 Range
Objectives 4.2 Mean deviation
At the end of this course the student should be able to: 4.3 Quartile deviation
calculate statistical functions and analyze 4.4 Variance
samples and present them in tabular or 4.5 Standard deviation………………………… 14
graphical forms 4.6 Relative measures of deviation…………… 15
describe concept of a random event and a 4.7 Moments
random experiment 4.8 Measures of skewness
describe and calculate discrete random and
continuous random variables 5. Random experiments and events…………….16
5.1 Basic terminology
Course Content 5.1.1 Trial and outcome……………
Tabular and graphical representation of samples: 5.1.2 Sample space
frequency, relative frequency, absolute frequency, and 5.2 Venn diagram representations ……………. 17
distribution function; sample mean, sample variance and 5.2.1 Complement
standard deviation. Random experiments and events: 5.2.2 Intersection………………………….18
Venn diagram, union, intersection, mutually exclusive 5.2.3 Union
events, multiplication rule, and complementation rule. 5.3 Sequential experiments
Discrete random variables: probability function, probability 5.4 Sampling
distribution function, mean and variance of a distribution. 5.5 Tree diagram representations
Continuous random variables: continuous distributions,
Binomial distributions, normal probability distribution. 6. Introduction to Probability ……………………...19
6.1 Introduction
Table of Contents 6.1.1 Theoretical probability
1. Introduction to Statistics 6.1.2 Empirical probability
1.1 Definition of Statistics…………………….. 2 6.2 Concept of Statistical regularity
1.2 Purpose of Engineering statistics 6.3 Axioms of mathematical probability…………20
1.3 Introduction to Basic terms 6.4 Conditional probability………………………..21
1.3.1 Statistical unit 6.5 Permutations and combinations…………….23
1.3.2 Population and sample
1.3.3 Variation and Random variation 7. Random variables………………………………..25
1.3.4 Random selection 7.1 Definition of the random variable
1.3.5 Experiment 7.2 Discrete random variables
1.4 The statistical experiment……………….. 3 7.2.1 Discrete probability function
1.5 Probability versus Statistics 7.2.2 Discrete distribution function………
7.3 Continuous random variables
2. Tabular and Graphical representation of 7.3.1 Probability density function ………26
Samples 7.3.2 Continuous distribution function
2.1 Grouping of data 7.4 Mean and variance of a probability distribution
2.2 Tabular representations of data ……….. 4 7.4.1 Mean or expectation………………27
2.2.1 Tally chart 7.4.2 Variance
2.2.2 Absolute frequency
2.2.3 Relative frequency 8. Probability Distributions………………………..29
2.2.4 Cumulative absolute frequency 8.1 Discrete probability distributions
2.2.5 Cumulative relative frequency 8.1.1 Binomial distribution
2.3 Graphical representations of data ……. 7 8.1.2 Hypergeometric distribution
2.3.1 Graphs of the Absolute frequency 8.1.3 Poisson distribution
2.3.1.1 Bar chart 8.2 Continuous probability distributions………..31
2.3.1.2 Dot frequency diagram 8.2.1 Normal distribution
2.3.2 Graphs of the relative frequency 8.2.2 Standard normal distribution
2.3.2.1 Frequency histogram 9. Sample Examination Paper………………….…33
2.3.2.2 Frequency polygon
2.3.3 Cumulative frequency function / curve
Reading List
3. Measures of Central tendency………………… 13 th
3.1 Measures of position 1. Elementary Statistics, 4 Ed, Robert Johnson,
3.1.1 Quartiles (1984). PWS Publishers.
3.1.2 Deciles 2. Mathematical Methods for Physics and
3.1.3 Percentiles Engineering, Riley K. F., Hobson M. P., Bence S.
3.2 Measures of average (2000) J. Cambridge University Press.
th
3.2.1 Median 3. Engineering Mathematics, 4 Ed.
3.2.2 Mode Stroud K. A, (1995), Macmillan Press Ltd.
3.2.3 Arithmetic mean 4. Advanced Engineering Mathematics, Jaggi V. P.,
3.2.4 Geometric mean Mathur A. B. (1992), Khanna Publishers, Delhi.
th
3.2.5 Harmonic mean 5. Advanced Engineering Mathematics, 5 Ed,
Erwin Kreyszig (1993), Wiley Eastern Limited.
1
TOPIC 1.0 Kenya may have the Kenya Shilling (KES) as the statistical
unit of reference. Other examples include kilograms,
INTRODUCTION TO STATISTICS degrees Celsius or Fahrenheit, metres, etc.
The origin of statistics is as old as the human society The statistical unit gives an indication as to whether the
itself. Governments have been collecting information sampled data is continuous or discrete.
regarding populations, armed forces, weapons, wealth,
agriculture, economic conditions and various other activities 1.3.2 Population and sample
of the state. This information when properly collected, Population refers to a collection, or set, of individuals,
classified and presented helped in administration and objects or measurements whose properties are to be
future planning of the state. analysed. In most cases the population is finite or countably
Theoretical development of modern statistics was infinite i.e. the total number of objects in the population is
possible due to the contribution of French mathematicians known. A numerical characteristic of an entire population is
Blaise Pascal (1623 - 1662) and P. Fermat (1601 - 1665), called a parameter.
who solved the famous 'Problem of Points' laying the A sample is a subset of a population. The sample consists
foundation of the theory of probability which is the of a number of individuals, objects or measurements that
backbone of modern statistics. are randomly selected to facilitate the analysis of one or
more characteristics of the population. A numerical
Section objectives: - characteristic of a sample is called a statistic.
At the end of this section you should be able to: - With respect to set theory the population can be thought
1. Define statistics and highlight its applications in the of as being the universal set (U), Fig. 1.0, from which
field of Engineering samples A, B, and C are randomly selected for study.
2. Describe the meanings of the basic terms used in
statistics such as the statistical unit, population, U
sample, data, random variation, random samples
3. Distinguish between discrete and continuous data B
types
4. Define the steps followed in statistical
experimentation. A
5. Describe the relationship between probability and C
statistics.
2
1.3.5 Random selection Home Study Exercise 1.1
The process of generating a sample in which all objects or An investigation focuses on the cost of Engineering
individuals in the population are given an equal chance of education at Moi University. One of the expenses that a
being selected into the sample is called random selection student must contend with is the cost of textbooks. Let x be
or random sampling. This process involves assigning a the cost of all textbooks purchased by each student this
number to each member of the population and then using a semester. Carefully describe: -
random number generator to select a specified number of (a) the population
members into the sample. (b) the statistical unit
If we are to obtain meaningful information about the If the Dean's office wishes to estimate the average cost of
population from a sample then it is mandatory that the textbooks per student per semester
sample to be a random selection. (c) describe the population parameter
(d) the sample statistic that is of interest to the Dean's
1.3.5 Experiment office if 50 students maintained a record of their
An experiment is a planned activity whose results yield a textbook expenses per semester.
set of data. (e) Describe how you would use the 50 data in the
sample to calculate the value of the sample statistic
Tutorial Exercise discussed in (d).
Find the average age and weight of students in your class
by randomly sampling
(a) 5 students TOPIC 2.0
(b) 10 students
(c) 20 students TABULAR AND GRAPHICAL
Repeat each of the above experiments twice and comment REPRESENTATION OF SAMPLES
on the consistency of the three sets of results obtained.
The raw ungrouped data that is obtained from experiments,
1.4 The Statistical Experiment surveys, etc. are usually numerous and disorderly and
The essential problem in statistics is to find quantitative hence cannot furnish any useful information. Grouping of
procedures for describing and interpreting sets of data. data into classes and manipulation of this data into tabular
There are two aspects to this problem: and graphical forms condenses the information therein
The description of a set of data (sample/population) making it easier to observe certain trends and consequently
in terms of a small set of descriptive quantities draw meaningful conclusions.
(called statistics/parameters respectively). This
aspect is called descriptive statistics. Section objectives
Drawing inferences about population parameters At the end of this section you should be able to: -
by examining sample statistics. This aspect is 1. Group an ungrouped data into classes of equal width
called inferential statistics. 2. Determine the class interval, class boundaries, class
values, class limits of a grouped data distribution.
The statistical experiment has the following distinct phases: 3. Construct the frequency distribution for ungrouped and
1. Formulation of the problem - involves the grouped data including the tally chart, absolute
creation of a mathematical model that is based on frequency, relative frequency, cumulative absolute
a clear understanding of statistical concepts. frequency, and cumulative relative frequency.
2. Design of the experiment - choice of statistical 4. Construct bar charts, dot diagrams, frequency
methods, sample size, etc. histograms and polygons and the cumulative frequency
3. Collection of data - actual experimentation to graph (ogive) from a tabulated frequency distribution
develop a data set. 5. Interpret the tabulated and graphical statistical data to
4. Mathematical description/ organisation of data draw meaningful conclusions.
- arranging data into tabular and graphical forms.
5. Analysis of data - computations of numbers that 2.1 GROUPING OF DATA
characterise the average size and spread of the To reduce the volume of numerical analysis bulky statistical
sample values. data raw data is usually condensed into classes of equal
6. Interpretation of data- drawing meaningful width. Figure 2.1 gives a summary of the class boundary
conclusions about the qualitative and quantitative definitions that are used when referring to grouped data.
aspects of the sample that provide answers for the
problem described in (1) above. Upper class
Lower class Class width
limit
limit
The phases 1 - 4 above can be categorised as constituting 7.1 7.3
descriptive statistics while phases 5 and 6 constitute
inferential statistics.
7.05 7.20 7.35
1.5 Probability versus Statistics (Lower class (class value) (Upper class
boundary) boundary)
Probability and statistics are two separate but related fields Class interval
in mathematics. Probability investigates the chance or
likelihood that something (a sample) will happen when you
know the possibilities (i.e. the population). Figure 2.1 Class boundary definitions
Statistics, on the other hand, requires a randomly selected Class interval = upper class boundary - lower class
sample that is described (descriptive statistics) and then boundary
inferences are made about the population based on the = 7.35 - 7.05 = 0.30 units
information found in the sample (inferential statistics). Class width = upper class limit - lower class limit
3
= 7.3 - 7.1 =0.2 units if the number of classes constructed is more or less
Class value = (upper class limit + lower class than desired, marginally increase or decrease the
limit) / 2 class width respectively, until the desired number is
7.3 7.1 attained.
= 7.2 units
2
Or Class boundaries for class intervals
Class value = (upper class boundary + lower class There are four ways in which the limits of class intervals are
boundary) / 2 shown. The first three cases in Table 2.1 are different ways
7.35 7.05 of expressing the same limits, the assumption being that
= 7.2 units the sample measurements, for example, the under 5
2 metres (column B), are taken to a sufficiently precise
If ndp is the number of decimal places of the class limit then, degree to warrant including items which are for all practical
Upper class boundary = Upper class limit + ndp /2 purposes equivalent to 5 acres e.g. 4.9897 acres. The limits
= 7.3 + 1/2 =7.35 shown in C are known as class boundaries, the highest
Lower class boundary = lower class limit - ndp /2 value of one class being the lowest value of the next class.
= 7.1 - 1/2 =7.05 In Case D, the assumption is made that measurements are
The class 7.1 -7.3 includes all values between 7.05 recorded to the nearest metre. The class boundaries are
and 7. 3499. The class 7.4 -7.6 includes all values therefore 0 - 4½, 4½ - 9½,9½ - 14½, etc.
between 7.35 and 7.6499, and so on.
Table 2.1 Different conventions for representing
Procedure for Grouping Data Class intervals
Consider, for example, the thickness of 20 samples of a A B C D
steel plate enlisted as follows: - (metres) (metres) (metres) (metres)
7.3, 7.1, 6.6, 7.0, 7.8, 7.3, 7.5, 6.2, 6.9, 6.7, 6.5, 6.8, 7.2, 0- 0 and under 5 0 -5 0-4
7.4, 6.5, 6.9, 7.2, 7.6, 7.0, 6.8. 5- 5 " 10 5 - 10 5-9
This data can be grouped into six classes of equal width as 10- 10 " 15 10 - 15 10 - 14
follows: 15- 15 " 20 15 - 20 15 -19
Determine the range, i.e. the difference between 20- 20 " 25 20 - 25 20 -29
the largest and the smallest data values.
Range 7.8 6.2 1.6 The convention used in C is more appropriate for
Determine the class interval by dividing the range representing data such as age because ages are usually
by the desired number of classes and round off to recorded as at the last birthday.
the number of decimal places (dp.) of the sample NOTE: In all cases the class boundaries must be used for
data. calculations and in making graphs of the distribution.
1.6
Class interval (C.I) = 0.2667 0.3 to 1 dp. 2.2 Tabular Representations of Frequency
6
Distributions
Determine the class width as follows
-n
The frequency distribution of the illustration sample given in
Class width = C.I 1 x 10 dp , the sample in the preceding section is tabulated as follows:
where C.I = Class interval = 0.3 and
ndp = number of decimal places of the Table 2.2 Frequency distribution table
sample data. Variable Tally (1) (2) (3) (4)
=1, hence X [mm] f r.f c.a.f c.r.f
Class width 0.3 1 x 101 0.3 0.1 0.2 6.2 - 6.4 I 1 0.05 1 0.05
Beginning with the least data variable construct the 6.5 - 6.7 IIII 4 0.20 5 0.25
classes by adding the class width to it to give the 6.8 - 7.0 IIII I 6 0.30 11 0.55
upper class limit for the first class and adding to it 7.1 - 7.3 IIII 5 0.25 16 0.80
the class interval to give the lower class limit of the 7.4 - 7.6 III 3 0.15 19 0.95
next class Figure 2.2. Repeat the procedure until all 7.7 - 7.9 I 1 0.05 20 1.00
data is covered within the range. n =20 r.f = 1.0
Frequency, f
4
2.2.4 Cumulative absolute frequency (Column 3)
The cumulative absolute frequency (c.a.f) corresponding to
a given sample variable, X, is the numerical sum of all the 2
frequencies of variables in the sample that are less or equal
to X. For example the c.a.f of the samples in the range 7.4 -
7.6 is 1 + 4 + 6 + 5 + 3 = 19. This implies that 19 samples
have a thickness less or equal to 7.6499 mm. 6.3 6.6 6.9 7.2 7.5 7.8 x
Frequency, f
range 7.4 - 7.6 is 0.05 + 0.20 + 0.30 + 0.25 + 0.15 = 0.95.
4
This implies that 95% of all the samples have a thickness
less or equal to 7.6499 mm.
2
In general when preparing the frequency distribution table
for grouped data the classes should be clearly defined.
They should be of equal width and the number of classes
should preferably lie between 5 and 15. 6.3 6.6 6.9 7.2 7.5 7.8 x
Variable , x
2.3 Graphical Representations of Frequency (b) Dot frequency diagram
Distributions Figure 2.2 Bar Chart and dot frequency diagram
Graphs and diagrams leave a lasting impression on the 2.3.1.2 Dot frequency diagrams and pictograms
mind and make intelligible even to the layman the salient A number of dots, equal to the absolute frequency, are
features of the data. When representing group frequency plotted against each corresponding variable class, Fig 2.2
distributions by graphs, the class values of the variable (x) (b). Pictograms are similar to dot frequency diagrams. In
are plotted along the x- axis while the frequencies are this case suitable pictorial icons replace the dots, with the
plotted along the y -axis. number of the icons per variable giving the frequency.
Graphical plots of frequency distributions can be
categorised into two groups, namely: - 2.3.2 Graphical plots of the absolute frequency
1. Graphical plots of the absolute frequency Here the relative frequency is plotted against the sample
2. Graphical plots of the relative frequency variables. Two common graphs used include the frequency
histogram and the frequency polygon.
2.3.1 Graphical plots of the absolute frequency
These graphs plot the absolute frequency of the sampled 2.3.2.1 Frequency Histogram
data against each corresponding sample variable. This graph is similar in construction to the bar chart. In this
Examples include the bar charts, dot frequency diagrams case the vertical length of each bar is made proportional to
and pictograms. the relative frequency corresponding to the class in
consideration Figure 2.3 (a). As before, all the bars in the
graph are constructed with an equal width that, in the case
2.3.1.1 Bar Chart
of grouped data, is proportional to the class interval.
The bar chart for the illustration sample that was used in
the preceding section is constructed as shown in Fig. 2.2 2.3.2.2 Frequency Polygon / frequency curve
(a) below. The vertical length of each bar is made The frequency polygon is derived from the frequency
proportional to the absolute frequency corresponding to the histogram by drawing a continuous straight line joining the
class in consideration. All the bars in the chart are corresponding relative frequencies that are marked off at
constructed with equal width. In ungrouped samples this middle points of adjacent bars, corresponding to the
width may be chosen arbitrarily whereas in grouped data respective sample points or class values. Figure 2.3(b). If
the width is constructed proportional to the class interval. the corners of the frequency polygon are smoothened out
using a freehand sketch then the resulting graph is called a
Table 2.2 Grouped Frequency distribution table with
frequency curve.
computed class values
Variable Class (1) (2) (3) (4) 2.4 Cumulative Frequency Function and Cumulative
X [mm] value f r.f c.a.f c.r.f Frequency Curve (Ogive)
6.2 - 6.4 6.3 1 0.05 1 0.05 These graphs plot the tabulated cumulative frequency
6.5 - 6.7 6.6 4 0.20 5 0.25 against the sample variables. In the cumulative frequency
6.8 - 7.0 6.9 6 0.30 11 0.55 function is a step function that plots the tabulated
7.1 - 7.3 7.2 5 0.25 16 0.80 cumulative relative frequency (c.r.f) as the y - coordinates
7.4 - 7.6 7.5 3 0.15 19 0.95 and the samples/class values as the x - coordinates, Fig
5
2.4(a). In normal frequency distributions the steps in this y
graph tend to increase in height as you move towards the 20
centre of the distribution.
16
y
Relative frequency, f
0.3 12
c.a.f
8
0.2
4
0.1
0 6.45 6.75 7.05 7.35 7.65 7.95 x
Upper class boundaries
(b) Cumulative frequency curve (ogive)
6.3 6.6 6.9 7.2 7.5 7.8 x
Figure 2.3 Histogram and Frequency Polygon
Variable , x Home Study Exercise 2
(b) Histogram 1. The marks obtained by 70 students in an Applied
y Mathematics exam are as follows:
32, 45, 38, 7, 40, 15, 5, 26, 0, 11, 40, 2, 18, 8, 31, 4, 27, 7,
Relative frequency, f
0.3
0, 15, 12, 35, 28, 46, 9, 29, 10, 34, 2, 7, 5, 17, 2, 8, 35, 30,
11, 36, 47, 19, 16, 0, 18, 16, 14, 2, 38, 41, 42, 17, 45, 28,
0.2 48, 20, 7, 21, 8, 5, 28, 13, 22, 27, 41, 40, 36, 29, 29, 31, 34,
48.
Group this data into seven classes of equal width and
0.1 hence tabulate the frequency distribution. On separate axes
graph the: -
(a) Bar chart
(b) Frequency histogram
6.3 6.6 6.9 7.2 7.5 7.8 x (c) Frequency polygon
Variable , x (d) Cumulative frequency function
(b) Frequency polygon (e) Ogive.
Figure 2.3 Histogram and Frequency Polygon
2. An automatic machine was tested on 40 occasions and
In the cumulative frequency curve (also called the ogive) is found to stitch the following number of bags in one
the free-hand smooth curve that is obtained by joining minute:
points generated when the tabulated cumulative absolute 18, 17, 21, 18, 19, 17, 18, 20, 16, 17, 19, 19, 17, 16,
frequency (c.a.f) is plotted on the y - axis against the upper 15, 19, 17, 17, 20, 18, 17, 18, 19, 19, 18, 19, 18, 18,
class boundaries on the x - axis, Figure 2.4(b). The ogive is 19, 20, 18, 15, 18, 17, 20, 18, 16, 17, 18, 17.
useful in determining the quartiles, deciles or percentiles of Starting with the tally chart, tabulate the frequency
grouped data. distribution. On separate axes graph the: -
y (a) Bar chart
1.0 (b) Frequency histogram
(c) Frequency polygon
0.8
(d) Cumulative frequency function
c.r.f
0.6
(e) Ogive.
TOPIC 3.0
MEASURES OF CENTRAL TENDENCY
Measures of central tendency are numerical values that
tend to locate, in some sense, the middle of a set of data.
Three categories are examined. Namely the: -
Measures of position /quantiles (quartiles, deciles
and percentiles)
6
Mode Step 1- Rank the data into an array starting with the
Measures of average (arithmetic, geometric and lowest - valued piece and proceeding to the
harmonic means) largest valued.
Step 2- Evaluate the position number, q, for the i
th
decile as follows:
L 25% 25% 25% 25% H iN
q , where i 1, 2, 3,...,9.
10
Q1 Q2 Q3 Again: -
Figure 3.1 Quartiles. If iN/10 is not an integer then q is equal
to the next larger integer.
The first or lower quartile (Q1) is a number such that at If iN/10 is an integer then q is equal to
most one-fourth of the data are smaller in value than Q1, (iN/10) + 0.5.
and at most three- fourths are larger. Step 3- Locate the value of Di by counting as before.
The second quartile (Q2) is the median i.e. the numerical
value of the variable that divides the distribution into two For grouped data the deciles are evaluated using the
equal parts. formula: -
The third or upper quartile (Q3) is a number such that at
q Ci
most three -fourths of the data are smaller in value than Q3, Di li h
and at most one-fourth are larger. fi
Procedure for determining the Quartiles with all variables taken as previously defined or determined.
Evaluation of the quartiles of an ungrouped data proceeds
as follows: - 3.1.3 Percentiles
7
Percentiles are number values of the variable that divide a
set of ranked data into 100 equal parts; each set of data
th SOLUTION
has 99 percentiles (Fig. 3.3). The k percentile Pk is a Step #1 The given data is ranked in ascending order as
number such that at most k% of the data are smaller in follows:
value than Pk and at most (100 - k)% of the data are larger.
25 26 26 27 27 27 27 28 28 28
28 28 28 28 29 29 29 29 29 29
30 30 30 30 30 31 31 31 32 32
Ranked data, increasing order
The frequency distribution is therefore:
Variable frequency Cumulative
L 1% 1% 1% 1% 1% H frequency
25 1 1
P1 P2 P3 P98 P99 26 2 3
27 4 7
Figure 3.3 Percentiles 28 7 14
Procedure for determining the Percentiles 29 6 20
Evaluation of the percentiles of an ungrouped data 30 5 25
proceeds as follows: - 31 3 28
32 2 30
Step 1- Rank the data into an array starting with the
Step #2 Evaluate the position numbers for each of the
lowest - valued piece and proceeding to the
required quantities and find them as follows:
largest valued. Median = Q2 = D5 = P50
Step 2- Evaluate the position number, q, for the i
th
2 N 5 N 50 N N
percentile as follows: q
iN 4 10 100 2
q , where i 1, 2, 3,...,99.
100
Again: -
= 30 / 2 =15 (an integer)
If iN/100 is not an integer then q is equal Hence q = 15 + 0.5 = 15.5
to the next larger integer. The median is therefore the arithmetic average of the
If iN/100 is an integer then q is equal to 15th and 16th members, which are both equal to 29.
(iN/10) + 0.5. 29 29
Step 3- Locate the value of Pi by counting as before. Q2 29 washers.
2
For Q1, i = 1
For grouped data the percentiles are evaluated using the
i N 1 30
formula: - q N 7.5 (not an integer)
4 4 4
q Ci
Pi li h Therefore q is taken as the next larger integer which is
fi 8. Counting off gives the 8th member of the ranked data
as 28, and hence Q1= 28 washers.
with all variables taken as previously defined or determined.
For P91, i = 91
NOTE: The following statements are true: - i N 91
Q1 P25 because the position number, q, q N (0.91) 30 27.3
100 100
corresponding to both points is the same, i.e. Again q is taken as the next larger integer which is 28.
1 N 25 N 1 Counting off gives the 28th member of the ranked data as
q N. 31, and hence P91 = 31 washers.
4 100 4
Similarly
Q3 P75 Example 3.2
For the grouped frequency distribution below
Q2 D5 P50 = the median. Variable 6.2 - 6.4 6.5 - 6.7 6.8 - 7.0 7.1 - 7.3
Frequency 1 4 6 5
Example 3.1 Variable 7.4 - 7.6 7.7 - 7.9
The contents of each of 30 packets of washers are Frequency 3 1
rd th
recorded as follows: Evaluate median, the 83 percentile, and the 8 decile
28, 31, 29, 27, 30, 29, 29, 26, 30, 28, 28, 29, 27, 26, 32, 28,
32, 31, 25, 30, 27, 30, 29, 30, 28, 29, 31, 27, 28, 28. Find
st st
the median, the 1 quartile, and the 91 percentile.
8
SOLUTION
Step #1 Draw the cumulative frequency distribution:
Graphical Determination of the Measures of Position
Variable Class f c.a.f The quantiles of a grouped frequency distribution can be
x [mm] value determined graphically as follows.
6.2 - 6.4 6.3 1 1 Draw the cumulative frequency curve (ogive) for
6.5 - 6.7 6.6 4 5 the given data.
6.8 - 7.0 6.9 6 11 Evaluate the position number (q) of the required
7.1 - 7.3 7.2 5 16 Read of the variable (Xj) on the x - axis that
7.4 - 7.6 7.5 3 19 corresponds to q as shown in Fig. 3.4
7.7 - 7.9 7.8 1 20
n =20 y
f x
1
f1 = frequency of the class x i i >(2)
preceding the modal class n
i 1
f2 = frequency of the class n
h
succeeding the modal class.
= class interval.
where n = total sample size = f
i 1
i
a is:
Mode L (U L )
h m
f x
1
x i i
a n
i 1
f (A d )
Figure 3.5 Graphical Determination of the Mode. 1 1
x fi xi i i
n n
Advantages and disadvantages of the mode i 1 i 1
1. Advantages n n
f d
1 1
(a) It is easy to understand fi A i i
(b) Extreme items do not affect its value n n
i 1 i 1
n n
f d
(c) Like the quantiles, only the middle items need to A 1
be known fi i i since f = n
n n
2. Disadvantages i 1 i 1
(a) It is often not clearly defined A xc
(b) Exact location is often uncertain m
f d
1
(c) Arrangement of data is tedious. Where xc = coded mean = i i
n
i 1
3.3 Measures of Average For a tabulated frequency distribution in which the classes
Three common measures are used, namely: have equal class intervals (h), a new coded variable (ui)
Arithmetic mean can be evaluated as:
Geometric mean x A di
Harmonic mean xc i , or di hxc
These three are discussed in the following sections. h h
Therefore the coded mean now becomes:
3.3.1 Arithmetic mean (or just Mean) m m
f (hx )
1 1
The mean or average of a sample is the mathematical ratio xc fi di i c
n n
of the total sum of all the sample variables to the total i 1 i 1
m
f x
sample size. h h
i c fxc
sum of all the sample v alues n n
Mean i 1
total number of samples and x A xc
Several mathematical formulations for determining the
Example 3.3
arithmetic mean are possible depending on the nature of
Find the mean and mode of the following data
the data.
Marks Number of Marks Number of
(a) Mean of an ungrouped data students students
Consider n values of a variable x in a sample that are Under 10 5 Under 60 60
" 20 9 " 70 70
enlisted as x1, x2, x3, , xn . The sample mean x is defined
" 30 17 " 80 78
as:
1
x1 x2 x3 xn 1 xn
x " 40 29 " 90 83
n " 50 45 " 100 85
or using the sigma - notation as:
n
x
1
x i >(1)
n
i 1
If the sample has variables x1, x2, x3, , xn each with
frequencies of f1, f2, f3,, fn respectively, then the frequency
- dependent sample mean x is stated as:
10
Consider deviations about an assumed mean A that is
SOLUTION chosen arbitrarily.
f x A f x x x A
First convert the given cumulative frequency distribution into 2 2
a continuous distribution and then choose an appropriate
xi A d i
f x x 2x x x A x A2
2
assumed mean, A. determine xc . f x x 2 2x A f x x f x A2
h h
A = 55 and h =10 f x x 2 f x A2, since f x x 0
Marks Class cf. f fx di xc f xc The square sign in the second term of this equation
value
means that x A2 is always positive and hence the
x
0 - 10 5 5 5 25 - 50 -5 - 25 expression is minimum when A x , i.e. x A2 =0.
10 - 20 15 9 4 60 - 40 -4 - 16
This implies that the sum of the squares of the
20 - 30 25 17 8 200 - 30 -3 - 24
deviations of a set of values is minimum when taken
30 - 40 35 29 12 420 - 20 -2 - 24
40 - 50 45 45 16 720 - 10 -1 - 16
about the mean.
50 - 60 55 60 15 825 0 0 0 If x1 and x2 are the means of two samples of
60 - 70 65 70 10 650 10 1 10 sizes n1 and n2 , then the mean M of the combined
70 - 80 75 78 8 600 20 2 16
80 - 90 85 83 5 425 30 3 15 sample of size n1 n2 is
90- 100 95 85 2 190 40 4 8 n1x1 n2 x2
Total n = f =85, fx = 4115 f xc = - 56 M
n1 n2
(i) Apply the formula to determine the mean In general if x1 , x2,, xk are the means of k
m
fd f x x
which is incorrect.
fx The correct fx 10,000 (92 8) (192 88 ) 10,180
fx x f , but x Hence the corrected mean is
f
fx corrected fx 10,180
fx f 0, x 50.9
f n 200
The sum of the squares of the deviations of a set of
values is minimum when taken about the mean.
Proof Treatment of an open-end grouped sample
11
An open- end class is one in which the lower or upper Consider n values of a variable, x stated as x1, x2, x3,, xn .
boundary, for the first or last class respectively, is not given.
The harmonic mean of these variables is defined
For example the grouped data whose classes are enlisted
mathematically as:
as below 15, 15 - 19, 20 - 24, 25 -29, 30 and above, is open
1
- ended. (Fig. 3.6) H
1 1 1 1 1
Variable Frequency n x1 x2 x3 xn
Below 15 2 For example the harmonic mean of 4, 8 and 16 is
15 - 19 18 1 48
20 - 24 300 H 6.855
1 1 1 1 7
25 - 29 40
3 4 8 16
30 and above 1
The harmonic mean is useful when averaging rates such as
Figure 3.6 Open-ended grouped sample.
the speed (distance / time taken) of a moving object that
A reasonable assumption about the size of the open - end covers different parts of the distance with different speeds,
classes is necessary to facilitate computation. The best machine output per minute, etc.
guide to its probable size is the pattern of the distribution. In
the above example, since the class width is fixed at 4, the Harmonic Mean of Grouped Data
first class can be taken as 10 -14 and the last class as 30 - If the frequencies of variables x1, x2, x3, , xn are
34. The error introduced by making this assumption is f1, f2, f3,, fn respectively, then the harmonic mean H is
minimal.
given by:
Advantages and disadvantages of the arithmetic mean 1
H
1. Advantages 1 f1 f2 f f
3 n
n x1 x2 x3
(a) It is rigidly defined and easy to understand
(b) It is based on all the sample variables
xn
1 1
(c) It is amenable to algebraic treatment
f / x
n 1
(d) It is easy to calculate 1 fi
(e) It is unique and always exists n
n xi
(f) It is least affected by fluctuations of sampling, i.e. i 1
it is a stable average. Example 3.5
2. Disadvantages A car covers the first 200 km of a 400 km journey with an
(a) It is unduly affected by extreme values average speed of 120 km/h and the remaining distance with
(b) It may not be present in actual data a speed of 90 km/ h. Find the time taken for the entire
(c) It cannot be located by inspection journey.
(d) It may lead to wrong conclusions in the absence of
the raw data. SOLUTION
The car travels 200 km at 120 km/h and 200 km at 90 km /
h. Since the distance in both cases is constant, the average
3.3.2 Geometric Mean
speed is the harmonic average of 120 km/ h and 90 km/ h
Consider n values of a variable, x stated as x1, x2, x3,, xn . 1 720
The Geometric mean G is defined as: - Average speed = 102 .86 km/h
1 1 1 7
G x1 x2 x3 xn n
1
2 120 90
For example the geometric mean G of 4, 16 is Total distance 400
Time taken = 3.89 hours
G 4 16 64 8
1
2 average speed 102 .86
The geometric mean is used to determine parameters like
the rate of interest or rate of population growth. It is also Example 3.6
used in the construction of index numbers. Find the geometric and harmonic means of the following
Geometric Mean of a Grouped Data distribution:
If the frequencies of variables x1, x2, x3, , xn are Income
No. of workers
[K£ / week]
f1, f2, f3,, fn respectively, then the geometric mean G is
0 - 10 5
given by:
10 - 19 8
1
f log x
1
fi log xi
n n
i 1
12
SOLUTION
First prepare the following table for computation
Class x f log x f log x f/x
0-9 4.5 5 0.6532 3.2661 1.1111
10 - 19 14.5 8 1.1613 9.2909 0.5517
20 - 29 24.5 3 1.3891 4.1675 0.1224 TOPIC 4.0
30 - 39 34.5 4 1.5378 6.1513 0.1159 MEASURES OF DISPERSION
20 22.8758 1.9011 Measures of dispersion are numerical values that describe
the amount of spread or variability that is found among a
The geometric mean is evaluated as: set of data. Five common measures of dispersion are: -
f log x Range
log G Mean deviation
n
22.8758 Quartile deviation
1.1438 Variance
20
G = 13.93 14 K£ / week Standard deviation
The harmonic mean H for this data is:
1 n Section objectives
H At the end of this section you should be able to:
1
f /x
f /x 1. Describe the measures of dispersion
n 2. Compute the range, the mean (absolute) deviation,
20
10 .52 quartile deviation, variance and the standard deviation
1.9011 of an ungrouped or grouped set of sampled data.
11 K£ / week 3. Describe the characteristics of given data samples using
the computed measures of dispersion.
Tutorial Exercise 3.1 Consider the results of two students A and B in seven
1. For the following distribution evaluate the: - examinations:
(i) Mode
(j) Median A: 79 20 39 100 96 22 32
(k) P45 B: 48 61 56 53 58 57 55
(l) D2
It can be shown by computation that the average score for
(m) P85 + D4
both students is 55 marks. Therefore it may be concluded
(n) Q3
that both students are equally good performers, but this is
(o) Arithmetic mean
not so. The scores of Student A vary more than those of B
(p) Geometric mean, and
implying that Student B is more consistent and hence
(q) Harmonic mean
dependable.
x 5-9 10 - 14 15- 19 20 - 24 25 - 29 30 - 34 In general closely grouped data yields smaller values of
spread while largely spread -out data will have larger
f 4 9 16 12 6 3
values for the various measures of dispersion.
2. A company which manufactures bulbs has four 4.1 Range
machines on which the bulbs can be made. Owing to The range is the difference between the largest and the
differences in age and design these machines run at least values of the variate.
different speeds, as follows:
Range = largest variable - least variable
Machine Number of bulbs per minute
A 2 The bigger the range the larger the dispersion. For example
B 3 the range for Student A is (100 - 20) = 80, while the range
C 5 for B is (61 - 48) = 13. Hence A has much more dispersion
D 6 than B even if they have the same average score.
The range is not a good measure of dispersion because it
depends on extreme values of the variable while ignoring
(a) When all the machines are running what is the total
all others, and also it gives no information about the spread
number of bulbs produced per hour? [Ans. 288 bulbs]
of data within it.
(b) Over a period of three hours, machines B, C and D 4.2 Mean Deviation
were run for the first 2 hours, and machines A, B and D
The mean deviation (or sometimes called the mean
only were run for the last hour. What was the average
absolute deviation) is the arithmetic mean of the absolute
number of articles produced per hour over the 3-hour
deviations of all the variables about a central value.
period? [231.4 231 bulbs per hour]
Consider a sample of variables x1, x2, x3, , xn with an
arithmetic mean x . The mean deviation MD is:
xx
MD
n
where x x is the sum of the absolute deviations
taken about the mean.
13
For grouped frequency distributions the mean deviation is 2
f (x x ) 1
defined as: 2 f x 2 2 xx x 2
n n n
f x x fx f
fx2 2x x 2
f x x
1 1
MD i i
n n n n n
i 1
fx2 2x 2 x 2
1
A Comparison of the results of students A and B by using n
fx2 x 2
the mean deviation about the known mean ( 55%) 1
proceeds as follows: n
2
2 fx
fx x fx
1 2 1
Student A Student B Hence 2 2
>(b)
x n n n
x 55 x x 55
For ungrouped data we have:
79 24 48 7
2 x 2 x 2
1
20 35 61 6 >(c)
n
39 16 56 1
100 45 53 2 The coded mean technique for determining the
96 41 58 3 variance of a grouped distribution.
22 33 57 2 x A di
Let xc i , where A is the assumed mean h is
32 23 55 0 h h
217 21 the class interval and xc is the coded variable. The variance
xx 217 is defined as:
The MD For A = 31 marks, and 1
n 7 2 h2 fxc 2 xc 2
xx 21 n
For B = 3 marks 1 fxc
2 > (d)
n 7 h2 f ( xc )2
Smaller values of the mean deviation indicate more n n
consistency and reliability. As such student B is more 4.5 Standard Deviation
consistent and reliable than student A. The mean deviation
The standard deviation of a sample (denoted by ) is the
is based on all the observations and is a better measure of
positive square root of the variance:
dispersion than the range or the quartile deviation. It is least
when taken from the median. However, taking of absolute n
x x
1 2
values, i.e. ignoring the signs makes it artificial and renders 2 i >(e)
n
it useless for further mathematical analysis. i 1
For grouped frequency distributions:
4.3 Quartile Deviation
2
f (x x )
fx x
As seen earlier quartiles divide the sample distribution into 1 2
or 2
>(f)
four equal parts. The quartile deviation or semi - n n
interquartile range Q is given by: Using the coded mean (also called the step deviation
Q Q3 Q1
1 method):
2 1 fx
2
1
where Q1 = first (or lower) quartile; Q3 = third (or h fxc 2 xc 2 h f ( xc )2 c >(g)
n n n
upper) quartile, and Q3 Q1 = Inter -quartile range.
The Quartile deviation takes into account 50% of the data. Example 4.1
For the frequency distribution given below evaluate the
4.4 Variance standard deviation using
The variance of a sample, denoted by Var (x) or 2 , is the (a) the square sum of the deviations
(b) the coded mean technique
numerical average of the sum of the squares of the
Temperature
deviations from the arithmetic mean ( x ). Expressed o Frequency
( C)
mathematically: 30.0 - 30.2 6
n
x x
1 2 30.3 - 30.5 12
2 i
n 30.6 - 30.8 15
i 1
30.9 - 31.1 20
For grouped frequency distributions:
n
31.2 - 31.4 13
f (x x )
2
f x x
1 2 31.5 - 31.7 9
2 i i >(a)
n n 31.8 - 32.0 5
i 1
The equation (a) can be modified to give an alternative
formula as follows:
14
SOLUTION Example 4.2
(a) First prepare the following table for computation.
fx 2476 .7 Show that s 2 2 d 2, where d x A and A is an
x 30 .96 31 ; d = x x arbitrary number.
n 80
2 2 2
Temp. x f fx d d fd fx
SOLUTION
30.0 - 30.2 30.1 6 180.6 -0.9 0.81 4.86 5436 By definition, the LHS
30.3 - 30.5 30.4
30.6 - 30.8 30.7
12
15
364.8
460.5
-0.6
-0.3
0.36
0.09
4.32
1.35
11090
14137
2 1 2 1
s f x A f x x x A
2
n n
f x x 2x x x A x A
30.9 - 31.1 31.0 20 620.0 0 0 0 19220 1 2 2
n
f x x
31.2 - 31.4 31.3 13 406.9 0.3 0.09 1.17 12736
2 f
f x x 2x A x A
1 2
31.5 - 31.7 31.6 9 284.4 0.6 0.36 3.24 8987
n n n
31.8 - 32.0 31.9 5 159.5 0.9 0.81 4.05 5088
f x x x A , since f x x 0
1 2 2
Totals 80 2476.7 18.99 76,694
n
n
fd
2
2
1 2
18.99
0.487 oC f (x x )
But and x A d ,
n 80 n
i 1
OR Therefore s 2 2 d 2
15
Q3 Q1 2Q2 (b) For a set of 25 measurements the mean and the
Bowley's coefficient = standard deviation were found to be 56 cm and 2
Q3 Q1
cm respectively. Later on it was found that a
mean mod e
Karl Pearson's coefficient = mistake had been made in one of the
measurements which was recorded as 64 cm
3 instead of 61 cm. Find the correct value of the
Coefficient of skewness = mean and the standard deviation. [55.88,1.56 cm]
3
4. Compute the different measures of skewness for the
3 2 following distribution: 3 to 7 (2 counts), 8 to 12 (108
The coefficients 1 and 1 , given by 1 and
23 counts), 13 to 17(80 counts), 18 to 22 (175 counts), 23 -
27 (80 counts), 28 to 32 (32 counts), 33 to 37 (18
1 1 , are also used to measure the coefficient of counts), 38 to 42 (5 counts). [Bowley's 0.22, Pearson's
skewness of a frequency distribution. 0.69, 3 3 =1.157]
NOTE: The coefficient of skewness is a number that equals 5. Calculate the first four moments about the mean and
zero for a symmetric distribution. In general Bowley's hence find 1 and 2 and comment on the results.
coefficient lies between + 1, while Pearson's coefficient lies
[ 1 3 4 0;2 2;1 0,2 2.75 , The curve is
between + 3.
The relative flatness of the top of a frequency distribution is symmetric and slightly platykurtic]
x 0 1 2 3 4 5 6 7 8
called Kurtosis. It is given by the coefficient 2 4 2 and f 1 8 28 56 70 56 28 8 1
2 2 3 gives the deviation from Kurtosis. For normal
curves (i.e. curves which are neither flat nor sharply TOPIC 5.0
peaked), 2 0 . If 2 0 then the curve is platykurtic i.e. RANDOM EXPERIMENTS AND EVENTS
flatter than the normal curve and if 2 0 the curve is Statistics is concerned with the analysis of real
leptokurtic i.e. more sharply peaked than normal. experimental data that is random in nature. To some
extent, the results of some experiments depend on chance.
Home Study Exercise 4 For example in an experiment to measure the diameters of
1. (a) Find the mean and median age from the following steel rods that are randomly selected from a batch, it is
cumulative frequency table: normal to expect differences in the results observed.
Cumulative Cumulative Probability is the study of random experiments.
Age in years frequency Age in years frequency A random experiment is one that is:
performed according to a fully described set of
20 - 25 21 45 - 50 166 rules
25 - 30 40 50 - 55 176 can be repeated arbitrarily often, and
30 - 35 90 55 - 60 186 whose result(s) depends entirely on chance.
35 - 40 130 60 - 65 195 This section introduces the terminology used in discussing
40 - 45 146 65 - 70 199 probability. Particular reference is made to the convenient
[Answers: 38.6, 36.2] graphical representation of experimental results in Venn
(b) Using the same axis graph the histogram and the diagrams.
frequency polygon for this data.
(c) Calculate the mean deviation, quartile deviation, Section objectives:
and the standard deviation. At the end of this section you should be able to:
2. The lengths, in millimetres, of 40 bearings were 1. Describe the basic terminology used with respect
determined with the following results: to probability such as random experiments,
16.6, 15.3, 16.3, 14.2, 16.7, 17.3, 18.2, 15.6, 14.9, outcomes, and sample space.
17.2, 18.7, 16.4, 19.0, 15.8, 18.4, 15.1, 17.0, 18.9, 2. Distinguish between the different kinds of events
18.3, 15.9, 13.6, 18.3, 17.2, 18.0, 15.8, 19.3, 16.8, such as equally likely, mutually exclusive/non-
17.7, 16.8, 17.9, 17.3, 16.6, 15.3, 16.4, 17.3, 16.9, exclusive, exhaustive, favourable and dependent
14.7, 16.2, 17.4, 15.6 and independent events.
(a) Starting with 13.5, group the data into six 3. Represent samples and events using set
classes of equal width and tabulate the notations.
frequency distribution. 4. Use Venn diagrams to represent sample spaces
(b) Calculate the standard deviation using the and their events.
(i) square sum of deviations
(ii) the coded mean technique [1.299 mm] 5.1. BASIC TERMINOLOGY
3. (a) In two samples of sizes n1 and n2 , the means are The following are definitions of some of the terms that are
used in the study of probability.
m1 and m2 ,and the standard deviations are 1 and
2 . If the two samples are pooled together to form 5.1.1 Trial and Outcome (or Event)
A trial is the single performance of a random experiment.
a sample size N n1 n2 , show that the standard The result of each trial is called an outcome or event. For
deviation of the composite sample is: example tossing a coin is a trial that has two possible
2 2 outcomes, either head (H) or tail (T). Rolling a die is a trial
n11 n22
1 2 m1 m2 2
nn
that has six possible outcomes, either 1, 2, 3, 4, 5 or 6.
N N Events can be simple or compound depending on the
nature of the trial.
16
A simple event is a single possible outcome of an Find A, B and C and state the most likely score for the
experiment. For example getting a H or T when one coin is experiment. How many outcomes give a score of at most
tossed. 11?
A compound event combines the outcomes of two or more
trials into one event. For example if two coins are tossed SOLUTION
simultaneously the outcome H on the first and T on the First generate the set of all possible outcomes for the
second is a compound event. All the compound events in experiment rolling two dice simultaneously.
this experiment are HH, HT, TH, and TT. Sample space
1 2 3 4 5 6
1 (1, 1) (1, 2) (1, 3) (1, 4) (1, 5) (1, 6)
5.1.2 Sample Space 2 (2, 1) (2, 2) (2, 3) (2, 4) (2, 5) (2, 6)
The set of all the possible outcomes of a trial (i.e. the single 3 (3, 1) (3, 2) (3, 3) (3, 4) (3, 5) (3, 6)
performance of a random experiment) is called the sample 4 (4, 1) (4, 2) (4, 3) (4, 4) (4, 5) (4, 6)
space. It is denoted by S. For example: 5 (5, 1) (5, 2) (5, 3) (5, 4) (5, 5) (5, 6)
When a coin is tossed S H,T . The cardinal 6 (6, 1) (6, 2) (6, 3) (6, 4) (6, 5) (6, 6)
S S S S
B B
i B
A
iii
A A A
B
ii
(b) (c) (d)
iv
Fig. 5.3 The shaded areas are (a) A or A
(b) A B (disjoint events) (c) A B (non -disjoint
Figure 5.2. A Venn diagram
Each possible outcome in the sample space is assigned to events) (d) A B
one of the four regions to consider (marked i to iv in Fig. 5.2.2 Intersection
5.2): The intersection of two sets A and B is the set of all
(i) outcomes that belong to event A but not event B elements that are common to BOTH A AND B, Fig. 5.2(d).
(ii) outcomes that belong to event B but not event A The intersection of A and B is denoted by A B, which is
(iii) outcomes that belong to both event A and event B read as “ A intersection B”. Thus
(iv) outcomes that belong to neither event A nor event A B A ANDB {x | x A and x B}
B
Example 5.2 It follows that if A B then nA B nA In general
A six-sided fair die is thrown. Let event A be the 'number nA B nA and nA B nB . Two events A and B
obtained is divisible by two' and event B 'the number is are mutually exclusive or disjoint events if A B .
divisible by 3'. Draw a Venn diagram to represent these Mutually exclusive events cannot occur together, for
events. example, in tossing a die, the event that a head turns up
SOLUTION and the event that a tail turns up cannot occur at the same
The sample space S for the experiment time, in a single trial. Mutually non-exclusive events, on
S = {1, 2, 3, 4, 5, 6} the other hand, can occur simultaneously. For example in
A ={2, 4, 6} rolling a die the event that a multiple of three and the event
B = {3, 6} that a multiple of two turns up occur together when a 6 is
The Venn diagram is therefore. thrown.
S 5.2.3 Union
A
2 The union of two sets A and B is the set consisting of all
elements that are members of EITHER A OR B or of both A
6
B AND B, Fig. 5.2(c). Thus
4
A B A OR B { x | x A or x B, or x A and x B}
3 5
1 In this case nA B nA nB nA B . Again if A B
then nA B nB . For mutually exclusive or disjoint
events,) nA B nA nB, since nA B n 0 ,
5.2.1 Complement Fig. 5.2 (b).
The complement of a set A (denoted by A or A and read
5.2.4 Set relationships for more than two events
as "not A'') is defined as the set of all elements in the
sample space S that are not in the set A, Fig 5.3(a). In set Consider n events A1, A2, A3,, An in a sample space S,
builder notation: then the union of A1, A2, A3,, An is denoted by
A (or A ) { x | x S and x A} A1 A2 A3 ,,An . Similarly the event consisting of all
where the symbol is used to mean “ is a member of” and the outcomes that belong to every one of the sets is
the symbol means “is not a member of”. A1 A2 A3 ,,An . If for any pair of value i, j with i j
For example in the trial tossing a coin if event A is "a head Ai A j then Ai and A j are said to be mutually
turns up" then A or A is the event "a tail turns up". The exclusive or disjoint.
sum of the cardinal number of a set A and that of its
complement should always equal the cardinal number of
the sample space, i.e.
18
5.3 Sequential Experiments Find R, S and T. How many outcomes include at least
Often the main experiment or trial consists of a sequence of one tail?
sub-experiments, which may or may not influence the 2. (a) A fair coin is tossed twice. List a sample space
outcome of the next one. If the outcome of the first event showing the possible outcomes.
does not affect the outcome of the next event then sub- (b) A biased coin (it favours heads in the ratio 5 to 1) is
experiments and their outcomes are said to be tossed twice. List a sample space showing the
independent. If the outcome of a preceding experiment possible outcomes.
affects the outcome the next one, then the event that [(a) S HH,HT, TH, TT and (b) S HH,HT, TH, TT.
comes second is said to be dependent on the first. Note: The sample space listings are identical because sample
Sequential experiments form the basis of conditional space listings do not indicate relative probabilities.]
probability. 3. Two dice are rolled simultaneously to observe the
NOTE: By definition independent events are different from numbers that appear on the faces that turn up. If the
mutually exclusive events, and therefore the two should not events A, B and Care defined as follows: -
be taken to have the same meanings. A: both dice turn up the same number
B: the sum of the outcomes is at most 9
5.4 Sampling C: both dice turn up an odd number.
In the quality control of mass produced components it Find A, B and C and draw the Venn diagram for the
would be both time-consuming and uneconomical to sample space, S. Show the following set relationships
using Venn diagrams: (i) A B (ii) B A C
subject each component to full inspection. In practice a
sample is randomly selected as being representative of the
whole population (or total output of components). This 4. A coin is tossed and a head or tail is observed. If a
sampling can be done with or without replacement from a head results the coin is tossed a second time. If a tail
st
larger finite batch of such components. results on the 1 toss, a die is rolled. Construct the
Sampling with replacement is a sequential sample space and find how many outcomes have at
experiment in which each unique component that is most one head.
withdrawn from an isolated batch is replaced before
the next random selection is performed. The TOPIC 6.0
sample space remains unchanged throughout and INTRODUCTION TO PROBABILITY
experiment yields independent outcomes
Sampling without replacement is a sequential Probability is a measure of the likelihood that a particular
experiment in which each unique component that is outcome occurs in a random experiment. The ability to
withdrawn from an isolated batch is not replaced predict likely occurrences has applications in industrial
before the next random selection is performed. The quality control and efficient use of resources.
sample size varies and the experiment yields
dependent outcomes. Section objectives
At the end of this section you should be able to: -
5.5 Tree diagram representations of sample spaces 1. Differentiate between empirical and classical probability
Tree diagrams branch -structured pictorial construction that 2. Understand the concept of statistical regularity
is used to represent the sample space of experiments that 3. State and apply the laws of probability.
are performed sequentially. Each 'branch' within the 4. Solve problems that involve conditional probability.
diagram shows a possible outcome. For example, in an
experiment where one coin is tossed twice in succession, 6.1 INTRODUCTION
the sample space has been determined as There are two approaches to determining probabilities: -
S HH,HT, TH, TT. Fig. 5.4 illustrates the tree diagram Empirical (or experimental) probability
representation of this sample space. Classical (or theoretical) probability
21
lies either outside or inside the closed curve of any
particular event.
With two choices (inside or outside) for each of the n closed
n
curves, there are 2 different possible combinations to
characterise R. 6.4 CONDITIONAL PROBABILITY
n Conditional probability is the probability that a particular
The 2 regions will break down into n + 1 types with the
numbers of each type as follows: - event occurs given the occurrence of another, possibly
related, event.
n
C0 1 no events;
Let A be an event in the sample space S (i.e. A S). The
n
C1 n one event but no intersections; probability that an event B occurs given that A has already
occurred is
n
C2 1 n(n 1) two-fold intersections; P( A B)
2 P B / A
P ( A)
n
C3 1 n(n 1)(n 2) three-fold intersections
3! P( A B) P( A) P(B / A) > (a)
where P(B/A) is the conditional probability of B given
n
Cn 1 n - fold intersections. that event A has already occurred.
Similarly
When the probabilities are combined to calculate the
P (B A) P ( A B )
probability of the union of the n events, account must be P A / B
taken of the double, triple, etc. counting. This result can be P (B ) P (B )
shown to be: P( A B) P(B) P( A / B) > (b)
P ( A1 A2 An ) From (a) and (b) it can be seen that
P ( Ai ) P ( Ai A j ) P ( Ai A j Ak )
i i, j i , j ,k P( A B) P( A) P(B / A) P(B) P( A / B) > (c)
1n 1P ( A1 A2 An )
Each summation runs over all possible sets of subscripts, The last equation (c) is called the multiplication rule for
but omitting those in which any two subscripts in a set are arbitrary events in sequential experiments.
the same. In terms of Venn diagrams P(B/A) can be interpreted as the
For example, probability of B in the reduced sample space of A.
If n = 2 Example 6.7
P( A1 A2 ) P( A1) P( A2 ) P( A1 A2 ) Two dice are rolled simultaneously. Find the probability that
one of the dice turns up a 2 given that the total score is 4.
If n = 3
P ( A1 A2 A3 ) P ( A1) P ( A2 ) P ( A3 ) SOLUTION
- P ( A1 A2 ) P ( A1 A3 ) P ( A2 A3 ) Sample space
P( A1 A2 A3 ) 1 2 3 4 5 6
and so on. 1 (1, 1) (1, 2) (1, 3) (1, 4) (1, 5) (1, 6)
2 (2, 1) (2, 2) (2, 3) (2, 4) (2, 5) (2, 6)
Example 6.6
3 (3, 1) (3, 2) (3, 3) (3, 4) (3, 5) (3, 6)
Find the probability of drawing from a pack a card that has 4 (4, 1) (4, 2) (4, 3) (4, 4) (4, 5) (4, 6)
at least one of the following properties: 5 (5, 1) (5, 2) (5, 3) (5, 4) (5, 5) (5, 6)
A: it is an ace 6 (6, 1) (6, 2) (6, 3) (6, 4) (6, 5) (6, 6)
B: it is a spade n(S) = 36
C: it is a black honour card (ace, king, Let A: the total score is 4
queen, jack or 10) B: a die turns up a 2
D: it is a black ace A = {(3, 1), (2, 2), (1, 3), P(A) = 3/36 or 1/12
B = {(2, 1), (1, 2), (2, 2), (2, 3), (2, 4), (2, 5), (2, 6), (3,
SOLUTION 2), (4, 2), (5, 2), (6, 2)}
There are four events. And hence 24 =16 regions which AB = {(2, 2)}, P(AB) = 1/36
can be broken down into Therefore, by definition
1
4
C1 4 regions with one event. Measuring all probabilities P( A B)
P B / A
1
36
in units of 1/52, the individual ones are P ( A) 1/ 12 3
P(A) = 4, P(B) = 13, P(C) =10, P(D) =2
4
There are C2 6 two-fold intersection probabilities,
Multiplication rule for Independent events
P( A B) 1 P( A C ) 2 P( A D) 2 Two events A and B are statistically independent if the
P(B C ) 5 P(B D) 1 P(C D) 2 probability of the simultaneous occurrence of both events is
4 P( A B) P( A) P(B / A) P(B) P( A / B) P( A) P(B)
There are C3 4 three-fold intersection probabilities,
It can be seen that for independent events
P( A B C) 1 P( A B D) 1 P(B / A) P(B) and P( A / B) P( A) , the occurrence
P( A C D) 2 P(B C D) 1 of one event does not affect the probability of occurrence of
4
There is C4 1four-fold intersection probability the next event.
Also if A and B are independent events, then
P( A B C D) 1 (in units of 1/52)
Hence P( A B ) P( A) P(B )
P( A B C D) P( A B) P( A ) P(B)
1
[4 13 10 2 1 2 2 5 5 2 P( A B ) P( A ) P(B )
52
(1 1 2 1) 1]
22
=20/52.
A class has ten boys and five girls. Three students are
Example 6.8 selected at random from the class without replacement.
Let A and B be two independent events in S such that P(A) What is the probability that
= 1/2 and P(AB) = 2/3. Find (i) P(B) and (ii) P(B / A) . (i) All three are girls?
(ii) The first two students chosen are boys and the third
is a girl?
SOLUTION
(i) By definition (iii) The first and the third are of the same sex while the
P ( A B ) P ( A) P (B ) P ( A B ) second is of the opposite sex.
P ( A) P (B ) P A P B
P(A) [1 P A]P B SOLUTION
Let the events
P A B P ( A)
2 1 Gi: ith girl is chosen
P B 3 2 1
ith boy is chosen
1 P A
Bi:
1 3
2 n(G) = 5; n(B) = 10; n(S)= 10 + 5 = 15 students
(i) P(All are girls) =
(ii) For independent events A and B
P(G1 G2 G3 ) P(G1)P(G2 / G1)P(G3 / G1 G2 )
P( A B ) P A P (B )
P (B / A) P (B ) 5 4 3 2
P ( A) P ( A)
1 2 15 14 13 91
1 (ii) P(First two are boys and the third is a girl) =
3 3
P(B1 B2 G1) P(B1)P(B2 / B1)P(G1 / B1 B2 )
The multiplication rule for probabilities can straightforwardly 10 9 5 15
be extended to several events. For example consider three
15 14 13 91
events A, B and C.
(iii) P(first and the third are of the same sex while the
P A B C P (C ) P A B second is of the opposite sex) =
PB C P A / B C
PC P B / C P A / B C P {(B1 G1 B2 ) (G1 B1 G2 )
OR = P(B1 G1 B2 ) P(G1 B1 G2 )
= P A P B / A P C / A B 10 5 9 5 10 4 5
=
OR 15 14 13 15 14 13 21
= P B P C / B P A / C B , etc.
The following are important relationships involving Example 6.9 could alternatively have been solved using the
conditional probabilities: tree diagram representation Figure 6.2.
For a set of mutually exclusive events Ai, whose 8/13 B
union is A. For some other event B,
P A / B
B
P( A / B) i
9/14
5/13 G
i B 9/13 B
10/15
This is the addition law for conditional probabilities 5/14 G
If the set of mutually exclusive events AI exhausts 4/13 G
the sample space S then the probability P(B) of an
event B in S is given by the total probability law 10B, 5G
9/13 B
which is stated as:
P (B ) P B / A P A , or more generally
i
i i 5/15 10/14 B
4/13 G
G 10/13 B
as
P B / A P A / C
4/14 G
P (B / C ) i i
3/13 G
i
B = Boy, G = Girl
where C is any other event in S. Figure 6.2 Tree diagram representation of
Example 6.9
In a sampling experiment there are two ways in which
objects may be drawn into the sample, namely: Tutorial Exercise 6.1
(i) Sampling with replacement, and 1. A box contains 10 screws, 3 of which are defective.
(ii) Sampling without replacement. Two screws are drawn at random. Find the probability
that none of the screws is defective if they are drawn
In Sampling with replacement each object drawn is with and without replacement [49%, 47%]
returned to the set before the next object is randomly drawn. 2. A batch of 100 iron rods consists of 25 oversized, 25
Each subsequent selection is independent of the previous undersized and 50 rods of the desired length. If two
one. rods are drawn at random without replacement, what is
In Sampling without replacement each object drawn put the probability of obtaining:
aside from the set before the next object is randomly drawn. (a) Two rods of desired length [24.75%]
This kind of sampling continuously reduces the sample size (b) One of desired length [30.5%]
therefore making sequential experiments dependent on one (c) None of desired length [24.75]
another. (d) Two undersized rods [6.06%]
3. A box of 100 gaskets contains 10 gaskets with type A
Example 6.9 defects, 5 gaskets with type B defects and 2 gaskets
23
with both types of defects. Find the probability that a There are three empty spaces on a bookshelf and there are
gasket with a type B defect is drawn given that it has a seven different books available. Determine the number of
type A defect. [20%] ways in which these books can be arranged in the available
spaces if the arrangement is done (a) without repetition and
Bayes' Theorem (b) with repetition.
From the multiplication rule it has been shown that
P( A B) P( A) P(B / A) P(B) P( A / B) , from which we SOLUTION
Take n = 7 and k = 3
obtain Bayes' theorem
(a) without repetition,
P ( A)
P( A / B) P (B / A ) Number of permutations
P (B ) n Pk
This theorem shows that P(B / A) P( A / B) unless P(A) n! 7!
=P(B). n k ! 7 3!
7 6 5 4!
210
Odds 4!
Probabilities can also be expressed using 'Odds'. If the (b) with repetition
k 3
odds in favour of an event A are a to b (or a: b), then Number of permutations = n = 7 = 343
25
2 and 12 (both inclusive). The specific values that X 7.2.1 Discrete Probability Distribution
takes depend on chance. Let x1, x2, x3, be the value for which X has the distinct
(ii) When two fair coins are tossed, the number of heads
that turn up (X) is an integer equal to 0, 1 or 2. probabilities p1, p2, p3, respectively, such that
(iii) If three screws are randomly selected from a box P( X x1) p1 and so on. The probability function of X is
containing left-hand and right-hand screws, the stated as: -
f x p j
number of left-hand screws drawn (X) is 0, 1, 2 or 3.
when X x j (j 1,2,3...)
In a given experiment, 0 otherwise
the event corresponding to a number "a" is denoted Since P(S) = 1
p
by "X = a" and the corresponding probability that
this event occurs by P(X = a). f ( x j ) 1 or j 1
the probability that X assumes any value in the j 1 j 1
interval a < X < b is denoted by P(a < X < b) Similarly
P(X < c) is the probability that X assumes a value
smaller than c or equal to c.
P (a X b ) f ( x ) a
a x j b
j p
x b
j
26
SOLUTION
limits (often - to ). Examples include height, volume,
S = {1, 2, 3, 4, 5, 6}, n(S) = 6 weight, temperature, etc.
P x X x dx f (x )
1 2 3 4 5 6
F(x) = P(X<x)
6 6 6 6 6 6
For a discrete random variable,
For any continuous random variable with a probability
f (x ) 1
function f(x) to be a pdf, then
j a2
j 1
For the probability distribution f ( x )dx f ( x )dx 1
a1
6
1 1 1 1 1 1 6
f (x j ) 1 where a1 and a2 are the limits that define the domain of the
6 6 6 6 6 6 6 random variable.
j 1
Therefore X is a discrete random variable. Similarly,
x2
0.5
1 2 3 4 5 6 x x
Distribution function a1 x1 x2 a2
f ( x )dx 1
Find k. Find x such that (a) P(X< x) = 10%. (b) P(X< x)
= 95%. [k=0.5, x=0.632, 1.95]
5. Suppose that certain bolts have lengths L=200 + X
For the given function mm, where X is a random variable with density
1
3
2x 3dx 1
1 f ( x ) 1 x 2 when 1 x 1and 0 otherwise.
f ( x )dx 4
4
0 Determine c so that with a probability of 95% a bolt will
1 2 1 40
x 3x 0 1 have any length between 200 - c and 200 + c.
4 4 6. Find the probability that none of 3 bulbs in a signal-light
Therefore X is a continuous random variable. will have to be replaced during the first 1200 hours of
(b)
operation if the lifetime X of a bulb is a random variable
2x 3dx x 3x 0
12
1 1 2 12 7 with the density f ( x ) 6[0.25 x 1.52 ] where 1< x<
P (0 X 1 2)
4 4 16 2 and f(x) =0 otherwise, where x is measured in
0
multiples of 1000 hours. [72%]
OR ALTERNATIVELY
x x
7.4 Mean and Variance of a probability distribution
f u du
2u 3du
1
F(x) It is conventional to characterise the probability function f(x)
4
in terms of the mean and variance.
0
u 3u 0 x 3
1 2 x x
4 4 7.4.1 Mean or Expectation
P (0 X 1 2) F (1 2) F (0) Let X be a random variable with a probability function f(x).
1 1 7 The mean (also called the expectation of X), denoted by
3 0
8 2 16 E[X] or , is defined as: -
Example 7.5
i
xi f ( xi ) f or a discrete distribution
Given the continuous random variable E[ x ]
2
3
x
1 x
xf ( x )dx f or a continuous distribution
f (x)
Example 7.6
0 Otherwise Two fair dice are rolled and the sum of the numbers on the
upturned faces (X) is recorded. Find the expectation of X.
Find F(x).
SOLUTION
SOLUTION By definition for discrete distributions
x x
E[ x ] x f x
u
1 i i
F(x) f (u )du 2 3
du i
1 xi f ( xi ) P( X xi ) x i f ( xi )
x
1 1
2 2 1 2 2 1/36 2/36
2u 1 x 3 2/36 6/36
Therefore, 4 3/36 12/36
5 4/36 20/36
1
1 2 f or1 x 6 5/36 30/36
x 7 6/36 42/36
F(x) 8 5/36 40/36
9 4/36 36/36
0 Otherwise 10 3/36 30/36
11 2/36 22/36
For any fixed a and b (> a), in the case of a continuous 12 1/36 12/36
random variable X the probabilities corresponding to the 257
TOTALS 1.00 7
intervals a x b, a x b, a x b are the same. 36
However they differ in the case of discrete random
variables. Therefore the expectation is approximately equal to 7.
x 2 f ( x )dx f or a continuous distribution
The variance is always positive and its square root is Example 7.8
known as the standard deviation (). A small filling station is supplied with gasoline weekly.
Assume that the volume of sales in thousands of litres has
Properties of the Variance the density function f ( x ) 6x 1 x when 0 < x < 1and 0
From the definition of the variance the following properties otherwise. Find the mean and the variance of X.
can be derived: - If a and b are constants then
V[a] = 0 SOLUTION
By definition
V[aX+ b] = a2V [ X ] E[X] =
1
V [aX bY ] a2V [ X ] b2V [Y ] , where X and Y are
x 1 x dx
xf ( x )dx 6 2
independent random variables. 0
1
6 x x dx 6
V [ X Y ] V [ X Y ] V [ X ] V [Y ] 1 x x 3 4
2 3
A useful result that relates the variance, the mean and the
second moment of X about zero is derived below: -
0 3 4
0
1
V [ X ] E [ X 2 ]
2
E [ X 2 2X μ 2 ] Hence E[X] = 500 litres
E [ X 2 ] 2E [ X ] 2 V [ X ] E [ X ] E [ X ]
2 2
E [ X 2 ] 2
1
E[ X 2 ]
x f ( x )dx 6 x 1 x dx
2 3
where E [ X 2 ] is the second moment of the probability
th 0
distribution. In general the k moment of a distribution is 1
6 x x dx 6
1 x x 4 5
3 4 3
defined by
0 4 5 10
x kj f ( xi ) f or a discrete distribution 0
Hence
E[ X k ]
j
2
x k f ( x )dx f or
a continuous distribution V[X]
3 1
10 2
1
20
(Thousand litres) = 50 litres.
Example 7.7
A biased six-sided die has probabilities p/2, p, p, p, p, 2p of Home Study Exercise 7.2
showing 1, 2, 3, 4, 5, 6 respectively. Find the mean, the 1. Find the mean and the variance of a discrete random
variance and the second moment of this probability variable having the probability distribution f(0) = 1/4,
distribution f(1) = 1/2, f(2) = 1/4. [1, 1/2]
2. Let X be the diameter of bolts in a production. Assume
SOLUTION
that X has the density f ( x ) k( x 0.9)(1.1 x ) if 0.9 < x
P (S ) P (1 2 3 4 5 6)
p 13 p < 1.1. Determine k, graph f(x) and find the mean and
p p p p 2p the variance. [k = 750, =1, =0.002]
2
2 2
But P(S) = 1,
3. Assume that the mileage (in thousands of kilometres)
which car owners get with a certain kind of tires is a
hence 13p/2 = 1 or p = 2/13. d= xi
random variable X having the density
2 Xi2
f x ex if x > 0
Xi pi xi pi d d pi d2 pi Xi2
1 1 1/13 -40 1600 1600 1 1 and zero otherwise. Here > 0 is a parameter.
13 13 169 2197 13
(a) What mileage can the car owner expect to
2 2 4/13 -27 729 1458 4 8
13 13 169 2197 13
get with one of these tires?
3 2 6/13 -14 196 392 9 18 (b) Find the probability that the tire will last at
13 13 169 2197 13 least 30,000 kilometres
4 2
13
8/13 -1
13
1
169
2
2197
16 32
13
4. A random variable X has a pdf f ( x )
1
80
3 x 2 4 in
5 2 10/13 12 144 288 25 50
the interval 0< x< 4 (and zero elsewhere). Find the
13 13 169 2197 13
6 4 24/13 25 625 2500 36 144
mean, the variance and the second moment of X.
13 13 169 2197 13
1 53 480 253
13 169 13 29
5. Show that V [aX bY ] a2V [ X ] b2V [Y ] , where X and (a) there will be 4 defective items
(b) there will be not more than three defective items
Y are independent random variables
(c) all the items will be non-defective
(d) there will be at least one defective item.
TOPIC 8.0
SOLUTION
PROBABILITY DISTRIBUTIONS Let A: Component is defective
X: number of defective components
Common probability distributions that are encountered in P(A) = p = 20% = 0.2. q = 1 - p = 0.8
physical applications can be classified as being either The Binomial distribution is defined as
discrete or continuous. 6
Discrete probability distributions include -the Binomial P(X = x) nCx p x (1 p)n x = (0.2)x (0.8)6 x and
x
distribution, the Hypergeometric distribution and the
the recurrence formula is
Poisson distribution. The most frequently referred to
pnx 16x
Continuous probability distributions is the Normal P ( X x 1) P ( X x ) = P ( X x )
distribution (also called Gaussian distribution). q x 1 4 x 1
x P(X = x) = f(x) F(x)
Section Objectives 6
At the end of this section you should be able to: - 0 (0.2)0 (0.8)60 0.2621 0.2621
1. Describe the Binomial, Hypergeometric and Poisson 0
distributions and use them to calculate probabilities. 6
2. Describe the Normal distribution as applied in the 1 (0.2)1(0.8)5 0.3932
determination of probabilities. 1
OR 0.6553
3. Find the mean and variance of common discrete and
continuous probability distributions 160
4. Justify the use of specific probability distributions in (0.2621) 0.3932
4 0 1
solving problems.
1 6 1
2 0.3932 0.2458 0.9011
8.1 DISCRETE PROBABILITY DISTRIBUTIONS 4 1 1
Discrete random variables have probability distributions that 3 162
are discontinuous, i.e. cannot be represented on a 0.2458 0.0819 0.9830
continuous scale. Examples of important discrete 4 2 1
distributions include the Binomial and Poisson distributions. 4 1 63
(0.08193 ) 0.0154 0.9984
4 3 1
8.1.1 The Binomial Distribution
Let A be an event which occurs in n independent 5 164
performances of an experiment, such that P(A) = p, the (0.0154) 0.0015 0.9999
4 4 1
probability that A occurs in a single trial. The probability that
6 165
A does not occur is denoted by P( A) q 1 p . If X is the (0.0015) 0.0001 1.0000
4 5 1
number of times that A occurs, then the Binomial
distribution is defined as: -
(a) P(X = 4) = f(4) = 0.0154
n
P ( X x ) f ( x ) p x q n x nCx p x (1 p)n x , x = 0, 1, .. (b) P(X < 3) = F(3) = 0.9830
x (c) P(X = 0) = f(0) = 0.2621
(d) P(at least one defective item) = P(X>1)
This distribution is also called the Bernoulli distribution. The = 1 -P(X<1) = 1- P(X = 0)
occurrence of the event A is called a success and its =1 - 0.2621 =0.7379
complement, A denotes a failure.
8.1.2 The Hypergeometric Distribution
For the binomial distribution The Binomial distribution is important in sampling with
The mean E[ X ] np replacement. Consider for example a box that contains N
The variance V [ X ] npq objects of which M are defective. If a screw is drawn at
random, the probability of obtaining a defective screw is
p nx
P ( X x 1) P ( X x ) , which is called M
q x 1 p
N
the Binomial recurrence formula which enables Hence in drawing n screws with replacement, the
successive probabilities P( X x k ), k 0,1,2... to probability that precisely x screws are defective is given by
be calculated once P(X = x) is known. the Binomial distribution i.e.
n n n
NOTE:
x 0
f (x)
x 0
n
Cx p x (1 p)n x [ p (1 p)]n 1 , P ( X x ) f ( x ) p x q n x
x
In the case of sampling without replacement, the probability
as required.
is
Example 8.1 M N M
20% of the components that a machine produces are x n x
defective. If 6 components are selected at random, find P ( X x ) f ( x ) x 0,1,...,n
N
the probability distribution and hence determine the
probability that n
30
This is called the hypergeometric distribution in which If 2% of the electric light bulbs produced by a company are
there are defective, find the probability that in a sample of 60 bulbs
N (a) 3 bulbs,
(a) different ways of picking n objects from N. (b) not more than three bulbs,
n (c) at least 2 bulbs will be defective.
M
(b) different ways of picking x defectives from M SOLUTION
x Here p = 2% =0.02, n = 60. np 1.2 5 and
N M hence we use of the Poisson distribution
(c) different ways of picking n-x non-
nx x e 1.2 x e 1.2
P( X x ) f ( x ) =
defectives from N - M x! x!
Each way in (b) combined with each way in (c) gives the
1.23 e 1.2
total number of mutually exclusive ways of obtaining x (a) P X 3 0.0867
defectives in n drawings without replacement. (a) gives the 3!
total number of equally possible outcomes. (b) P(no more that three defective bulbs) =
P X 3 P ( X 0) P X 1 P X 2 P X 3
For the Hypergeometric distribution e 1.2 1.2e 1.2 1.22 e 1.2
0.0867
M 0! 1! 2!
the Mean E ( x ) np n 0.9662
N
nM N M N n
(c) P(at least two defective bulbs) =
the variance V [ X } 2 P X 2 1 P ( X 2)
N 2 N 1 1 {P ( X 0) P ( X 1)}
Example 8.2 1 {e 1.2 1.2e 1.2 }
Random samples of 2 gaskets are drawn from a box 0.3374
containing 10 gaskets, 3 of which are defective. Find the
probability function of the number of defective items in the Home Study Exercise 8.1
sample (X) and the probability distributions when sampling 1. Four fair coins are tossed simultaneously. Find the
is done with and without replacement. Find also the mean probability function of the random variable X = number
and variance of X. of heads and compute the probabilities of obtaining no
SOLUTION
heads, precisely 1 head, at least 1 head, not more than
N = 10, M = 3, N - M= 7, n = 2. Therefore p =M/N = 0.3 3 heads. [0.0625, 0.25, 0.9375, 0.9375]
For sampling with replacement 2. Let p = 1% be the probability that a certain kind of light
n 2 bulb will fail in a 24 hour test. Find the probability that a
f ( x ) p x 1 p n x 0.3x 0.72 x , x 0, 1 and 2 sign consisting of 10 such bulbs will burn 24 hour with
x x no bulb failures. [90.4%]
f(0) = 0.72=0.49; f(1) = 2 x 0.3 x 0.7 = 0.42; f(2) = 0.32=0.09 3. Show that the Poisson distribution function satisfies
For sampling without replacement
F() =1.
4. Suppose that in the production of 50 -ohm radio
M N M N 3 7 10
f (x) ,x=0, 1 & 2 resistors, nondefective items are those that have a
x n x n x 2 x 2 resistance between 45 and 55 ohms and the probability
f(0) =f(1) = 21/45 0.47; f(2) = 3/45 0.07. of a resistor being defective is 0.2%. The resistors are
E[X] = np =2(0.3) = 0.6 sold in lots of 100, with the guarantee that all resistors
610 310 2 28 are nondefective. What is the probability that a given lot
V{X] = 0.3733
102 10 1 75 will violate this guarantee? (Use the Poisson
Distribution.) [0.1813]
5. A carton contains 20 fuses, five of which are defective.
NOTE: If N, M and N -M are large compared with n then the
Find the probability that if a sample of 3 fuses is chosen
Hypergeometric distribution may be approximated by the
from the carton by random drawing without
binomial distribution with p = M/N. Thus in sampling an replacement, x (x = 0, 1, 2, 3) fuses in the sample will
"infinite population" the binomial distribution is frequently
91 35 5 1
used. be defective. [ , , , ]
928 76 38 114
8.1.3 The Poisson Distribution 6. A distributor sells rubber bands in packages of 100 and
This is the discrete distribution with the probability function guarantees that at most 10% are defective. A consumer
x e controls each package by drawing at least 10 bands
P( X x ) f ( x ) for x =0, 1, …, n without replacement. If the sample contains no
x! defective rubber bands she accepts the package.
The use of the Poisson distribution is justified when Otherwise she rejects it. Find the probability that any
p 0 and n such that np < 10 or preferably given package is accepted although it contains 20
np < 5. defective rubber bands. [9.5%]
np 2 i.e. the mean is equal to the variance. 7. A process of manufacturing screws is checked every
This distribution provides probabilities of numbers of cars hour by inspecting n screws selected at random from
passing at a given point per unit interval of time, numbers of that hour's production. If one or more screws are
defects per unit length of wire, or per unit area of textile, defective the process is halted and carefully examined.
etc. How large should n be if the manufacturer wants the
probability to be about 95% that the process will be
Example 8.3 halted when 10% of the screws being produced are
31
defective? (Assume independence of the quality of any Figure 8.2 Normal probability estimates
item of that of the other items.)
8. If a magnetic tape contains on average, 2 defects per Example 8.4
100 metres, what is the probability that a roll of 300 The mean mass of 200 people is 67 kg and the standard
metres long will contain (a) x defects and (b) no deviation is 7 kg. Assuming the masses to be normally
defects? distributed, determine how many people
8.2 CONTINUOUS PROBABILITY DISTRIBUTIONS (a) have a mass between 60 and 74 kg.
Data representing measured quantities such as length, (b) have a mass of more than 81 kg.
mass, current, temperature, luminous intensity, etc. is (c) have a mass of between 53 and 88 kg.
continuous and their probability distributions approximate to SOLUTION
the Gauss or Normal distribution Given that = 67 kg and = 7kg, n =200 people
Compute
8.2.1 The Normal Probability Distribution 60
74
The normal probability distribution has the density 2 53
2 81
x2 3 46
3 88
1 22 (a) P(mass between 60 and 74) = P( X )
P ( X x ) f ( x ) e
= 2/3 0.6667
where is the standard deviation. A random variable
having this distribution is said to be normally distributed.
f(x)
The normal probability curve is a graphical representation 66.67%
of the normal distribution. This curve exhibits symmetry
about the mean ().
f(x) x
x2
E[X] = np = 200(0.6667) 134 people.
1 22
f (x) e
(b) P(mass of more than 81 kg)
= P( X 81) P( X 2)
=
1
100 95% 0.025
f(x) 2
95%
x Hence
2.5% 2.5%
E[X] = np =200 x 0.025
Figure 8.1 The Normal probability curve = 5 people.
53 67 81 x
From the normal probability curve it can be shown that: - (c) P(mass between 53 and 88 kg)
2
66 % of the data, i.e. two-thirds of the values lie
3 = P( 2 X 3)
95% f(x)
between and . 2 99¾% 1
About 95% of the data lie between 2 = (95 99.75)% 0.9738
2
2 and 2 .
About 99¾% of the data lie between E[X]= 200(0.9738) 195 people
3 and 3 . 53 88 x
Example 8.6
The following considerations should be made when If X is a normal distribution of mean and variance ,
2
using this table:
calculate the probabilities that X lies within 1, 2 and 3 of
(z)=P(0 < Z <z) the mean.
P (Z a) 0.5 (a) if a 0
0.5 (a) if a 0
SOLUTION
P( X ) P(1 Z 1) , since Z =(X - )/.
P Z a 0.5 (a) if a 0
0.5 (a) if a 0 Using Appendix B
P( 1 Z 1) P( 1 Z 1) (1) ( 1)
P (a Z b ) (b) (a) if a 0 and b 0 2(1) 20.3413 68.3%
(b) (a) if a 0 and b 0
Similarly for 2
Again Z =(X - )/. P( 2 X 2) P(2 Z 2)
= 2(2) 2(0.4772) 95.4%
Example 8.5
For3
A certain machine produces components that have a mean
P( 3 X 3) P(3 Z 3)
length of 15 cm and a standard deviation of 0.2 cm. In a
batch of 1000 components, determine the number of = 2(3) 2(0.4987) 99.7%
components that are likely to:
(a) have a length of less than 14.95 cm.
(b) have a length of between 14.95 and 15.15 Home Study Exercises 8.2
(c) be larger than 15.43 cm. 1. Let X be normal with a mean 0 and variance 1.
Determine the constant c such that P( X c ) 45%
33
[1.645] percentage of packets with between 43 and 45 washers
2. If the thickness of iron plates X (in mm) is normally (both inclusive). [14 marks]
distributed with a mean of 10mm and a variance of
(b) On a combined axes graph draw the bar chart, the
4 104 mm. Find the percentage of defective plates
histogram and the frequency polygon for the frequency
expected, assuming that the defective plates (a) are
distribution in 1(a) above. Use the graph paper
thinner than 9.97mm, (b) thicker than 10.05 mm (c) provided. [6 marks]
deviate more than 0.03mm from 10mm.
[6.7%, 0.6%, 13%]
2. The diameters of 34 samples of steel rods were
measured and the following results obtained (in
TOPIC 9.0 millimeters):
SAMPLE EXAMINATION PAPER 19.63 19.82 19.96 19.75 19.86
19.89 20.16 19.56 20.05 19.72
19.73 19.93 20.03 19.86 19.81
19.66 19.77 19.99 20.00 20.11
19.82 19.61 19.97 20.07 20.01
19.96 19.68 19.87 19.90 19.84
19.77 19.78 19.75 19.87
THIS PAPER CONTAINS SEVEN (7) QUESTIONS. 4. (a) Three screws are drawn at random without
ANSWER ANY FIVE (5) QUESTIONS. replacement from a box containing 4 left-hand and 6 right-
ALL QUESTIONS CARRY EQUAL MARKS. hand screws. If X is a random variable that denotes the
total number of left-hand screws drawn, construct the
1. The contents of thirty packets of washers, randomly probability distribution and show that x is a discrete random
sampled from a batch destined for the market, were variable. [10 marks]
determined as follows: (b) The volume of sales of a liquid detergent (in
thousands of litres) has the probability density
43 44 41 41 44 43 f (x) 6x(1 x) if 0 x 1
42 45 41 44 46 41 0 otherwise
44 42 45 43 43 44 Show that f(x) is a probability density function and the
43 45 45 42 44 44 determine the mean and variance of the distribution.
45 46 44 46 43 44 [10 marks]
34
(b) A chemical firm produces aspirin tablets having a
mean mass of 4g and a standard deviation of 0.2g.
Assuming that the masses are normally distributed and that
each tablet is chosen at random, what is the probability that
a tablet
(i) has a mass between 3.55g and 3.85g?
(ii) differs from the mean by less than 0.35g?
No. of 0 1 2 3 4 5 6
particles
No. of cards 45 65 52 24 11 3 0
35