Statistics
Statistics
Qualitative:
• He is brown and black
• He has long hair
• He has lots of energy
Quantitative:
• Discrete:
o He has 4 legs
o He has 2 eyes
• Continuous:
Fig. 2 A dog is jumping.
o He weighs 25.5 kg
o He is 565 mm tall
Collecting: Data can be collected in many ways.
1. Direct observation (It is the simplest way).
2. Survey questions and etc. (Your TER form)
You might want to count many 10-minute intervals Fig. 3 Cars passing through a certain points of road.
at different times during the day, and on different
days too!
Population: Population means an aggregate of elements possessing certain characteristics of interest in any
particular investigation or enquiry.
Sample: Sample is a representative part of the population.
Variable: The measurements of elements of a population having certain characteristics may vary from elements
to elements either in magnitude or in quantity. These measurable characteristics are called variable. If variable
can take only one value, it is called constant.
Raw data: Raw data are collected data that have not been organized numerically. An example is the set of heights
of 100 male students obtained from an alphabetical listing of university records.
Array: An array is an arrangement of raw numerical data in ascending or descending order of magnitude. The
difference between the largest and smallest numbers is called the range of the data. For example, if the largest
height of 100 male students is 74 inches (in) and the smallest height is 60 in, the range is 74 60 ¼ 14 in.
Frequency distribution: When summarizing large masses of raw data, it is often useful to distribute the data into
classes, or categories, and to determine the number of individuals belonging to each class, called the class
frequency. A tabular arrangement of data by classes together with the corresponding class frequencies is called a
frequency distribution, or frequency table. Table 1 is a frequency distribution of heights (recorded to the nearest
inch) of 100 students at Northern University Bangladesh.
Problem 2 Construct a frequency distribution table for the following raw data (height of 20 students): 50, 54, 56,
60, 55, 45, 48, 51, 53, 57, 49, 50, 52, 58, 57, 59, 57, 58, 55, and 54.
Graph: A graph is a pictorial presentation of the relationship between variables. Many types of graphs are
employed in statistics, depending on the nature of the data involved and the purpose for which the graph is
intended. Among these are bar graphs, pie graphs, pictographs, etc.
35
30
25
20
15
10
5
0
60 65 70 75
Height (inche)
Bar Diagram: The frequency distribution of 100 students is given in Table 1. We have to plot the Bar diagram.
In this respect, firstly it is required to find the middle class. Then, we have to plot frequencies relative to the
respective class.
Height (in) Middle class (𝑥𝑖 ) Number of students (𝑓𝑖 )
60-62 61 5
63-65 64 18
66-68 67 42
69-71 70 27
72-74 73 8
Total 100
40
30
20
10
0
61 64 67 70 73
Height (inche)
Frequency polygon: The frequency distribution of 100 students is given in Table 1. We have to draw frequency
polygon. In this respect, firstly it is required to find the middle class. Then, we have to plot frequencies relative
to the respective class.
Number of students
35
30
25
20
15
10
5
0
60 62 64 66 68 70 72 74
Height (inche)
Average or measures of central tendency: An average is a value that is typical, or representative, of a set of
data. Since such typical values tend to lie centrally within a set of data arranged according to magnitude, averages
are also called measures of central tendency.
Several types of averages can be defined, the most common being the arithmetic mean, the median, the
mode, the geometric mean, and the harmonic mean. Each has advantages and disadvantages, depending on the
data and the intended purpose.
Arithmetic mean: The arithmetic mean of a set of 𝑁 numbers 𝑥1 , 𝑥2, 𝑥3 , . . . 𝑥𝑁 is denoted by 𝑋 and defined by,
1
For group data/frequency distribution, arithmetic mean can defined as 𝐴𝑀 = 𝑁 ∑𝑁
𝑖=1 𝑓𝑖 𝑥𝑖 .
a. The algebraic sum of the deviations of a set of numbers from their arithmetic mean is zero.
Geometric mean: The geometric mean of a set of 𝑁 positive numbers 𝑥1, 𝑥2 , 𝑥3, . . . 𝑥𝑁 is denoted by 𝐺 and
defined by, 𝐺 = 𝑁√𝑥1 𝑥2 𝑥3. . . 𝑥𝑁.
1
For group data, 𝐺 = 10 𝑁 ∑ 𝑓𝑖 𝑙𝑜𝑔(𝑥𝑖 ) .
Harmonic mean: The harmonic mean of a set of 𝑁 nonzero numbers 𝑥1, 𝑥2 , 𝑥3, . . . 𝑥𝑁 is denoted by 𝐻 and defined
1 1
by, 𝐻 = 1 1 1 1 1 = 1 1 .
( + + +⋯+𝑥 ) ∑
𝑁 𝑥1 𝑥2 𝑥3 𝑁 𝑥𝑖
𝑁
1
For group data, 𝐻 = 1 1 .
∑ 𝑓𝑖
𝑁 𝑥𝑖
Quadratic mean (Root mean square): The root mean square (RMS), or quadratic mean, of a set of numbers
2
𝑥1 , 𝑥2, 𝑥3 , . . . 𝑥𝑁 is sometimes denoted by √𝑋 and is defined by,
2 1
𝑅𝑀𝑆 = √𝑋 = √ ∑ 𝑥𝑖2 .
𝑁
2 1
𝑅𝑀𝑆 = √𝑋 = √𝑁 ∑ 𝑓𝑖 𝑥𝑖2 . (For group data)
Problem: Find the arithmetic mean, geometric mean and harmonic mean of data 8, 5, 6, 3, 10, 12, 13, 9, 7, 6, 5,
3, 8.
8+5+6+3+10+12+13+9+7+6+5+3+8
Solution: 𝐴𝑀 = = 7.308
13
13
𝐺𝑀 = √8 × 5 × 6 × 3 × 10 × 12 × 13 × 9 × 7 × 6 × 5 × 3 × 8 =6.6623 (do yourself)
1
𝐻𝑀 = 1 1 1 1 1 1 1 1 1 1 1 1 1 1 =6.0068 (do yourself)
( + + + + + + + + + + + + )
13 8 5 6 3 10 12 13 9 7 6 5 3 8
Problem: Daily wages range of 35 labors in a certain factory are given bellow. Find the arithmetic mean by both
direct and short-cut method.
Solution:
1 632
𝐴𝑀 = 𝑁 ∑ 𝑓𝑖 𝑥𝑖 = = 18.06.
35
1 1 2
𝐴𝑀 = 𝐴 + (𝑁 ∑ 𝑓𝑖 𝑢𝑖 ) × ℎ = 18 + 35 × 2 = 18 + 35 = 18.06.
Problem: Daily wages range of 35 labors in a certain factory are given bellow. Find the Geometric mean and
Harmonic mean.
Problem: Find the relation between arithmetic mean, geometric mean and harmonic mean.
Problem: Daily wages range of 35 labors in a certain factory are given bellow. Find the quadratic mean or root
mean square (RMS)
Class 11-13 13-15 15-17 17-19 19-21 21-23 23-25
Number of labors 3 4 5 10 6 4 3
Median: The median of a set of numbers arranged in order of magnitude (i.e., in an array) is either the middle
value or the arithmetic mean of the two middle values.
Examples:
𝑁+1
If total number of data (𝑁) is odd then, ( ) 𝑡ℎ data will be median.
2
𝑁 𝑁
If total number of data (𝑁) is even then, the average of ( 2 ) 𝑡ℎ and ( 2 + 1) 𝑡ℎ data will be median.
𝑁
−𝐹
𝑚𝑒𝑑𝑖𝑎𝑛 = 𝐿 + 2 𝑓 × ℎ,
where, 𝐿 is the lower limit of median class, 𝐹 cumulative frequency of pre median class, 𝑓 frequency
of median class, ℎ length of median class, 𝑁 total number of observations.
Mode: The mode of a set of numbers is that value which occurs with the greatest frequency; that is, it is the most
common value. The mode may not exist, and even if it does exist it may not be unique. A distribution having only
one mode is called unimodal.
Examples:
𝛥1
mode = 𝐿 + (𝛥 ) ℎ,
1 +𝛥2
where, 𝐿 is the lower limit of modal class, 𝛥1 difference in frequencies of modal and pre-modal class, 𝛥2
difference in frequencies of modal and post modal class, ℎ length of modal class.
Problem: Daily wages range of 35 labors in a certain factory is given bellow. Find the median and mode from
the given frequency distribution.
Solution:
𝑁
−𝐹
It is known to us, 𝑚𝑒𝑑𝑖𝑎𝑛 = 𝐿 + 2
× ℎ. (1)
𝑓
35
Now, we have to identify the median class. Here the half of the total frequency is = 17.5 which lies in
2
in range of 22 (cumulative frequency) and 22 lies in the class 17-19. Hence the class 17-19 is the median
class. Thus 𝐿 = 17, 𝑁 = 35, 𝐹 = 12, 𝑓 = 10 and ℎ = 19 − 17 = 2.
𝑁 35
−𝐹 − 12 17.5 − 12
𝑚𝑒𝑑𝑖𝑎𝑛 = 𝐿 + 2 × ℎ = 17 + 2 × 2 = 17 + × 2 = 17 + 0.55 × 2 = 18.1
𝑓 10 10
𝛥1
Again, mode = 𝐿 + (𝛥 ) ℎ. (2)
1 +𝛥2
𝛥1 5
mode = 𝐿 + (𝛥 ) ℎ = 17 + (5+4) × 2 = (do yourself)
1 +𝛥2
Quartile, deciles and percentiles: If a set of data is arranged in order of magnitude, the middle value (or
arithmetic mean of the two middle values) that divides the set into two equal parts is the median. By extending
this idea, we can think of those values which divide the set into four equal parts. These values, denoted by Q1,
Q2, and Q3, are called the first, second, and third quartiles, respectively, the value Q2 being equal to the median.
Similarly, the values that divide the data into 10 equal parts are called deciles and are denoted by D1, D2,
. . . , D9, while the values dividing the data into 100 equal parts are called percentiles and are denoted by P1, P2, .
. . , P99.
𝑁
𝑖−𝐹𝑖
𝑄𝑖 = 𝐿𝑖 + 4
× ℎ; 𝑖 = 1,2,3.
𝑓𝑖
Problem 03: Frequency distribution of the weekly wages of 65 employees at the P&R Company is given bellow
𝑁
𝑖−𝐹𝑖
𝑄𝑖 = 𝐿𝑖 + 4
× ℎ; 𝑖 = 1,2,3.
𝑓𝑖
𝑁 65
For first quartile (i=1) 4 𝑖 = × 1=16.25 which lies in class 260-270. Thus, 𝐿1 = 260, 𝐹1 = 8, 𝑓1 = 10, and ℎ =
4
10. Therefore,
𝑁
×1−𝐹1 16.25−8
𝑄1 = 𝐿1 + 4
× ℎ = 260 + × 10 = 260 + 8.25 = 268.25.
𝑓1 10
𝑁 65
For second quartile (i=2) 4 𝑖 = × 2=32.50 which lies in class 270-280. Thus, 𝐿2 = 270, 𝐹2 = 18, 𝑓2 = 16, and
4
ℎ = 10. Therefore,
𝑁
×2−𝐹2 32.50−18 14.50×10
𝑄2 = 𝐿2 + 4
× ℎ = 270 + × 10 = 270 + =?
𝑓2 16 16
𝑁 65
For third quartile (i=3) 𝑖= × 3=48.75 which lies in class 290-300. Thus, 𝐿3 = 290, 𝐹3 = 48, 𝑓3 = 10, and
4 4
ℎ = 10. Therefore,
𝑁
×3−𝐹3 48.75−48
𝑄3 = 𝐿3 + 4
× ℎ = 290 + × 10 =?
𝑓3 10
1. Write down the merits and demerits of arithmetic mean, geometric mean and harmonic mean.
(Assignment)
2. Write down the merits and demerits of median and mode. (Assignment)
3. For any two positive data show that (𝐺𝑀)2 = 𝐴𝑀 × 𝐻𝑀.
4. Prove that the quadratic mean (QM) of any two positive unequal numbers a and b is greater than their
geometric mean.
5. For n positive observations, show that arithmetic mean≥geometric mean≥harmonic mean.
6. Form a frequency distribution table from the following data and hence draw a different frequency
diagrams (scattered dot plot, histogram, polygon).
3, 4, 4, 5, 5, 5, 5, 5, 5, 6, 6, 6, 6, 7, 7, 7, 7, 7, 8, 8, 9, 10, 10, 10, 10, 10, 10, 12, 13, 15, 15, 16, 17, 17, 18, 19, 21, 23,
23, 25, 26, 27, 28, 29, 31, 33, 35, 34, 34, 35, 36, 37, 37, 38, 39, 42, 45.
7. Find the arithmetic mean, median and mode for the following frequency distribution:
Class 6-10 11-15 16-20 21-25 26-30 31-35 36-40 41-45
frequency 3 4 6 7 12 9 6 3
8. Find the arithmetic mean (in both direct and shortcut method), geometric mean, harmonic mean, median
and mode for the following frequency distribution.
Class 0-5 5-10 10-15 15-20 20-25 25-30 30-35 35-40
frequency 2 5 7 13 21 16 8 3
9. Suppose you have some data including positive, negative and zero. Which mean do you like to choose for
measuring data location and why?
Dispersion, or variation: The degree to which numerical data tend to spread about an average value is called
the dispersion, or variation, of the data. Various measures of this dispersion (or variation) are available, the
most common being the range, mean deviation, semi-interquartile range, 10–90 percentile range, and standard
deviation.
Range: The range of a set of numbers is the difference between the largest and smallest numbers in the set.
The mean deviation: The mean deviation, or average deviation, of a set of N numbers 𝑥1 , 𝑥2, 𝑥3 , . . . 𝑥𝑁 is
abbreviated by MD and is defined by
1
𝑀𝐷 = 𝑁 ∑ |𝑥𝑖 − 𝑥|.
1
𝑀𝐷 = 𝑁 ∑ 𝑓𝑖 |𝑥𝑖 − 𝑥|.
The semi-interquartile range: The semi-interquartile range, or quartile deviation, of a set of data is denoted by
Q and is defined by
𝑄3 −𝑄1
𝑄= ,
2
where Q1 and Q3 are the first and third quartiles for the data. The interquartile range (𝑄3 − 𝑄1 ) is sometimes
used, but the semi-interquartile range is more common as a measure of dispersion.
The 10–90 percentile range: The 10–90 percentile range of a set of data is defined by
1
where P10 and P90 are the 10th and 90th percentiles for the data. The semi-10–90 percentile range, 2 (𝑃90 − 𝑃10),
Problem 02: Find the semi-interquartile range, or quartile deviation, and 10–90 percentile range from the
following frequency distribution.
𝑁
𝑖−𝐹𝑖
𝑄𝑖 = 𝐿𝑖 + 4
× ℎ; 𝑖 = 1, 2, 3.
𝑓𝑖
𝑁 65
For first quartile (i=1) 4 𝑖 = × 1=16.25 which lies in class 260-270. Thus, 𝐿1 = 260, 𝐹1 = 8, 𝑓1 = 10, and ℎ =
4
10. Therefore,
𝑁
×1−𝐹1 16.25−8
𝑄1 = 𝐿1 + 4
× ℎ = 260 + × 10 = 260 + 8.25 = 268.25.
𝑓1 10
𝑁 65
For third quartile (i=3) 𝑖= × 3=48.75 which lies in class 290-300. Thus, 𝐿1 = 290, 𝐹1 = 48, 𝑓1 = 10, and
4 4
ℎ = 10. Therefore,
𝑁
×3−𝐹3 48.75−48
𝑄3 = 𝐿3 + 4
× ℎ = 290 + × 10 = 290.75
𝑓3 10
1 1 1
Now, the semi-interquartile range 𝑄 = 2 (𝑄3 − 𝑄1 ) = 2 (290.75 − 268.25) = 2 . 22.50 = 11.25
1
The semi 10-90 percentile range= 2 (𝑃90 − 𝑃10 ) =?
The standard deviation: The standard deviation of a set of N numbers 𝑥1, 𝑥2 , 𝑥3, . . . 𝑥𝑁 is denoted by 𝜎 and is
defined by
1
𝜎 = √𝑁 ∑(𝑥𝑖 − 𝑥)2.
Thus 𝜎 is the root mean square (RMS) of the deviations from the mean, or, as it is sometimes called, the root-
mean-square deviation.
1
𝜎 = √𝑁 ∑ 𝑓𝑖 (𝑥𝑖 − 𝑥)2.
1 1 2
𝜎 = √𝑁 ∑ 𝑓𝑖 𝑥𝑖2 − (𝑁 ∑ 𝑓𝑖 𝑥𝑖 ) .
If sample data are given then standard deviation is dented by 𝑠 and defined by
1
𝑠=√ ∑(𝑥𝑖 − 𝑥)2
𝑁−1
Problem: Find the standard deviation (SD) and variance from the following frequency distribution.
Class 11-13 13-15 15-17 17-19 19-21 21-23 23-25
Number of labors 3 4 5 10 6 4 3
Solution:
Class 𝑥𝑖 𝑓𝑖 𝑓𝑖 𝑥𝑖 𝑥𝑖 2 𝑓𝑖 𝑥𝑖 2
11-13 12 3 36 144 432
13-15 14 4 56 196 784
15-17 16 5 80 256 1280
17-19 18 10 180 324 3240
19-21 20 6 120 400 2400
21-23 22 4 88 484 1936
23-25 24 3 72 576 1728
Total 35 632 11800
We know,
1 1 1 2 1 2
Standard deviation, 𝑆𝐷 = √𝑁 ∑ 𝑓𝑖 𝑥𝑖2 − (𝑁 ∑ 𝑓𝑖 𝑥𝑖 ) =√35 × 11800 − (35 × 632)
1 1 2
Variance, 𝜎 2 = 𝑁 ∑ 𝑓𝑖 𝑥𝑖2 − (𝑁 ∑ 𝑓𝑖 𝑥𝑖 ) =11.083 (approximately)
The variance: The variance of a set of data is defined as the square of the standard deviation and is thus given
by 𝜎 2 . When it is necessary to distinguish the standard deviation of a population from the standard deviation of a
sample drawn from this population, we often use the symbol 𝑠 for the latter and 𝜎 (lowercase Greek sigma) for
the former. Thus 𝑠 2 and 𝜎 2 would represent the sample variance and population variance, respectively.
Coefficient of variation (CV): If the absolute dispersion is the standard deviation 𝑠 and if the average is the
mean 𝑥̄ , then the relative dispersion is called the coefficient of variation, or coefficient of dispersion; it is
denoted by V and is given by
SD
Coefficient of variation, C𝑉(%) = × 100%,
x̅
and is generally expressed as a percentage. Note that the coefficient of variation is independent of the units
used. For this reason, it is useful in comparing distributions where the units may be different. A disadvantage of
the coefficient of variation is that it fails to be useful when 𝑥̄ is close to zero.
Problem Show that for two observations, standard deviation is the half of the range.
Solution: Let a and b be two data.
𝑎+𝑏
The mean of the data 𝑥̅ = .
2
1
We know standard deviation, 𝑆𝐷 = √(𝑁 ∑(𝑥𝑖 − 𝑥̅ )2 ).
1
or, 𝑆𝐷 = √(𝑁 ∑(𝑥𝑖 − 𝑥̅ )2 )
1 𝑎+𝑏 2 𝑎+𝑏 2
or, 𝑆𝐷 = √( {(𝑎 − ) + (𝑏 − ) })
2 2 2
1 2𝑎−𝑎−𝑏 2 2𝑏−𝑎−𝑏 2
or, 𝑆𝐷 = √(2 {( ) +( ) })
2 2
1 𝑎−𝑏 2 −𝑎+𝑏 2
or, 𝑆𝐷 = √(2 {( ) +( ) })
2 2
1 𝑎−𝑏 2 𝑎−𝑏 2
or, 𝑆𝐷 = √(2 {( ) +( ) })
2 2
1 𝑎−𝑏 2
or, 𝑆𝐷 = √(2 . 2. ( ) )
2
𝑎−𝑏 2
or, 𝑆𝐷 = √(( ) )
2
𝑎−𝑏 2
or, 𝑆𝐷 = √(| | )
2
𝑎−𝑏
or, 𝑆𝐷 = | |
2
1
or, 𝑆𝐷 = 2 |𝑎 − 𝑏|
1
or, 𝑆𝐷 = 2 × 𝑟𝑎𝑛𝑔𝑒. (proved)
Lecture 07
(𝑛2 −1)
Problem Show that the standard deviation of first n natural numbers is √ .
12
𝑁1 𝑋1 +𝑁2 𝑋2
The mean of the combined group: 𝑋12 = 𝑁1 +𝑁2
Problem A group of 40 students was selected and measured their heights which give mean 4.5 ft and standard
deviation 2.1 ft. Another group of 50 students was also selected and measured their mean 4.3 ft and standard
deviation is 1.9 ft. Find the mean and standard deviation of the combined group.
Problem: Calculate combined mean and standard deviation (SD) from the following table of two groups
Problem A student while calculating mean and standard deviation of 25 observations obtained mean=56 and
standard deviation=2. At the time of checking, it was found that he has copied 64 instead of 46. What would be
the actual value of the mean and standard deviation?
Problem: The mean and standard deviation of 20 observation are found to be 10 and 2 respectively. On
rechecking it was found that an observation 8 was incorrect. Calculate the correct mean and standard deviation
in each of the following cases:
(i) If wrong item is omitted
(ii) If it is replaced by 12
Problems for practice:
1. Find the mean deviation, standard deviation, quartile deviation and percentile deviation from the data 3,
4, 4, 5, 5, 5, 5, 5, 5, 6, 6, 6, 6, 7, 7, 7, 7, 7, 8, 8, 9, 10, 10, 10, 10, 10, 10, 12.
2. Find the mean deviation, standard deviation, quartile deviation and percentile deviation from the
frequency distribution
Class 6-10 11-15 16-20 21-25 26-30 31-35 36-40 41-45
frequency 3 4 6 7 12 9 6 3
4. Find the mean deviation, standard deviation and variance of the following table.
Class 6-10 11-15 16-20 21-25 26-30 31-35 36-40 41-45
frequency 3 4 6 7 12 9 6 3
5. Show that for two observations, standard deviation is the half of the range.
(𝑛2 −1)
6. Show that the standard deviation of first n natural numbers is √ .
12
7. A group of 60 students was selected and measured their heights which give mean 5.5 ft. and standard
deviation 2.1 ft. Another group of 40 students was also selected and measured their mean 5.3 ft. and
standard deviation is 1.9. Find the mean and standard deviation of the combined group.
8. A student while calculating mean and standard deviation of 50 observations obtained mean=56 and
standard deviation=2.50. At the time of checking, it was found that he has copied 64 instead of 46. What
would be the actual value of the mean and standard deviation?
Curve fitting: The general problem of finding equations of approximating curves that fit given sets of data is
called curve fitting.
Regression: Regression is the relationship between dependent variable and independent variable.
Regression curve: On the basis of sample data, we wish to estimate the value of a variable Y corresponding to a
given value of a variable X. This can be accomplished by estimating the value of Y from a least-squares curve
that fits the sample data. The resulting curve is called a regression curve of Y on X, since Y is estimated from X.
∑𝑌 = ∑𝑎 + 𝑏∑𝑋
∑ 𝑋𝑌 = 𝑎 ∑ 𝑋 + 𝑏 ∑ 𝑋 2
Solution:
We know, the equation of straight line is 𝑦 = 𝑎 + 𝑏𝑥 (1)
Again, we know for straight line fitting (using least square method)
∑ 𝑌 = ∑ 𝑎 + 𝑏 ∑ 𝑋; ∑ 𝑋𝑌 = 𝑎 ∑ 𝑋 + 𝑏 ∑ 𝑋 2
Now,
X Y 𝑋2 𝑋𝑌
0 2 0 0
1 5 1 5
2 7 4 14
3 10 9 30
4 12 16 48
5 18 25 90
6 25 36 150
∑ 𝑋=21 ∑ 𝑌=79 ∑ 𝑋 2 =91 ∑ 𝑋Y=337
Thus,
79 = 7𝑎 + 21𝑏 (2)
337 = 21𝑎 + 91𝑏 (3)
Solving Eqs. (2) & (3), we have
𝑎 = 0.571 and 𝑏 = 3.571
Therefore, equation of the straight line is 𝑦 = 0.571 + 3.571 𝑥
2. Wheat is grown on 9 equal-sized plots. The amount of fertilizer put on each plot is given in the
following table along with the yield of wheat. Fit the second-degree polynomial curve or fit a curve in
form 𝑦 = 𝑎 + 𝑏𝑥 + 𝑐𝑥 2 and also fit a linear regression line.
wheat X: 2.3 3.4 4.4 5.1 5.5 5.2 4.9 4.4 3.9
fertilizer Y: 1.2 2.3 3.3 4.1 4.8 5.0 5.5 6.1 6.9
∑ 𝑌 = ∑ 𝑎 + 𝑏 ∑ 𝑋 + 𝑐 ∑ 𝑋2;
∑ 𝑋𝑌 = 𝑎 ∑ 𝑋 + 𝑏 ∑ 𝑋 2 + 𝑐 ∑ 𝑋 3
and , ∑ 𝑋 2 𝑌 = 𝑎 ∑ 𝑋 2 + 𝑏 ∑ 𝑋 3 + 𝑐 ∑ 𝑋 4
Solution:
𝑋 𝑌 𝑋𝑌 𝑋2 𝑋2 𝑌 𝑋3 𝑋4
2.3 1.2 2.76 5.29 6.348 12.167 27.984
3.4 2.3 7.82 11.56 26.588 39.304 133.6336
4.4 3.3 14.52 19.36 63.888 85.184 374.8096
5.1 4.1 20.91 26.01 106.641 132.651 676.5201
5.5 4.8 26.40 30.25 126.720 166.375 914.0625
5.2 5.0 26.00 27.04 130.000 140.608 731.1616
4.9 5.5 26.95 24.01 148.225 117.649 576.4801
4.4 6.1 26.84 19.36 163.724 85.184 376.8096
3.9 6.9 26.91 15.21 185.679 59.319 231.3441
∑ 𝑋 =39.10 ∑ 𝑌 =39.2 ∑ 𝑋𝑌 =179.11 ∑ 𝑋 2 = 177.99 ∑ 𝑋2 𝑌 = ∑ 𝑋3 = ∑ 𝑋4 =
957.813 838.441 4042.8052
Moments:
Skewness: Skewness is the degree of asymmetry, or departure from symmetry of the distribution.
Kurtosis:
Correlation
Hypothesis Test
Probability: Probability of an event is the measurement of the chance that the event will occur as a result of an
experiment.
Sample Space: A set S that consists of all possible outcomes of a random experiment is called a sample space,
and each outcome is called a sample point.
Event: An event is a subset A of the sample space S, i.e., it is a set of possible outcomes. If the outcome of an
experiment is an element of A, we say that the event A has occurred. An event consisting of a single point of S is
often called a simple or elementary event.
Example:
Referring to the experiment of tossing a coin twice, let A be the event “at least one head occurs” and B the event
“the second toss results in a tail.” Then A = {HT, TH, HH}, B = {HT, TT}, and so we have
𝐴 ∩ 𝐵 = {𝐻𝑇}
𝐴 − 𝐵 = {𝐻𝐻, 𝑇𝐻}
𝐴′ = 𝑆 − 𝐴 = {𝑇𝑇}
Mutually Exclusive events: Two events are said to be mutually exclusive event if the occurrence of one of them
excludes the occurrence of the other.
Axioms of probability:
Solution: If a coin is being tossed, the sample space 𝑆 = {𝐻, 𝑇}. Suppose 𝐴 is the event of getting head.
Thus 𝐴 = {𝐻}.
𝑛(𝐴) 1
Now 𝑃(𝐴) = = .
𝑛(𝑆) 2
Problem 2: If 10 of 500 randomly selected cars manufactured at a certain factory are found to be the default.
What is the probability of a randomly chosen car will be default?
Solution: Suppose the total number of cars, 𝑛(𝑆) = 500 and the total number of default cars 𝑛(𝐴) = 10.
𝑛(𝐴) 10 1
Thus 𝑃(𝐴) = = 500 = 50.
𝑛(𝑆)
Problem 3: A single dice is tossed once. Find the probability of a 2 or 5 turning up.
Solution: If a dice is tossed, the sample space 𝑆 = {1, 2, 3, 4, 5, 6}. Suppose A=’turning up 2’ and
B=’turning up 5’.
1 1 2 1
𝑃(𝐴 or 𝐵) = 𝑃(𝐴 ∪ 𝐵) = 𝑃(𝐴) + 𝑃(𝐵) = 6 + 6 = 6 = 3.
Problem 04: Two coins are tossed “A” is the event of “getting two heads” and “B” is the event of “getting the
second coin shows head”. Evaluate 𝑃(𝐴 ∪ 𝐵).
Solution: If two coins are tossed ones, the sample space could be written as 𝑆 = {𝐻𝐻, 𝐻𝑇, 𝑇𝐻, 𝑇𝑇}.
Let A=the event of “getting two heads”= {𝐻𝐻} and
B=the event of “getting the second coin shows head” = {𝐻𝐻, 𝑇𝐻}.
𝐴 ∩ 𝐵 = {𝐻𝐻} ∩ {𝐻𝐻, 𝑇𝐻} = {𝐻𝐻} ≠ {}, implies A and B is not mutually exclusive.
It is known to us for not mutually exclusive event
𝑃(𝐴 ∪ 𝐵) = 𝑃(𝐴) + 𝑃(𝐵) − 𝑃(𝐴 ∩ 𝐵)
𝑛(𝐴) 𝑛(𝐵) 𝑛(𝐴∩𝐵)
or, 𝑃(𝐴 ∪ 𝐵) = + 𝑛(𝑆) −
𝑛(𝑆) 𝑛(𝑆)
1 2 1 2 1
or, 𝑃(𝐴 ∪ 𝐵) = 4 + 4 − 4 = 4 = 2.
Problem 05: Two coins are tossed “A” is the event of “getting two tails” and “B” is the event of “getting the
second coin shows head”. Evaluate 𝑃(𝐴 ∪ 𝐵). (Do it now, Answer: 3/4)
Problem 06: A dice is tossed once. Assume “A” is the event of getting “odd numbers” and “B” is the event of
getting “the number divisible by 3”. Calculate 𝑃(𝐴 ∪ 𝐵). (HW, Answer: 2/3)
Conditional Probability: Let 𝐴 and 𝐵 be two events such that 𝑃(𝐴) > 0. The probability of an event 𝐵 when
event 𝐴 is occurred is known as conditional probability. It is denoted by 𝑃(𝐵\𝐴). Since 𝐴 is known to have
occurred, it becomes the new sample space replacing the original 𝑆.
𝑃(𝐴∩𝐵)
𝑃(𝐵\𝐴) = (1)
𝑃(𝐴)
Problem 1: Compute the probability that a single toss of a dice will result in a number less than 4 if (a) no other
information is given and (b) it is given that the toss resulted in an odd number.
Solution:
Problem 2: Compute the probability that a single toss of a dice will result in a number less than 5 if (a) no other
information is given and (b) it is given that the toss resulted in an even number. (HW)
Problem 03: A study showed that 65% of the managers had some business education background and 50% of
the managers had engineering education background. Furthermore 20% of the managers had some business
education but no engineering education. Calculate the probability that a manager has some business education
when there is given that he has some engineering education.
𝑃(𝐴∩𝐵)
We know, 𝑃(𝐴\𝐵) = (1)
𝑃(𝐵)
65%
65 13
𝑃(𝐴) = 65% = 100 = 20
50 1
𝑃(𝐵) = 50% = 100 = 2
45 9
𝑃(𝐴 ∩ 𝐵) = 65% − 20% = 45% = 100 = 20.
Problem 04: A box contains 10 green and 8 red balls. Two balls are drawn successively at random from the box.
Find the probability that both balls will be red, considering both cases with and without replacement of the balls
during drawing.
Solution: A box contains 10 green and 8 red balls. The total number of balls is 10+8=18. If two balls are drawn
successively, two cases are raised.
8
Case I (with replacement): The probability of the first ball is to be red is . If the ball is replaced to the box,
18
the number of red balls and the number of total balls will remain same. Thus, the probability will be same for the
8 8 8 16
second ball drawn 18. Therefore, the probability of the both balls to be red is 18 × 18 = 81.
8
Case II (without replacement): The probability of the first ball is to be red is 18. If the ball is not replaced to the
box, the number of red balls will be 8-1=7 and the number of total balls will be 18-1=17. Thus, the probability of
7 8 7 56
drawing the second ball to be red is 17
. Therefore, the probability of the both balls to be red is 18 × 17 = 306.
Problem 05: A box contains 2 red and 3 blue marbles. Find the probability that if two marbles are drawn at
random successively (without replacement), (a) both are blue, (b) both are red, (c) the first marble is red and the
second is blue. (HW)
Problem 06: Find the probability of drawing 3 aces at random, successively, from a deck of 52 ordinary cards if
the cards are (a) replaced, (b) not replaced. (HW)
Problem 07: Two cards are drawn from a well-shuffled ordinary deck of 52 cards, successively. Find the
probability that they are both aces if the first card is (a) replaced, (b) not replaced. (HW)
Problem 08: Find the probability of getting a 7 or 11 total on either phases of a pair of fair dice.
Let A be the event of getting a total 7 and B be the event of getting a total 11.
We have to find 𝑃(𝐴 ∪ 𝐵). As the event A and B are mutually exclusive (i.e., 𝐴 ∩ 𝐵 = {})
𝑛(𝐴) 𝑛(𝐵)
or, 𝑃(𝐴 ∪ 𝐵) = + 𝑛(𝑆)
𝑛(𝑆)
6 2 8 2
or, 𝑃(𝐴 ∪ 𝐵) = 36 + 36 = 36 = 9.
Problem 09: Find the probability of getting 5 in a face or a greater than 10 total on either of two tosses of a pair
of fair dice. (HW)
Independent event: If 𝑃(𝐵\𝐴) = 𝑃(𝐵), i.e., the probability of 𝐵 occurring is not affected by the occurrence or
non-occurrence of 𝐴, then we say that 𝐴 and 𝐵 are independent events. This is equivalent to 𝑃(𝐴 ∩ 𝐵) =
𝑃(𝐴)𝑃(𝐵).
Bayes’ theorem: Suppose that 𝐴1 , 𝐴2 , . . . , 𝐴𝑛 are mutually exclusive events with 𝑃(𝐴𝑖 ) ≠ 0whose union is the
sample space 𝑆. Then for an event 𝐴 that occurs when the experiment is performed, such that 𝑃(𝐴) > 0, we have
𝑃(𝐴𝑖 )𝑃(𝐴/𝐴𝑖 )
𝑃(𝐴𝑖 /𝐴) = ∑𝑛 .
𝑖=1 𝑃(𝐴𝑖 )𝑃(𝐴/𝐴𝑖 )
Problem 1: There are three identical boxes containing, 4 white and 3 red balls, 3 white and 7 red balls, 5 white
and 4 red balls, respectively. A box is chosen at random and a ball is drawn from it. If the ball is white, find the
probability that is of the first box.
Random Variable: Suppose we have a sample space 𝑆. Let we be assigning a number for each sample points.
Thus, we can define a function on sample space. This function is called random variable or more precisely random
function (Stochastic function). It is usually denoted by capital letter such as 𝑋 or 𝑌.
Example:
Probability distribution:
Mathematical Expectation
Definition: Mathematical expectation or expected value or expectation of random variable for a discrete
random variable 𝑋 with the possible values 𝑥1 , 𝑥2 , 𝑥3 , . . . , 𝑥𝑛 is denoted as 𝐸(𝑋) and defined as
Problems:
𝑋 0 1 2 3
𝑃(𝑋 = 𝑥) 0.2 0.4 0.25 0.15
We know the expected value or mathematical expectation or expectation
𝐸(𝑋) = ∑𝑛𝑖=1 𝑥𝑖 𝑃(𝑋 = 𝑥𝑖 )
or, 𝐸(𝑋) = 𝑥1 𝑃(𝑥1 ) + 𝑥2𝑃(𝑥2 ) + 𝑥3 𝑃(𝑥3) + 𝑥4𝑃(𝑥4 )
or, 𝐸(𝑋) = 0(0.2) + 1(0.4) + 2(0.25) + 3(0.15)
or, 𝐸(𝑋) = 0 + 0.4 + 0.50 + 0.45
or, 𝐸(𝑋) = 1.35
2. Let 𝑋 be a random variable with probability distribution function as follows:
𝑋 0 1 2 3
𝑓(𝑋 = 𝑥) 1 1 0 1
3 2 6
Find the values of 𝐸(𝑋), 𝐸(𝑎𝑋 + 𝑏) and 𝐸((𝑋 − 1)2 ).
Solution: For random variable 𝑋
𝑋 0 1 2 3
𝑓(𝑋 = 𝑥) 1 1 0 1
3 2 6
Again 𝐸(𝑎𝑋 + 𝑏) = (𝑎𝑥1 + 𝑏)𝑓(𝑥1) + (𝑎𝑥2 + 𝑏)𝑓(𝑥2 ) + (𝑎𝑥3 + 𝑏)𝑓(𝑥3 ) + (𝑎𝑥4 + 𝑏)𝑓(𝑥4 )
1 1 1
or, 𝐸(𝑎𝑋 + 𝑏) = (𝑎. 0 + 𝑏) 3 + (𝑎. 1 + 𝑏) 2 + (𝑎. 2 + 𝑏).0 + (𝑎. 3 + 𝑏) 6
1 1 1 1 1
or, 𝐸(𝑎𝑋 + 𝑏) = 3 𝑏 + 2 𝑎 + 2 𝑏 + 2 𝑎 + 6 𝑏
or, 𝐸(𝑎𝑋 + 𝑏) = 𝑎 + 𝑏
Further, 𝐸((𝑋 − 1)2 ) = (𝑥1 − 1)2 𝑓(𝑥1) + (𝑥2 − 1)2 𝑓(𝑥2 ) + (𝑥3 − 1)2 𝑓(𝑥3) + (𝑥4 − 1)2 𝑓(𝑥4 )
1 1 1
or, 𝐸((𝑋 − 1)2 ) = (0 − 1)2 + (1 − 1)2 + (2 − 1)2 . 0 + (3 − 1)2
3 2 6
1 1 21
or, 𝐸((𝑋 − 1)2 ) = 1. 3 + 0. 2 + 2 6
1 4
or, 𝐸((𝑋 − 1)2 ) = 3 + 6
Definition:
Problem 03: The mean of a binomial distribution is 20 and the standard deviation is 4. Compute n, p, q.
Solution: It is given that the mean and standard deviation of a binomial distribution are 20 and 4, respectively.
For binomial distribution, it is also known to us Mean, 𝜇 = 𝑛𝑝 and standard deviation, 𝜎 = √𝑛𝑝𝑞.
Thus 𝑛𝑝 = 20.
And √𝑛𝑝𝑞 = 4
or, 𝑛𝑝𝑞 = 16
or, 20𝑞 = 16
16
or, 𝑞 = 20
4
or, 𝑞 = 5.
4 1
For binomial distribution, 𝑝 + 𝑞 = 1, or, 𝑝 = 1 − 𝑞 = 1 − 5 = 5.
20 20
Again 𝑛 = = 1 = 20 × 5 = 100.
𝑝
5
Problem 04: The mean of a binomial distribution is 40 and the standard deviation is 5. Compute n, p, q. (HW)
Problem 05: The mean of a binomial distribution is 40 and the variance is 20. Compute n, p, q. (HW)
Normal Distribution
Definition:
Mean: 𝜇 = 𝑚; variance: 𝜎 2 = 𝑚
Poisson Distribution
𝑒 −𝑚 𝑚 𝑥
Definition: The Poisson distribution is given by 𝑓(𝑥) = 𝑃(𝑋 = 𝑥) = ; 𝑥 = 0,1,2, . . . , 𝑚. Where 𝑚is called
𝑥!
parameter of the distribution and is the average number of occurrences of the random event 𝑥.
Mean: 𝜇 = 𝑚; variance: 𝜎 2 = 𝑚
Test of hypothesis
𝑋 0 1 2 3 4
𝑃(𝑋 = 𝑥) 0.1 0.35 0.25 0.15 0.15