Unit 9
Definition:-
- A histogram is a graphical representation of a frequency distribution in which a variable (V) is plotted on the x-axis and the
frequency (f) is plotted on the y-axis.
3. From the above frequency distribution construct a histogram.
[Figure: histogram with the variable on the x-axis and the frequency on the y-axis]
1. What is the highest variable (variant)? 72 and 88.
2. What is the highest frequency? 4.
3. How many variants occur 3 times? 2.
4. What percent of the students scored less than 80? 1 + 1 + 2 + 2 + 4 + 4 % = 14%
72
=
Exercise:- 2
1. What is the difference between a frequency distribution table and a histogram?
2. The ages (to the nearest year) of 40 children in a certain village are as follows
10 7 4 5 1 9 3 6 5 4
2 7 5 3 2 5 6 2 8 9
5 8 9 9 5 2 1 3 9 4
3 5 7 9 6 3 6 8 1 2
- Prepare a frequency distribution table and histogram for the given data.
3. A sample of 50 couples married for 10 years were asked how many children they had. The result of the survey is as follows:-
0 4 2 2 1 0 3 3 2 4
3 3 1 3 3 3 3 3 2 2
1 3 3 2 4 1 3 5 2 2
2 0 0 2 1 2 2 2 3 2
3 3 3 4 3 3 1 0 3 2
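As a rough illustration of how a frequency distribution table and a histogram can be built from raw data, the sketch below tallies the ages listed in exercise 2. The function names `frequency_table` and `text_histogram` are made up for this illustration, and the bars are drawn with `*` characters instead of a real plot.

```python
from collections import Counter

# Ages of the 40 children from exercise 2.
ages = [
    10, 7, 4, 5, 1, 9, 3, 6, 5, 4,
    2, 7, 5, 3, 2, 5, 6, 2, 8, 9,
    5, 8, 9, 9, 5, 2, 1, 3, 9, 4,
    3, 5, 7, 9, 6, 3, 6, 8, 1, 2,
]

def frequency_table(data):
    """Return (value, frequency) pairs sorted by value."""
    return sorted(Counter(data).items())

def text_histogram(table):
    """Return one line per value, with a bar of '*' as long as its frequency."""
    return [f"{value:>3} | {'*' * freq} ({freq})" for value, freq in table]

table = frequency_table(ages)
for line in text_histogram(table):
    print(line)
```

The variable (age) runs down the left side and the length of each bar is the frequency, so this is the histogram turned on its side.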
Properties of mean
1. The sum of the deviations of the individual values from the mean is zero, that is, $\sum (x_i - \bar{x}) = 0$.
Example:-
1. Let the ages of 5 children be 2,3,6,9,10. Then find the sum of the deviations from the mean.
Solution
$$\bar{x} = \frac{x_1 + x_2 + x_3 + \dots + x_n}{n} = \frac{2 + 3 + 6 + 9 + 10}{5} = 6$$
So, the sum of the deviation from the mean = (2-6)+ ( 3-6)+ (6-6)+ (9-6)+ (10-6) = 0
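The same check can be sketched in a few lines of code. The helper name `sum_of_deviations` is an assumption made for this illustration; with data whose mean is not a whole number, floating-point rounding may leave a tiny non-zero remainder.

```python
def sum_of_deviations(values):
    """Sum of (x - mean) over all values of a non-empty list."""
    mean = sum(values) / len(values)
    return sum(x - mean for x in values)

ages = [2, 3, 6, 9, 10]          # ages of the 5 children in the example
print(sum_of_deviations(ages))   # the deviations -4, -3, 0, 3, 4 cancel out
```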
2. The mean of the sum or difference of two population functions (with an equal number of observations) is equal to the sum or difference of the means of the two population functions.
That is:-
$$\overline{x \pm y} = \bar{x} \pm \bar{y}$$
Example:-
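1. The mean of 2, 4, 6, 8 is 5 and the mean of 5, 7, 9, 7 is 7. Find the mean of 7, 11, 15, 15.
Solution
Each value is the sum of the corresponding values of the two data sets: 7 = 2 + 5, 11 = 4 + 7, 15 = 6 + 9 and 15 = 8 + 7. So,
The mean of 7, 11, 15, 15 = mean of 2, 4, 6, 8 + mean of 5, 7, 9, 7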
=5+7
= 12
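A short sketch of this property, using the two data sets from the example. The helper `mean` is defined here only for illustration.

```python
def mean(values):
    """Arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)

x = [2, 4, 6, 8]
y = [5, 7, 9, 7]

# Add the two data sets value by value (they must have the same length).
sums = [a + b for a, b in zip(x, y)]

print(sums)                 # the combined data set
print(mean(sums))           # mean of the sums
print(mean(x) + mean(y))    # sum of the two means
```

Both printed means agree, which is what the property states.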
3. If a constant K is added to (or subtracted from) each data value, then the new mean is the sum (or the difference) of the old mean and the constant K.
New $\bar{x}$ = Old $\bar{x}$ ± K
4. The mean of a constant times a population function is equal to the constant times the mean of the population function.
New $\bar{x}$ = K × Old $\bar{x}$
5. If each member of a population function with mean $\bar{x}$ is divided by K (K ≠ 0), then the new mean will be $\frac{\bar{x}}{K}$ or $\frac{1}{K}\bar{x}$.
Example
1. Consider the amount of money, in birr, in the pockets of five students: 3, 7, 8, 12, 15. The mean is:-
$$\bar{x} = \frac{x_1 + x_2 + x_3 + \dots + x_n}{n} = \frac{3 + 7 + 8 + 12 + 15}{5} = 9$$
a. If we add birr 2 for each student, the new mean = 9 + 2 = 11
b. If we take birr 2 from each student, the new mean = 9 – 2 = 7
c. If we multiply the amount of money in the pocket of each student by 2, the new mean = 9 x 2 = 18
d. If the amount in the pocket of each student is divided by 2, the new mean = 9 ÷ 2 = 4.5
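The four cases of this example can be recomputed directly from the data rather than from the old mean. This is only a sketch: the helper `mean` and the variable names are chosen for the illustration.

```python
def mean(values):
    """Arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)

money = [3, 7, 8, 12, 15]   # birr in each student's pocket (from the example)
k = 2                       # the constant used in the example

added = mean([m + k for m in money])     # every student gets 2 birr more
removed = mean([m - k for m in money])   # every student gives up 2 birr
doubled = mean([m * k for m in money])   # every amount is multiplied by 2
halved = mean([m / k for m in money])    # every amount is divided by 2

print(mean(money), added, removed, doubled, halved)
```

Each result equals the old mean after the same operation has been applied to it, for example `halved` equals `mean(money) / k`.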
1. Median
Median: the value that lies in the middle of the data when it is arranged in ascending or descending order. So, half of the data is below the median and half of the data is above the median.
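Remark :-
1. If the number of data values n is odd, then the median can be calculated by:-
$$\text{M.d} = \left(\frac{n+1}{2}\right)^{th} \text{ value}$$
2. If the number of data values n is even, then the median is the average of the two middle values:-
$$\text{M.d} = \frac{\left(\frac{n}{2}\right)^{th} \text{ value} + \left(\frac{n}{2} + 1\right)^{th} \text{ value}}{2}$$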
Example:-
1. Find the median of 6,7,9,7,11,13,15.
Solution
- When we rearrange the data 6,7,9,7,11,13,15 it becomes 6,7,7,9,11,13,15. Since n = 7 is odd, the median of the data is
$$\text{M.d} = \left(\frac{7+1}{2}\right)^{th} \text{ value} = 4^{th} \text{ value}$$
Counting 4 values from either the left or the right, we get the value 9. So, the median is 9.
Note:-
a. The median can be obtained even when some of the data values are not known.
b. It is not affected by extreme value.
c. It is unique for a given data set.
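The two cases of the remark (odd and even number of values) can be sketched as one small function. The name `median` and the even-sized data set in the last line are illustrative assumptions; only the seven-value data set comes from the example.

```python
def median(values):
    """Median of a non-empty list: sort, then take the middle value(s)."""
    ordered = sorted(values)
    n = len(ordered)
    middle = n // 2
    if n % 2 == 1:
        # Odd n: the ((n + 1) / 2)th value, which is index n // 2 when counting from 0.
        return ordered[middle]
    # Even n: the average of the (n / 2)th and (n / 2 + 1)th values.
    return (ordered[middle - 1] + ordered[middle]) / 2

print(median([6, 7, 9, 7, 11, 13, 15]))   # data from the example
print(median([6, 7, 7, 9]))               # hypothetical even-sized data set
```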
2. Mode:-
is the value of the variable which occurs most frequently in a data set. A set of data can have
1. No mode
2. One mode (unimodal)
3. Two modes (bimodal)
4. More than two modes (multimodal)
Example:-
1. Find the mode of each of the following data sets.
a. 4,6,12,10,17. No mode
b. 12,10,11,13,10,14,12,18,17. Two modes (10 and 12).
c. 9,8,7,10,6,8. One mode that is 8.
Note:-
1. The mode is not always unique.
2. It is not affected by extreme value.
3. The mode can also be used for qualitative data.
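A minimal sketch of finding the mode, following the convention of the example above that a data set in which every value occurs once has no mode. The function name `modes` is an assumption; it returns a list so that zero, one, or several modes can all be reported.

```python
from collections import Counter

def modes(values):
    """Return the sorted list of modes of a non-empty data set ([] if there is none)."""
    counts = Counter(values)
    highest = max(counts.values())
    if highest == 1:
        return []   # every value occurs exactly once: no mode
    return sorted(v for v, c in counts.items() if c == highest)

print(modes([4, 6, 12, 10, 17]))                       # no mode
print(modes([12, 10, 11, 13, 10, 14, 12, 18, 17]))     # two modes
print(modes([9, 8, 7, 10, 6, 8]))                      # one mode
```

Because `Counter` only needs values that can be compared for equality, a variant of this sketch without the final sort would also work for qualitative data such as colours.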
Measure of Dispersion for Ungrouped data.
Dispersion or variation is the scatter (or spread) of data values from a measure of central tendency.
The three measures of dispersion are range, variance and standard deviation.
1. Range:-
is the difference between the highest and the lowest value for the given data.
That is :-
Range = Highest value – lowest value
Example:-
1. The ages of six students are 24, 20, 18, 13, 16 and 15 years respectively. What is the range?
Range = Maximum Value – Minimum Value
= 24 – 13
= 11
2. Variance (s²)
is the average of the squared deviations of each value from the arithmetic mean.
a. If x1, x2, x3, ..., xn are n observed values, then the variance for simple data is given by:-
$$s^2 = \frac{(x_1 - \bar{x})^2 + (x_2 - \bar{x})^2 + \dots + (x_n - \bar{x})^2}{n} = \frac{\sum (x_i - \bar{x})^2}{n}$$
Where i = 1, 2, 3, ..., n.
b. If x1, x2, x3, ..., xn are values of n observations with corresponding frequencies f1, f2, f3, ..., fn, then the variance is given by:-
$$s^2 = \frac{f_1 (x_1 - \bar{x})^2 + f_2 (x_2 - \bar{x})^2 + \dots + f_n (x_n - \bar{x})^2}{f_1 + f_2 + \dots + f_n} = \frac{\sum f_i (x_i - \bar{x})^2}{\sum f_i}$$
Where i = 1, 2, 3, ..., n.
3. Standard Deviation (S.d)
is the positive square root of the variance, that is, of the average of the squared deviations of each value from the arithmetic mean.
$$\text{S.d} = \sqrt{s^2}$$
Example:-
1. From the following observations find the variance and standard deviation of 2,5,6,7,10.
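Solution:-
$$\bar{x} = \frac{\sum x_i}{n} = \frac{2 + 5 + 6 + 7 + 10}{5} = 6$$
x            2           5           6          7          10
x - x̄       2-6 = -4    5-6 = -1    6-6 = 0    7-6 = 1    10-6 = 4
(x - x̄)²    16          1           0          1          16          ∑(x - x̄)² = 34
a. $$s^2 = \frac{\sum (x - \bar{x})^2}{n} = \frac{34}{5} = 6.8$$
b. $$\text{S.d} = \sqrt{s^2} = \sqrt{\frac{34}{5}} = \sqrt{6.8} \approx 2.61$$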
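The three measures of dispersion can be sketched as follows, dividing by n as in the formula for simple data. The function names are chosen for this illustration; the data sets are the ones used in the range and variance examples.

```python
import math

def data_range(values):
    """Highest value minus lowest value."""
    return max(values) - min(values)

def variance(values):
    """Average of the squared deviations from the mean (divides by n)."""
    n = len(values)
    mean = sum(values) / n
    return sum((x - mean) ** 2 for x in values) / n

def standard_deviation(values):
    """Positive square root of the variance."""
    return math.sqrt(variance(values))

print(data_range([24, 20, 18, 13, 16, 15]))   # range example
print(variance([2, 5, 6, 7, 10]))             # variance example
print(standard_deviation([2, 5, 6, 7, 10]))   # standard deviation example
```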
Properties of variance and standard deviation
1. If a constant C is added to each value of a population function, then the new variance is the same as the old variance. The new standard deviation is also the same as the old standard deviation.
That is:-
New variance = old variance
New standard deviation = old standard deviation
2. If each value of a population function is multiplied by a constant C, then the new variance is C² times the old variance and the new standard deviation is |C| times the old standard deviation.
That is:-
New variance = C² x old variance
New standard deviation = |C| x old standard deviation
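These two properties can be checked numerically. The sketch below uses `pvariance` and `pstdev` from Python's standard `statistics` module, which divide by n like the formulas in this unit; the data set is the one from the variance example and the constant C = -3 is a hypothetical value, chosen negative to show why |C| is needed.

```python
import math
import statistics

data = [2, 5, 6, 7, 10]   # data from the variance example
c = -3                    # hypothetical constant

shifted = [x + c for x in data]   # C added to every value
scaled = [x * c for x in data]    # every value multiplied by C

old_var = statistics.pvariance(data)
old_sd = statistics.pstdev(data)

# Adding a constant leaves both measures unchanged.
print(math.isclose(statistics.pvariance(shifted), old_var))
print(math.isclose(statistics.pstdev(shifted), old_sd))

# Multiplying by C scales the variance by C squared and the standard deviation by |C|.
print(math.isclose(statistics.pvariance(scaled), c ** 2 * old_var))
print(math.isclose(statistics.pstdev(scaled), abs(c) * old_sd))
```

The comparisons use `math.isclose` because floating-point arithmetic may differ in the last digits.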
Exercise:-
1. Find the mean, mode and median of the following data: 11, 9, 14, 3, 11, 4, 10, 21, 8, 15, 35.
2. Find the range, variance, and standard deviation of the following data: 4, 2, 3, 3, 2, 1, 4, 3, 2, 6.
3. For the following frequency distribution of the population function V, find:
V 1 2 3 4 5
F 6 5 4 3 2
a. Population mode
b. Population mean
c. Population variance
d. Population standard deviation.
4. Given below is a frequency distribution of a population function V. Find
V −16 −14 0 1
2
1
3
F 12 16 21 8 27
a. The mean
b. The mode
c. The median
d. The percent of the population with a non-negative value.
e. The percent of the population with a negative value.
5. If the mean of 4, 6, 7, 10, 9, x, is 7, then find the value of x and the mode.
6. What is the mean deviation of the numbers 1, 3, 5, 4, 2?
7. What is the value of y, if the standard deviation of 8, 8, 8, 8, y, 8 is 0?
8. If the variance of a, b ,c, d is k, then what is
a. The variance of a + c, b + c, 2c, d + c?
b. The standard deviation of a + c, b + c, 2c, d + c?
Probability
Introduction
In the first section of this unit we introduced the basic concepts of statistics and its subdivisions, descriptive and inferential statistics. The basic tool of inferential statistics is probability. In the second section of this unit we study the basic concepts of probability and introduce them using some examples.
Definition:-
Probability is the study of chance of occurrence of events.
Basic Concepts in Probability
Consider a single coin that has two sides (faces). If we toss the coin, the experiment has two possible outcomes: head (H) and tail (T). Exactly one of the two shows up, and both are equally likely, so the probability of getting H is ½ and that of getting T is ½.
Consider a single die whose sides are labeled 1, 2, 3, 4, 5 and 6. If we roll the die, exactly one of the sides shows up. In this case there are six possible outcomes: 1, 2, 3, 4, 5 or 6. Hence the probability of getting any one particular side is 1/6. For better understanding, we should be familiar with the following terms.
1. An experiment is a process in which measurements are made and data are thereby collected.
2. The possibility set (sample space) of an experiment is the set that contains all the possible outcomes.
3. An event is any subset of the possibility set.
To illustrate the above explanation, consider the following summary of probability experiments.
Experiment              Event                                          Sample space (U)
Toss a coin             Head or tail (H or T)                          {H, T}
Toss two coins          The first head or tail, or the second head     {HH, HT, TH, TT}
                        or tail, or HH or HT or TH or TT
Roll a die              Odd numbers or even numbers, 1 or 2 or 3       {1, 2, 3, 4, 5, 6}
                        or 4 or 5 or 6
Answer a true or        T or F                                         {T, F}
false question
Definition:-
If an experiment has n equally likely outcomes and if m of these represent a particular event, then the probability of this event
occurring is m/n.
Example:-
1. In an experiment of selecting students at random, a researcher found the following results after 50 trials.
Student Boy Girl Total
Number 20 30 50
a. What is the probability that a randomly selected student is a girl?
Solution
n(G) = 30 = m
n(S) = 50 = n
$$p(G) = \frac{m}{n} = \frac{30}{50} = \frac{3}{5}$$
2. In an experiment of tossing two coins,
a. What are the possible outcomes?
b. How many different possible outcomes are there?
c. What is the probability of the coins landing with
1. Two heads
2. One head
3. Two tails
Solution:-
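[Tree diagram: the first coin branches into H and T; each branch then splits again for the second coin, giving the outcomes HH, HT, TH and TT]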
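a. Possible outcomes = {HH, HT, TH, TT}
b. 4
c. 1. $\frac{1}{4}$
2. $\frac{3}{4}$, counting every outcome that shows a head (HH, HT, TH); exactly one head (HT, TH) would give $\frac{2}{4} = \frac{1}{2}$
3. $\frac{1}{4}$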
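The possibility set for two coins can also be generated mechanically, which makes it easy to count the outcomes of each event. This is only a sketch: the helper `probability` and the way events are written as small test functions are assumptions of the illustration, and the coins are assumed fair so that all four outcomes are equally likely.

```python
from fractions import Fraction
from itertools import product

# Each outcome is a string such as "HT": first coin H, second coin T.
sample_space = ["".join(pair) for pair in product("HT", repeat=2)]

def probability(event):
    """m / n, where m counts the outcomes for which event(outcome) is true."""
    favourable = [o for o in sample_space if event(o)]
    return Fraction(len(favourable), len(sample_space))

two_heads = probability(lambda o: o.count("H") == 2)
at_least_one_head = probability(lambda o: o.count("H") >= 1)
exactly_one_head = probability(lambda o: o.count("H") == 1)
two_tails = probability(lambda o: o.count("T") == 2)

print(sample_space)
print(two_heads, at_least_one_head, exactly_one_head, two_tails)
```

Using `Fraction` keeps the answers as exact fractions instead of decimals.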
Theoretical Probability Of An Event
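Let U be the sample space of an experiment and E an event. Then the theoretical probability of E is defined by:-
$$P(E) = \frac{n(E)}{n(U)}$$
where n(E) is the number of outcomes in E and n(U) is the number of outcomes in U.
An event lies between failure (it cannot occur) and certainty (it must occur). Since E is a subset of U, 0 ≤ n(E) ≤ n(U).
Hence, $$\frac{0}{n(U)} \le \frac{n(E)}{n(U)} \le \frac{n(U)}{n(U)}$$
$$0 \le \frac{n(E)}{n(U)} \le 1$$
Here 0 corresponds to failure and 1 to a certain event.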
Example:-
2. If you roll a die and observe the number appearing on the upper face of the die, then what is the probability of seeing an odd number?
Solution :- U = {1,2,3,4,5,6}, so n(U) = 6
E = {1,3,5}, so n(E) = 3
$$P(E) = \frac{n(E)}{n(U)} = \frac{3}{6} = \frac{1}{2}$$
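The ratio n(E)/n(U) can be sketched with Python sets. The function name `theoretical_probability` is an assumption for this illustration, and it presumes that all outcomes in U are equally likely.

```python
from fractions import Fraction

def theoretical_probability(event, sample_space):
    """Return n(E) / n(U) for an event E that is a subset of a non-empty U."""
    event, sample_space = set(event), set(sample_space)
    if not event <= sample_space:
        raise ValueError("the event must be a subset of the sample space")
    return Fraction(len(event), len(sample_space))

U = {1, 2, 3, 4, 5, 6}   # faces of the die
E = {1, 3, 5}            # odd numbers

print(theoretical_probability(E, U))       # odd number
print(theoretical_probability(set(), U))   # an event that cannot occur
print(theoretical_probability(U, U))       # a certain event
```

The last two calls show the two ends of the range: the empty event has probability 0 and the whole sample space has probability 1.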
3. A jar contains three white balls, one red ball, and one yellow ball. If two balls are taken at random from the jar, then what
is the probability that neither ball is white?
Solution :
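"Neither ball is white" means that both balls are non-white, so one ball is red and the other is yellow.
[Tree diagram: the first ball (W1, W2, W3, R or Y) branches to each of the remaining balls, so that every pair of balls is listed once]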
- Possible outcomes = {W1W2, W1W3, W1R, W1Y, W2W3, W2R, W2Y, W3R, W3Y, RY}.
- Event = {RY}
- Number of possible outcomes = 10
- Number of outcomes in the event = 1
Thus:-
$$P(E) = \frac{n(E)}{n(U)} = \frac{1}{10}$$
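The list of pairs above can be reproduced with `itertools.combinations`, which picks two balls without regard to order. The labels W1, W2, W3, R and Y follow the solution; treating every pair as equally likely is the assumption behind the count.

```python
from fractions import Fraction
from itertools import combinations

balls = ["W1", "W2", "W3", "R", "Y"]

# Every way of taking two different balls, ignoring the order they are drawn in.
outcomes = list(combinations(balls, 2))

# Keep only the pairs in which no ball is white.
no_white = [pair for pair in outcomes
            if not any(ball.startswith("W") for ball in pair)]

p_no_white = Fraction(len(no_white), len(outcomes))
print(len(outcomes), no_white, p_no_white)
```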
Exercise:-
1. Two dice are simultaneously thrown once. List the ways in which the following events can occur.
a. A = the same number is shown on each die.
b. B = the sum of the numbers is thirteen.
c. C = the product of the two numbers is 1s
d. D = the quotient of the two numbers is 7
2. Three coins are tossed at the same time. Sketch a tree diagram for the outcomes of this experiment. What is the possibility set?
3. The bag contains four red balls and three black balls. What is the possibility set for color, if two balls are chosen at random?
4. A fair die is rolled once. Calculate the probability of getting:-
a. An odd number
b. A score of 5
c. A prime number
d. A score of 0
5. A number is selected randomly from the set of whole numbers 1 – 20 both inclusive. Find the probability that the number
selected is:-
a. Even
b. A multiple of 2 and 3
c. A multiple of 3
d. Even or odd
e. The square of 2
f. The square of 6
6. A bag contains 5 red balls, 3 black balls and 4 white balls. A ball is drawn out of the bag at random. What is the probability that the ball drawn is:-
a. White?
b. Red?
c. Black?
7. A bag contains 100 identical cards on which the numbers 1 to 100 are marked. A card is drawn out of the bag randomly. Find the probability that the number on the card is:-
a. Greater than 32
b. A factor of 24
c. Less than 76