Quantitative Methods Notes
Quantitative Methods Notes
METHODS
QUANTITATIVE
METHODS
NOTES
School of Distance Education
CONTENTS
1 QUANTITATIVE TECHNIQUES 5
2 CORRELATION ANALYSIS 11
3 REGRESSION ANALYSIS 34
4 THEORY OF PROBABILITY 49
5 PROBABILITY DISTRIBUTION 72
6 BINOMIAL DISTRIBUTION 75
7 POISSON DISTRIBUTION 83
8 NORMAL DISTRIBUTION 87
9 TESTING OF HYPOTHESIS 94
CHAPTER 1
QUANTITATIVE TECHNIQUES
Meaning and Definition:
Quantitative techniques may be defined as those techniques which provide the decision
makes a systematic and powerful means of analysis, based on quantitative data. It is a scientific
method employed for problem solving and decision making by the management. With the help of
quantitative techniques, the decision maker is able to explore policies for attaining the
predetermined objectives. In short, quantitative techniques are inevitable in decision-making
process.
There are different types of quantitative techniques. We can classify them into three
categories. They are:
A technique in which quantitative data are used along with the principles of mathematics
is known as mathematical quantitative techniques. Mathematical quantitative techniques involve:
=
2. Set Theory:-
Set theory is a modern mathematical device which solves various types of critical
problems.
4. Determinants:
It is a powerful device developed over the matrix algebra. This device is used for finding
out values of different variables connected with a number of simultaneous equations.
5. Differentiation:
It is a mathematical process of finding out changes in the dependent variable with
reference to a small change in the independent variable.
6. Integration:
7. Differential Equation:
1. Collection of data:
One of the important statistical methods is collection of data. There are different methods
for collecting primary and secondary data.
Correlation is used to study the degree of relationship among two or more variables. On
the other hand, regression technique is used to estimate the value of one variable for a given
value of another.
Index numbers measure the fluctuations in various Phenomena like price, production etc
over a period of time, They are described as economic barometres.
8. Ratio Analysis:
Ratio analysis is used for analyzing financial statements of any business or industrial
concerns which help to take appropriate decisions.
9. Probability Theory:
Theory of probability provides numerical values of the likely hood of the occurrence of
events.
10. Testing of Hypothesis
Testing of hypothesis is an important statistical tool to judge the reliability of inferences
drawn on the basis of sample studies.
Programming Techniques:
1. Linear Programming:
2. Queuing Theory:
Queuing theory deals with mathematical study of queues. It aims at minimizing cost of
both servicing and waiting.
Inventory theory helps for optimizing the inventory levels. It focuses on minimizing cost
associated with holding of inventories.
7. Simulation:
8. Replacement Theory:
10. Sequencing:
Sequencing tool is used to determine a sequence in which given jobs should be performed
by minimizing the total efforts.
Quantitative techniques render valuable services in the field of business and industry.
Today, all decisions in business and industry are made with the help of quantitative techniques.
Some important uses of quantitative techniques in the field of business and industry are
given below:
2. Inventory control techniques are useful in dividing when and how much items are to
be purchase so as to maintain a balance between the cost of holding and cost of
ordering the inventory
3. Quantitative techniques of CPM, and PERT helps in determining the earliest and the
latest times for the events and activities of a project. This helps the management in
proper deployment of resources.
4. Decision tree analysis and simulation technique help the management in taking the
best possible course of action under the conditions of risks and uncertainty.
5. Queuing theory is used to minimize the cost of waiting and servicing of the
customers in queues.
Even though the quantitative techniques are inevitable in decision-making process, they
are not free from short comings. The following are the important limitations of quantitative
techniques:
Quantitative Techniques for Business 9
School of Distance Education
1. Quantitative techniques involves mathematical models, equations and other mathematical
expressions
2. Quantitative techniques are based on number of assumptions. Therefore, due care must be
ensured while using quantitative techniques, otherwise it will lead to wrong conclusions.
3. Quantitative techniques are very expensive.
4. Quantitative techniques do not take into consideration intangible facts like skill, attitude
etc.
5. Quantitative techniques are only tools for analysis and decision-making. They are not
decisions itself.
CHAPTER - 2
CORRELEATION ANALYSIS
Introduction:
In practice, we may come across with lot of situations which need statistical analysis of
either one or more variables. The data concerned with one variable only is called univariate data.
For Example: Price, income, demand, production, weight, height marks etc are concerned with
one variable only. The analysis of such data is called univariate analysis.
The data concerned with two variables are called bivariate data. For example: rainfall and
agriculture; income and consumption; price and demand; height and weight etc. The analysis of
these two sets of data is called bivariate analysis.
The date concerned with three or more variables are called multivariate date. For
example: agricultural production is influenced by rainfall, quality of soil, fertilizer etc.
The statistical technique which can be used to study the relationship between two or more
variables is called correlation analysis.
Definition:
Two or more variables are said to be correlated if the change in one variable results in a
corresponding change in the other variable.
According to Simpson and Kafka, Correlation analysis deals with the association
between two or more variables.
Lun chou defines, Correlation analysis attempts to determine the degree of relationship between
variables.
Boddington states that Whenever some definite connection exists between two or more
groups or classes of series of data, there is said to be correlation.
In nut shell, correlation analysis is an analysis which helps to determine the degree of
relationship exists between two or more variables.
Correlation Coefficient:
Correlation analysis is actually an attempt to find a numerical value to express the extent
of relationship exists between two or more variables. The numerical measurement showing the
degree of correlation between two or more variables is called correlation coefficient. Correlation
coefficient ranges between -1 and +1.
Eg: 1) A: 10 20 30 40 50
B: 80 100 150 170 200
2) X: 78 60 52 46 38
Y: 20 18 14 10 5
Negative Correlation:
When the variables are moving in opposite direction, it is called negative correlation. In
other words, if an increase in the value of one variable is accompanied by a decrease in the value
of other variable or if a decrease in the value of one variable is accompanied by an increase in the
value of other variable, it is called negative correlation.
Eg: 1) A: 5 10 15 20 25
B: 16 10 8 6 2
Linear Correlation
In a correlation analysis, if the ratio of change between the two sets of variables is same,
then it is called linear correlation.
For example when 10% increase in one variable is accompanied by 10% increase in the
other variable, it is the problem of linear correlation.
X: 10 15 30 60
Y: 50 75 150 300
Here the ratio of change between X and Y is the same. When we plot the data in graph
paper, all the plotted points would fall on a straight line.
Non-linear correlation
In a correlation analysis if the amount of change in one variable does not bring the same
ratio of change in the other variable, it is called non linear correlation.
X: 2 4 6 10 15
Y: 8 10 18 22 26
Here the change in the value of X does not being the same proportionate change in the value of Y.
Degrees of correlation:
If an increase in the value of one variable is followed by the same proportion of increase in
other related variable or if a decrease in the value of one variable is followed by the same
proportion of decrease in other related variable, it is perfect positive correlation. eg: if 10% rise in
price of a commodity results in 10% rise in its supply, the correlation is perfectly positive.
Similarly, if 5% full in price results in 5% fall in supply, the correlation is perfectly positive.
If an increase in the value of one variable is followed by the same proportion of decrease
in other related variable or if a decrease in the value of one variable is followed by the same
proportion of increase in other related variably it is Perfect Negative Correlation. For example if
10% rise in price results in 10% fall in its demand the correlation is perfectly negative. Similarly
if 5% fall in price results in 5% increase in demand, the correlation is perfectly negative.
For example, if 10% rise in price of a commodity results in 5% rise in its supply, it is
limited degree of positive correlation. Similarly if 10% fall in price of a commodity results in 5%
fall in its supply, it is limited degree of positive correlation.
For example, if 10% rise in price results in 5% fall in its demand, it is limited degree of
negative correlation. Similarly, if 5% fall in price results in 10% increase in demand, it is limited
degree of negative correlation.
I Graphic Methods
1) Scatter Diagram
2) Correlation graph
Scatter Diagram
This is the simplest method for ascertaining the correlation between variables. Under this
method all the values of the two variable are plotted in a chart in the form of dots. Therefore, it is
also known as dot chart. By observing the scatter of the various dots, we can form an idea that
whether the variables are related or not.
A scatter diagram indicates the direction of correlation and tells us how closely the two
variables under study are related. The greater the scatter of the dots, the lower is the relationship
Y Y
X X
X X
X X
X X
X X
X X
X X
X X
0 X 0 X
Perfect Positive Correlation Perfect Negative Correlation
Y X Y X
X X
X X
X X
X X
X X
X X
X X
X X
X X
X X
X X
X X
X X
X
X X
0 X
0 X
High Degree of Negative Correlation
High Degree of Positive Correlation
Y X Y X
X X X
X X X
X X X X
X X X X
X X X X
X X X X
X X X X
X X X X
X X X X
X X X X
X X X
X X X X
X X X X
X
X X X X
X
0 X
0 X
Low Degree of Negative Correlation
Low Degree of Positive Correlation
Y
X X X X X
X X X X X
X X X X X
X X X X X
X X X X X X
0 X
No Correlation (r = 0)
r = Covariance of X and Y
(SD of X) x (SD of Y)
Interpretation of Co-efficient of Correlation
Pearsons Co-efficient of correlation always lies between +1 and -1. The following
general rules will help to interpret the Co-efficient of correlation:
i.e r=
r=
or
Calculate Pearsons co-efficient of correlation between age and playing habits of students:
Age: 20 21 22 23 24 25
No. of students 500 400 300 240 200 160
Regular players 400 300 180 96 60 24
Let X = Age and Y = Percentage of regular players
Percentage of regular players can be calculated as follows:-
400 x 100 = 80; 300 x 100 = 75; 180 x 100 = 60; 96 x 100 = 40 ,
500 400 300 240
Pearsons Coefficient of
Correlation (r) =
= = = 22.5
= = = 50
r = = = = -0.9912
Assumed mean method:
Under assumed mean method, correlation coefficient is calculated by taking assumean only.
r=
Where dx = deviations of X from its assumed mean; dy= deviations of y from its assumed mean
Find out coefficient of correlation between size and defect in quality of shoes:
r=
r=
= = = -0.9485
Direct Method:
Under direct method, coefficient of correlation is calculated without taking actual mean or
assumed mean
r=
Price : 10 12 14 15 19
Demand (Qty) 40 41 48 60 50
= 70 = 239
r=
r=
r= = = +0.621
PE =
If the value of coefficient of correlation ( r) is less than the PE, then there is no evidence of
correlation.
If the value of r is more than 6 times of PE, the correlation is certain and significant.
By adding and submitting PE from coefficient of correlation, we can find out the upper
and lower limits within which the population coefficient of correlation may be expected to lie.
Uses of PE:
1) PE is used to determine the limits within which the population coefficient of correlation
may be expected to lie.
2) It can be used to test whether the value of correlation coefficient of a sample is significant
with that of the population
If r = 0.6 and N = 64, find out the PE and SE of the correlation coefficient. Also determine
the limits of population correlation coefficient.
Sol: r = 0.6
N=64
SE =
= = = 0.08
P.E = 0.6745
= 0.05396
= 0.6 0.05396
= 0.54604 to 0.6540
Qn. 2 r and PE have values 0.9 and 0.04 for two series. Find n.
Sol: PE = 0.04
0.6745 0.04
0.0593 = 0.19
N = 3. = 10. 266
N = 10
1. This is the most widely used algebraic method to measure coefficient of correlation.
2. It gives a numerical value to express the relationship between variables
3. It gives both direction and degree of relationship between variables
4. It can be used for further algebraic treatment such as coefficient of determination
coefficient of non-determination etc.
5. It gives a single figure to explain the accurate degree of correlation between two variables
The correlation coefficient obtained from ranks of the variables instead of their
quantitative measurement is called rank correlation. This was developed by Charles Edward
Spearman in 1904.
N = number of pairs
Qn: Find the rank correlation coefficient between poverty and overcrowding from the
information given below:
Town: A B C D E F G H I J
Poverty: 17 13 15 16 6 11 14 9 7 12
Over crowing: 36 46 35 24 12 18 27 22 2 8
Sol: Here ranks are not given. Hence we have to assign ranks
R = 1-
N = 10
A 17 36 1 2 1 1
B 13 46 5 1 4 16
C 15 35 3 3 0 0
D 16 24 2 5 3 9
E 6 12 10 8 2 4
F 11 18 7 7 0 0
G 14 27 4 4 0 0
H 9 22 8 6 2 4
I 7 2 9 10 1 1
J 12 8 6 9 3 9
44
R=1-
= 1-
=
1 - 0.2667
+
= 0.7333
Qn:- Following were the ranks given by three judges in a beauty context. Determine which pair
of judges has the nearest approach to Common tastes in beauty.
Judge I: 1 6 5 10 3 2 4 9 7 8
Judge I: 3 5 8 4 7 10 2 1 6 9
Judge I: 6 4 9 8 1 2 3 10 5 7
R = 1=
N= 10
R=1-
= 1-
= 1- 1.2121
= - 0.2121
= 1-
= - 0.297
= 1- 0.364
= +0.636
R = 0.2
R = 1- = 0.2
= 90 0.8= 792
Correct = = 132 - +
= 164
Correct R 1=
= 1-
= 1-
= 1 - 0.9939
= 0.0061
Qn: The coefficient of rank correlation between marks in English and maths obtained by a
group students is 0.8. If the sum of the squares of the difference in ranks is given to be
33, find the number of students in the group.
ie, 1- = 0.8
1-08 =
N3 - N =
N = 10
Suppose 4 observations got 6th rank, here we have to assign the rank, 7.5 (ie.
to each of the 4 observations.
When there is equal ranks, we have to apply the following formula to compute rank
correlation coefficient:-
R= 1-
X: 68 64 75 50 64 80 75 40 55 64
Y: 62 58 68 45 81 60 68 48 50 70
Here, ranks are not given we have to assign ranks Further, this is the case of equal ranks.
R= 1-
R= 1-
R=1-
= 1-
=1-
=1-
= 1- = 1 - 0.4545
= 0.5455
r=
r=
If 2c
Qn:- Calculate coefficient if correlation by concurrent deviation method:-
Year : 2003 2004 2005 2006 2007 2008 2009 2010 2011
Supply : 160 164 172 182 166 170 178 192 186
Price : 292 280 260 234 266 254 230 190 200
r=
= == = -1
CHAPTER - 3
REGRESSION ANALYSIS
Introduction:-
Correlation analysis analyses whether two variables are correlated or not. After having
established the fact that two variables are closely related, we may be interested in estimating the
value of one variable, given the value of another. Hence, regression analysis means to analyse the
average relationship between two variables and thereby provides a mechanism for estimation or
predication or forecasting.
The term Regression was firstly used by Sir Francis Galton in 1877. The dictionary
meaning of the term regression is stepping back to the average.
Definition:
Regression is the measure of the average relationship between two or more variables in
terms of the original units of the date.
Regression analysis is an attempt to establish the nature of the relationship between
variables-that is to study the functional relationship between the variables and thereby provides a
mechanism for prediction or forecasting.
It is clear from the above definitions that Regression Analysis is a statistical device with
the help of which we are able to estimate the unknown values of one variable from known values
of another variable. The variable which is used to predict the another variable is called
independent variable (explanatory variable) and, the variable we are trying to predict is called
dependent variable (explained variable).
The dependent variable is denoted by X and the independent variable is denoted by Y.
The analysis used in regression is called simple linear regression analysis. It is called
simple because three is only one predictor (independent variable). It is called linear because, it is
assumed that there is linear relationship between independent variable and dependent variable.
Types of Regression:-
There are two types of regression. They are linear regression and multiple regression.
Linear Regression:
It is a type of regression which uses one independent variable to explain and/or predict the
dependent variable.
Multiple Regression:
It is a type of regression which uses two or more independent variable to explain and/or
predict the dependent variable.
x = a + by
Normal equations are:
b = = 0.83
a = = 10.12
Substitute a = 10.12 and b =0.83 in regression equation:
700 b = 840
b= = 1.2
a= = -12
4 14 16 196 56
12 4 144 16 48
8 2 64 4 16
6 2 36 4 12
4 4 16 16 16
4 6 16 36 24
16 4 256 16 64
8 12 64 144 96
Regression equation y on x
y = a + bx
Normal equations are:
48 = 8a + 62 b (1)
e
-1052 b = 320
b=
a = = 8.36
If x = 20, then,
y=8.36 (0.3042x20)
= 8.36 6.084
= 2.276
eq
144 b = -40
b= = - 0.2778
Substitute b = -0.2778 in equation (1)
62 = 8a + (48
62 = 8a + -13.3344
62+13.3344 = 8 a
8a = 75.3344
a= = 9.4168
x = 6.6388
x - = bxy (y - )
i.e x - =
y - = byx (x - )
i.e y - =
4. If x and are the same, then the regression coefficient and correlation coefficient will be
the same.
Regression coefficient y on x (b yx ) =
Where x = x
y = y-
Regression Coefficient x on y
(bxy )
Regression Coefficient y on x
(byx)
Qn:- Following information is obtained from the records of a business organization:-
Sales ( in 000): 91 53 45 76 89 95 80 65
Advertisement Expense
( in 000) 15 8 7 12 17 25 20 13
You are required to:-
1. Compute regression coefficients under 3 methods
2. Obtain the two regression equations and
3. Estimate the advertisement expenditure for a sale of Rs. 1,20,000
Let x = sales
y = Advertisement expenditure
x y x- y- xy x2 y2
91 15 16.75 0.375 6.28 280.56 0.14
= = = 74.25
= = = 14.625
Regression coefficient x on y
=
= 2.61
Regression coefficient Y on X
(
=
= 0.28
Regression Coefficient x on y
(bxy)
=
=
= 2.61
Regression coefficient y on x
(byx)
=
=
= 0.28
x y xy x2 y2
53 8 424 2809 64
45 7 315 2025 49
Regression coefficient x on y
(bxy)
= 5294
2031
= 2.61
Regression coefficient y on x
(byx) =
= 0.28
3) a) Regression equation X on Y:
b) Regression equation y pm x:
y =14.625 = 0.28
y = -6.165+0.28
y =0.28
Mean
x- = bxy (y- )
x- =
x - 65 = 0.8 (y-67)
x - 65 = 0.5714 (y-67)
x - 65 = 0.5714y-38.2838
x = 65 38.2838+0.5714y
x = 26.72 + 0.5714y
Regression equation y on x is:
y- = byx (x- )
y- =
y - 67 = 0.8 (x-65)
y - 67 = 1.12 (x-65)
y = 67 (1.12 x65) = 1.12 x
y = 67.72.8 + 1.12
= 20, = 4, r = 0.7
= 15, =3
Obtain regression lines and find the most likely value of y when x=24
Sol: Regression Equation x on y is
(x - ) = bxy (y - )
(x - ) =
(x - 20) = (y-15)
(x - 20) = 0.93(y-15)
x = 20 + 0.93y 13.95
x = 20 - 13.95 + 0.93y
x = 6.05 + 0.93y
Regression Equation y on x is
(y - ) = byx (x - )
(y - ) =
(y - ) = 0.7 (x - 20)
y 15 = 0.525 -10.5
y = 15-10.5+ 0.525x
y =4.5+0.525x
If X = 24, then
y = 4.5 + (0.525×24)
Qn: For a given set of bivariate data, the following results were obtained:
Find the most probable value of y when x = 60. Also find r.
Sol: Regression Equation y on x is:
(y - ) = (x- )
y = 79.8+27.9 1.5
y = 107.7 1.5
If x = 60, then
=- =- = -0.5477
Correlation Regression
2 It is not used for prediction purposes It is basically used for prediction purposes
CHAPTER - 4
THEORY OF PROBABILITY
INTRODUCTION
Probability refers to the chance of happening or not happening of an event. In our day
today conversations, we may make statements like probably he may get the selection, possibly
the Chief Minister may attend the function, etc. Both the statements contain an element of
uncertainly about the happening of the even. Any problem which contains uncertainty about the
happening of the event is the problem of probability.
Definition of Probability
The probability of given event may be defined as the numerical value given to the likely
hood of the occurrence of that event. It is a number lying between 0 and 1 0 denotes the
even which cannot occur, and 1 denotes the event which is certain to occur. For example, when
we toss on a coin, we can enumerate all the possible outcomes (head and tail), but we cannot say
which one will happen. Hence, the probability of getting a head is neither 0 nor 1 but between 0
and 1. It is 50% or ½
Sample Point
Every indecomposable outcome of a random experiment is called a sample point. It is
also called simple event or elementary outcome.
Eg. When a die is thrown, getting 3 is a sample point.
Sample space
Sample space of a random experiment is the set containing all the sample points of that
random experiment.
Eg:- When a coin is tossed, the sample space is (Head, Tail)
Event
An event is the result of a random experiment. It is a subset of the sample space of a
random experiment.
Sure Event (Certain Event)
An event whose occurrence is inevitable is called sure even.
Eg:- Getting a white ball from a box containing all while balls.
Quantitative Techniques for Business 49
School of Distance Education
Impossible Events
An event whose occurrence is impossible, is called impossible event. Eg:- Getting a white ball
from a box containing all red balls.
Uncertain Events
An event whose occurrence is neither sure nor impossible is called uncertain event.
Eg:- Getting a white ball from a box containing white balls and black balls.
Exhaustive Events:
A group of events is said to be exhaustive when it includes all possible outcomes of the random
experiment under consideration.
Dependent Events:
Two or more events are said to be dependent if the happening of one of them affects the
happening of the other.
PERMUTATIONS
Permutation means arrangement of objects in a definite order. The number of arrangements
(permutations) depends upon the total number of objects and the number of objects taken at a time
for arrangement.
The number of permutations is calculated by using the following formula:
Factorial
=n!
Question:-
A factory manager purchased 3 new machines, A, B and C. How many number of times he can
arrange the 3 machines?
Solution :
r= 3
Here ‘n’ and ‘r’ are same
= 3! = 3 × 2 × 1 = 6.
Question :
In how many ways 3 people be seated on a bench if only two seats are available.
Solution
r= 2
= = = 6 ways
ie ,
Question:
Find the number of permutations of letters in the word COMMUNICATION
Solution
C = 2, 0 = 2; M = 2 ; U = 1; N = 2;
1 = 2; A =1; T=1
= 194594400 times
COMBINATIONS
Combination means selection or grouping of objects without considering their order. The number
of combinations is calculated by using the following formula:
=
Question
A basket contains 10 mangoes. In how many ways 4 mangoes from the basket can be selected?
Solution
n = 10
r=4
= = =
= 210 ways
n = 20
r=5
= = =
= 15504 Sets
P(A) =
P(A) =
Question
What is the chance of getting a head when a coin is tossed?
Question
A die is thrown. Find the probability of getting.
(1) A 4
(2) an even number
(3) 3 or 5
(4) less than 3
Solution
Sample space is (1,2, 3, 4, 5, 6)
(1) Probability (getting 4) =
Question
A ball is drawn from a bag containing 4 white, 6 black and 5 yellow balls. Find the
probability that a ball drawn is :-
(1) White (2) Yellow (3) Black (4) Not yellow (5) Yellow or white
Solution
Question
There are 19 cards numbered 1 to 19 in a box. If a person drawn one at random, what is
the probability that the number printed on the card be an even number greater than 10?
Quantitative Techniques for Business 54
School of Distance Education
Solution
The even numbers greater than 10 are 12, 14, 16 and 18.
=
P (drawing a card with an even number greater than 10)
Question
Two unbiased dice are thrown. Find the probability that :-
(a) Both the dice show the same number
(b) One die shows 6
(c) First die shows 3
(d) Total of the numbers on the dice is 9
(e) Total of the numbers on the dice is greater than 8
(f) A sum of 11
Solution
When 2 dice are thrown the sample space consists of the following outcomes :-
(1,1) (1,2) (1,3) (1,4) (1,5) (1,6)
(2,1) (2,2) (2,3) (2,3) (2,5) (2,6)
(3,1) (3,2) (3,3) (3,4) (3,5) (3,6)
(4,1) (4,2) (4,3) (4,4) (4,5) (4,6)
(5,1) (5,2) (5,3) (5,4) (5,5) (5,6)
(6,1) (6,2) (6,3) (6,4) (6,5) (6,6)
(a) P(that both the dice shows the same number) = =
= =
Question
What is the probability of getting 3 red balls in a draw of 36 balls from a bag containg 5
red balls and 46 black balls?
Solution
= =
= = =
Question
A committee is to be constituted by selecting three people at random from a group
consisting of 5 Economists and 4 Statisticians. Find the probability that the committee will
consist of :
Quantitative Techniques for Business 56
School of Distance Education
(a)3 Economists
(b) 3 Statisticians
(c)3 Economists and 1 Statistician
(d) 1 Economist and 2 Statistician
= =
= =
= =
= =
(d) P(Selecting 1 Economist
and 2 Statistician =
= = =
Questions
A committee of 5 is to be formed from a group of 8 boys and 7 girls. Find the probability that the
committee consists of at least one girl.
Solution
P(one girl & 4 boys) or P(2 girls & 3 boys)
P (that committee consists of
at least one girl) = or P(3 girls & 2 boys) or P(4girls & 1 boy)
or P(5 girls)
= =
=1( )
=1 = 1-
=1-
= =
Question
The compensation received by 1000 workers in a factory are given in the following table :-
Wages: 80-100 100-120 120-140 140-160 160-180 180-200
No. of
10 100 400 250 200 40
Workers:
Find the probability that a worker selected has
(1) Wages under Rs.100/-
(2) Wages above Rs.140/-
(3) Wages between Rs. 120/- and Rs.180/-
Solution
THEOREMS OF PROBABILITY
There are two important theorems of probability. They are :
1. Addition Theorem
2. Multiplication Theorem
Addition Theorem
Here, there are 2 situations.
(a) Events are mutually exclusive
(b) Events are not mutually exclusive
Solution
Here the events are mutually exclusive
Question
The probability that a contractor will get a plumping contract is and the probability that
he will not get an electric contract is . If the probability of getting at least one contract is , what
is the probability that he will get both the contracts?
Solution
= =
(a) P(A B)
(b) P(A)
MULTIPLICATION THEOREM
Here there are two situations:
(a) Events are independent
(b) Events are dependent
Question
Single coin is tossed for three tones. What is the probability of getting head in all the 3 times?
Solution
P(getting head in all the 3 times) = P(getting H in 1 st toss) × P(getting Head in 2nd toss) ×
P(getting H in 3rd toss)
= × ×
P(A B) = P(A).P(B/A)
Question
A bag contains 5 white balls and 8 black balls. One ball is drawn at random from the bag.
Again, another one is drawn without replacing the first ball. Find the probability that both the
balls drawn are white.
Solution
Question
The probability that A solves a problem in Maths is and the probability that B solves it is .
If they try independently find the probability that :-
(i) Both solve the problem.
(ii) at least one solve the problem.
(iii) none solve the problem.
Solution
(i) P(that both solve the problem) = P(that A solves the problem)×
P(that B solves the problem)
= × = =
= + -( × )
= + -
= =
(iii) P(that none solve the problem) =1-P(at least one solve the problem)
= 1 P(A or B solve the problem)
=1- [P(A solve the problem +
P(B solve the problem) -
P(A & B solve the problem)]
= 1-
= 1-( )
Quantitative Techniques for Business 64
School of Distance Education
= 1- ( )
=1- = =
Question
A university has to select an examiner from a list of 50 persons. 20 of them are women
and 30 men. 10 of them know Hindi and 40 do not. 15 of them are teachers and remaining are
not. What is the probability that the university selecting a Hindi knowing woman teacher?
Solution
Here the events are independent.
P(selecting Hindi knowing = P(selecting Hindi knowing person,
Woman teacher) woman and teacher
P(selecting Hindi Knowing persons =
P(selecting woman) =
P (selecting teacher) =
= =
Question
A speaks truth in 70%cases and B in 85% cases. In what percentage of cases they likely to
contradict each other in stating the same fact?
Let P(A) = Probability that A speaks truth.
P (A and B contradict each other) = P(A speaks truth and B does not
OR, A does not speak truth & B
speaks
=P(A & B) (A &B)
=(0.7 ×0.15) + (0.3 ×0.85)
=0.105 + 0.255
= 0.360
Percentage of cases in in
which A and B contradict = 0.360 ×100
each other = 36 %
Question
20% of students in a university are graduates and 80% are undergraduates. The
probability that graduate student is married is 0.50 and the probability that an undergraduate
student is married is 0.10. If one student is selected at random, what is the probability that the
student selected is married?
P(selecting a married student) = P(selecting a graduate married
student or selecting an undergraduate
married student)
= P(Selecting a graduate & married OR
selecting un undergraduate & married)
= (20% ×0.50) + (80% ×0.10)
= ( × 0.50) + ( × 0.10)
Question
A certain player say Mr. X is known to win with possibility 0.3 if the truck is fast and 0.4 if the
track is slow. For Monday there is a 0.7 probability of a fast track and 0.3 probability of a slow
track. What is the probability that Mr. X will win on Monday?
Solution
P(X will won on Monday) = P(to win in fast track OR to win in slow track)
= P (to get fast track & to win
OR
to get slow track & to win)
= (0.7 × 0.3) + (0. 3 × 0. 4)
= 0.21 + 0.12
= 0.33
CONDITIONAL PROBABILITY
Multiplication theorem states that if two events, A and B, are dependent events then, the
probability of happening both will be the product of P(A) and P(B/A).
i.e., P(A and B) or
= P(A).P(B/A)
P(A B)
Here, P (B/A) is called Conditional probability
P(A B) = P(A).P( )
P( ) =
Similarly, P( )) =
If 3 event, A, B and C and dependent events, then the probability of happening A, B and C is :-
P( ) =
Question
(a) P(A/B)
(b) P(B/A)
Solution
Here we know the events are dependent
(a)P(A/B) = = = × = =
(b) P(B/A) =
= = × = =
Inverse Probability
If an event has happened as a result of several causes, then we may be interested to find out the
probability of a particular cause of happening that events. This type of problem is called inverse
probability.
Bayes theorem is based upon inverse probability.
BAYES THEOREM:
Bayes theorem is based on the proposition that probabilities should revised on the basis of
all the available information. The revision of probabilities based on available information will
help to reduce the risk involved in decision-making. The probabilities before revision is called
priori probabilities and the probabilities after revision is called posterior probabilities.
P(A/B) =
Steps in computation
1. Find the prior probability
2. Find the conditional probability.
3. Find the joint probability by multiplying step 1 and step 2.
4. Find posterior probability as percentage of total joint probability.
Question
A manufacturing firm produces units of products in 4 plants, A, B, C and D. From the past
records of the proportions of defectives produced at each plant, the following conditional
probabilities are set:-
A: 0.5; B: 0.10; C:0.15 and D:0.02
The first plant produces 30% of the units of the output, the second plant produces 25%, third 40%
and the forth 5%
A unit of the products made at one of these plants is tested and is found to be defective. What is
the probability that the unit was produced in Plant C.
Solution
0.101 1.0000
==== =====
Question
There are two Urns, one containing 5 white balls and 4 black balls; and the other
containing 6 white balls and 5 black balls. One Urn is chosen and one ball is drawn. If it is white,
what is the probability that the Urn selected is the first?
Solution
Probability of
drawing white
No. of ball Conditional Joint Posterior
Urn Probabilities Probability Probability
(Priori
Probability)
= 0.2727 = 0.4954
IInd
0.5505 1.00
CHAPTER - 5
PROBABILITY DISTRIBUTION
(THEORETICAL DISTRIBUTION)
DEFINITION
Probability distribution (Theoretical Distribution) can be defined as a distribution obtained
for a random variable on the basis of a mathematical model. It is obtained not on the basis of
actual observation or experiments, but on the basis of probability law.
Random variable
Random variable is a variable who value is determined by the outcome of a random
experiment. Random variable is also called chance variable or stochastic variable.
For example, suppose we toss a coin. Obtaining of head in this random experiment is a random
variable. Here the random variable of obtaining heads can take the numerical values.
Now, we can prepare a table showing the values of the random variable and corresponding
probabilities. This is called probability distributions or theoretical distribution.
In the above, example probability distribution is :-
0
1
P(X) = 1
1. Every value of probability of random variable will be greater than or equal to zero.
i.e., P(X) 0
i.e., P(X) Negative value
X: 0 1 2 3 4
Solution
Here all values of P(X) are more that zero; and sum of all P(X) value is equal to 1
Since two conditions, namely P(X) 0 and P(X) = 1, are satisfied, the given distribution is a
probability distribution.
MATHEMATICAL EXPECTATION
(EXPECTED VALUE)
If X is a random variable assuming values , , , , with corresponding
probabilities , , , , then the operation of X is defined as + +
+ +
E(X) =
Question
A petrol pump proprietor sells on an average Rs. 80,000/- worth of petrol on rainy days
and an average of Rs. 95.000 on clear days. Statistics from the meteorological department show
that the probability is 0.76 for clear weather and 0.24 for rainy weather on coming Wednesday.
Find the expected value of petrol sale on coming Wednesday.
Expected Value
=
E(X)
= (80.000 × 0.24)+(95000 × 0.76)
= 19.200 + 72.200
= Rs. 91,400
Question
There are three alternative proposals before a business man to start a new project:-
Proposal I: Profit of Rs. 5 lakhs with a probability of 0.6 or a loss of Rs. 80,000 with
a probability of 0.4.
Proposal II: Profit of Rs. 10 laksh with a probability of 0.4 or a loss of Rs. 2 lakhs
with a probability of 0.6
Proposal III: Profit of Rs. 4.5 lakhs with a probability of 0.8 or a loss of Rs. 50,000
with a probability of 0.2
If he wants to maximize profit and minimize the loss, which proposal he should prefer?
Solution
Here, we should calculate the mathematical expectation of each proposal.
Probability Distribution
Normal
Binomial Poisson
Distribution
Distribution distribution
CHAPTER - 6
BIONOMIAL DISTRIBUTION
Meaning & Definition:
Binomial Distribution is associated with James Bernoulli, a Swiss Mathematician.
Therefore, it is also called Bernoulli distribution. Binomial distribution is the probability
distribution expressing the probability of one set of dichotomous alternatives, i.e., success or
failure. In other words, it is used to determine the probability of success in experiments on which
there are only two mutually exclusive outcomes. Binomial distribution is discrete probability
distribution.
Binomial Distribution can be defined as follows: A random variable r is said to follow
Binomial Distribution with parameters n and p if its probability function is:
P(r) = nC r prqn-r
Where, P = probability of success in a single trial
q=1p
n = number of trials
r = number of success in n trials.
Assumption of Binomial Didstribution OR
(Situations where Binomial Distribution can be applied)
Binomial distribution can be applied when:-
1. The random experiment has two outcomes i.e., success and failure.
2. The probability of success in a single trial remains constant from trial to trial of the
experiment.
3. The experiment is repeated for finite number of times.
4. The trials are independent.
Properties (features) of Binomial Distribution:
1. It is a discrete probability distribution.
2. The shape and location of Binomial distribution changes as p changes for a given n.
3. The mode of the Binomial distribution is equal to the value of r which has the largest
probability.
4. Mean of the Binomial distribution increases as n increases with p remaining
constant.
5. The mean of Binomial distribution is np.
6. The Standard deviation of Binomial distribution is
7. If n is large and if neither p nor q is too close zero, Binomial distribution may be
approximated to Normal Distribution.
8. If two independent random variables follow Binomial distribution, their sum also
follows Binomial distribution.
Quantitative Techniques for Business 75
School of Distance Education
Qn: Six coins are tossed simultaneously. What is the probability of obtaining 4 heads?
Sol: P(r) = nC r prqn-r
r=4
n=6
p=½
q=1p=1½=½
4 6-4
p( r = 4) = 6C4 ( ½ ) ( ½ )
= x (½)4+2
= x (½)6
= x
=
= 0.234
Qn: The probability that Sachin Tendulkar scores a century in a cricket match is . What is
the probability that out of 5 matches, he may score century in : -
(1) Exactly 2 matches
(2) No match
Sol: Here p =
q=1- =
P(r) = nC r prqn-r
Probability that Sachin scores centuary in exactly 2 matches is:
2 3
p (r = 2) = 5 C2
= x x
= x x
= = 0.329
= x1x
=
= 0.132
Qn: Consider families with 4 children each. What percentage of families would you expect to
have :-
(a) Two boys and two girls
(b) At least one boy
(c) No girls
(d) At the most two girls
Sol: p (having a boy) =
p (having a girl) =
n=4
a) P (getting 2 boys & 2 girls) = p (getting 2 boys) = p (r = 2)
P(r) = nC r prqn-r
2 4-2
p (r = 2) = 4 C2
2 2
= x X
2+2
= x
4
= X
= X
= =
= 6 X =
3 4-3
p (r = 3) = 4 C3
4
=4X =4 X =
4 4-4
p (r = 4) = 4 C4
4 0
= 4 C4
4+0 4
=1X =1X
=1 X =
= x
=
4 4-4
p (r = 4) = 4 C4
4 0
= 4 C4
4
= 1X
i.e. =q
q = =
= x = =
q =
p =1q
=1- =
np =4
nx =4
n =4
n =4x =6
Qn: For a Binomial Distribution, mean is 6 and Standard Deviation is . Find the parameters.
Standard Deviation ( ) =
npq = 2
q =
p=1q
=1- =
np = 6
nx =6
n= =6x =9
Value of parameters:
p= q= n=9
p (r = 2) = 0.2048
r n-r
p (r = 1) = nC r p q = 5 C1 p1q5-1 = 5pq4 = 0.4096
r n-r
p (r = 2) = nC r p q = 5 C2 p2q5-2 = 10p2q3 = 0.2048
i.e., =
4p = q
4p = 1 p
4p + p = 1
5p = 1
p=
Steps:
1. Find the value of n, p and q
2. Substitute the values of n, p and q in the Binomial Distribution function of nC r prqn-r
3. Put r = 0, 1, 2, .. in the function nC r prqn-r
4. Multiply each such terms by total frequency (N) to obtain the expected frequency.
Qn: Eight coins were tossed together for 256 times. Fit a Binomial Distribution of getting
heads. Also find mean and standard deviation.
q=1- =
n=8
Standard Deviation =
= = = 1.4142
CHAPTER - 7
POISSON DISTRIBUTION
Meaning and Definition:
p (r) =
Sol: p (r) =
m=3
p (r = 4) = =
= 0.16804
Qn: It is known from the past experience that in a certain plant, there are on an average four
industrial accidents per year. Find the probability that in a given year there will be less
than four accidents. Assume poisson distribution.
Sol: p (r<4) = p(r = 0 or 1 or 2 or 3)
= p (r = 0) + p (r =1) + p (r = 2) + p (r = 3)
P (r) =
m=4
p (r = 0) = = = 0.01832
p (r = 1) = = = 0.07328
p (r = 2) = = = 0.14656
p (r = 3) = = = 0.19541
p (r) =
p (r = 0) = = = 0.36788
Number of lots containing no defectives if there are 1000 lots = 0.36788 x 1000
= 367.88
= 368
Qn: In a factory manufacturing optical lenses, there is a small chance of for any one lense
to be defective. The lenses are supplied in packets of 10. Use Poisson Distribution to
calculate the approximate number of packets containing (1) one defective (2) no defective
in a consignment of 20,000 packets. You are given that e -0.02 = 0.9802.
Sol: n = 10
p = probability of manufacturing defective lense = = 0.002
m = np = 10 x 0.002 = 0.02
p (r) =
p (r = 1) = = = 0.019604
p (r = 0) = = = 0.9802
Sol: p (r) =
= = = 0.99
m = 0.99
Poisson Distribution =
3 100 x 0.0601 = 6
= 0.0601
CHAPTER - 8
NORMAL DISTRIBUTION
The normal distribution is a continuous probability distribution. It was first developed by
De-Moivre in 1733 as limiting form of binomial distribution. Fundamental importance of normal
distribution is that many populations seem to follow approximately a pattern of distribution as
described by normal distribution. Numerous phenomena such as the age distribution of any
species, height of adult persons, intelligent test scores of students, etc. are considered to be
normally distributed.
A continuous random variable X is said to follow Normal Distribution if its probability
function is:
P (X) = xe ( )2
= 3.146
e = 2.71828
1. T
The discretee probabiliity distribu utions such h as Bino omial Distrribution an nd Poissonn
D
Distribution tend to normmal distribuution as n becomes laarge.
2. Almost
A all sampling
s distributions conform to o the normaal distributio on for largee values off
n
n.
3. Many
M tests of
o significaance are bassed on the assumption n that the parent popullation from
w
which samplles are draw wn follows normal
n distrribution.
4. The
T normal distribution
d n has numerrous mathem matical propperties whicch make it popular
p andd
co
omparativelly easy to manipulate.
m
5. Normal
N distrribution findds applicatio
ons in Statisstical Qualitty Control.
6. Many
M distribbutions in soocial and ecconomic daata are apprroximately normal.
n Foor example,
biirth, death, etc. are norrmally distriibuted.
Area und
der Standaard Normall Curve
In
n case of noormal distribution, prob bability is determined
d on the basiis of area. But to findd
out the arrea we havee to calculatte the ordinaate of 2 sccale.
Z=
Qn: Fin
nd p(z > 1.88)
Sol: z > 1.8 meanss the area abbove 1.8; i.ee., the area to
t the right of 1.8
T
Total area onn the right side = 0.5
= 0.0359
9
p (z
( > 1.8) = 0.0359
0
T
Total area onn the left sidde = 0.5
= 0.0668
T
Table value of
o 1.96 (Areea between 0 and 1.96)) = 0.4750
T
Total area onn the left sidde of normaal curve = 0..5
= 0.9750
0
Sol: -1
1.78 < z < 1.778 means the entire area beetween -1.78 and +1.78
T
Table value off 1.78 (Area between
b 0 and
d 1.78) = 0.46
625
= 0.9250
Sol: 1..52 < z < 2.011 means areaa between 1.52 and 2.01
T
Table value off 1.52 (Area between
b 0 and
d 1.52) = 0.43
357
T
Table value off 2.01 (Area between
b 0 and
d 2.01) = 0.47
778
Sol: -1
1.52 < z < -00.75 means area betweeen -1.52 an
nd -0.75
T
Table value of
o 0.75 (Areea between 0 and -0.75
5) = 0.2734
T
Table value of
o 1.52 (Areea between 0 and -1.52
2) = 0.4357
= 0.1623
G
Given = 688.22 inches
V
Varriance = 10.8 inchess
X = 72
7 inches
Z=
= = = 1.15023
T
Table value of
o 1.15 = 0.3749
A above 1.15
Area 1 (abovee 6 feet) = 0.5 0.3749
= 0.125
0
Number of
o soldiers who
w have ov
ver 6 feet talll out of 100
00 = 0.125 x 1000 = 12
25
= 42
= 24
X = 58
Z=
= = = 0.667
0
Number of
o students whose
w scoree exceed 58 = 0.2514 x 1000
= 251.4
= 251 stud
dents
(b) Given
n N = 1000
1
= 42
4
= 244
X1 = 30
X2 = 66
Z=
Z1 = = = = - 0.5
Z2 = = = =1
T
Table value of
o 0.5 (Areaa between 0 and -0.5) = 0.1915
T
Table value of
o 1 (Area between
b 0 and
a +1) = 0.3413
Area betw
ween -0.5 annd +1 = 0.19
915 + 0.341
13
= 0.5
5328
Number of
o students whose
w scoree lie between 30 and 66
6 = 0.5328 x 1000
= 532.8
= 533 stu
udents
Procedure :
1. Find the mean and standard deviation of the given distribution. (i.e., and )
2. Take the lower limit of each class.
3. Find Z value for each of the lower limit.
Z=
4. Find the area for z values from the table. The first and the last values are taken as 0.5.
5. Find the area for each class. Take difference between 2 adjacent values if same signs
and take total of adjacent values if opposite signs.
6. Find the expected frequency by multiplying area for each class by N.
No. of students : 4 22 48 66 40 16 4
Sol:
= Assumed mean + +C
= 35 + x 100
= 35 + 9
= 44
= xC
= x 10
= x 10
= x 10
= x 10
= 1.245 x 10 =
CHAPTER - 9
TESTING OF HYPOTHESIS
Statistical Inference:
Statistical inference refers to the process of selecting and using a sample statistic to draw
conclusions about the population parameter. Statistical inference deals with two types of
problems.
They are:-
1. Testing of Hypothesis
2. Estimation
Hypothesis:
Hypothesis is a statement subject to verification. More precisely, it is a quantitative
statement about a population, the validity of which remains to be tested. In other words,
hypothesis is an assumption made about a population parameter.
Testing of Hypothesis:
Testing of hypothesis is a process of examining whether the hypothesis formulated by the
researcher is valid or not. The main objective of hypothesis testing is whether to accept or reject
the hypothesis.
Procedure for Testing of Hypothesis:
The various steps in testing of hypothesis involves the following :-
1. Set Up a Hypothesis:
The first step in testing of hypothesis is to set p a hypothesis about population parameter.
Normally, the researcher has to fix two types of hypothesis. They are null hypothesis and
alternative hypothesis.
Null Hypothesis:-
Null hypothesis is the original hypothesis. It states that there is no significant
difference between the sample and population regarding a particular matter under
consideration. The word null means invalid of void or amounting to nothing. Null
hypothesis is denoted by Ho. For example, suppose we want to test whether a medicine is
effective in curing cancer. Hence, the null hypothesis will be stated as follows:-
H0: The medicine is not effective in curing cancer (i.e., there is no significant
difference between the given medicine and other medicines in curing cancer
disease.)
Alternative Hypothesis:-
Any hypothesis other than null hypothesis is called alternative hypothesis. When a null
hypothesis is rejected, we accept the other hypothesis, known as alternative
hypothesis. Alternative hypothesis is denoted by H1. In the above example, the
alternative hypothesis may be stated as follows:-
Quantitative Techniques for Business 94
School of Distance Education
H1: The medicine is effective in curing cancer. (i.e., there is significant difference
between the given medicine and other medicines in curing cancer disease.)
Sampling Distribution
The distribution of all possible values which can be assumed by some statistic, computed
from samples of the same size randomly drawn from the same population is called Sampling
distribution of that statistic.
Standard Error (S.E)
Standard Error is the standard deviation of the sampling distribution of a statistic.
Standard error plays a very important role in the large sample theory. The following are the
important uses of standard errors:-
1. Standard Error is used for testing a given hypothesis
Test Statistic
The decision to accept or to reject a null hypothesis is made on the basis of a
statistic computed from the sample. Such a statistic is called the test statistic. There are different
types of test statistics. All these test statistics can be classified into two groups. They are
a. Parametric Tests
b. Non-Parametric Tests
PARAMETRIC TESTS
The statistical tests based on the assumption that population or population parameter is
normally distributed are called parametric tests. The important parametric tests are:-
1. z-test
2. t-test
3. f-test
Z-test:
Z-test is applied when the test statistic follows normal distribution. It was
developed by Prof.R.A.Fisher. The following are the important uses of z-test:-
1. To test the population mean when the sample is large or when the population
standard deviation is known.
2. To test the equality of two sample means when the samples are large or when the
population standard deviation is known.
3. To test the population proportion.
4. To test the equality of two sample proportions.
5. To test the population standard deviation when the sample is large.
6. To test the equality of two sample standard deviations when the samples are large or when
population standard deviations are known.
7. To test the equality of correlation coefficients.
Z-test is used in testing of hypothesis on the basis of some assumptions. The important
assumptions in z-test are:-
1. Sampling distribution of test statistic is normal.
Quantitative Techniques for Business 96
School of Distance Education
2. Sample statistics are dose the population parameter and therefore, for finding standard
error, sample statistics are used in place where population parameters are to be used.
T-test:
t-distribution was originated by W.S.Gosset in the early 1900.
t-test is applied when the test statistic follows t-distribution.
Uses of t-test are:-
1. To test the population mean when the sample is small and the population s.D.is unknown.
2. To test the equality of two sample means when the samples are small and population S.D.
is unknown.
3. To test the difference in values of two dependent samples.
4. To test the significance of correlation coefficients.
The following are the important assumptions in t-test:-
1. The population from which the sample drawn is normal.
2. The sample observations are independent.
3. The population S.D.is known.
4. When the equality of two population means is tested, the samples are assumed to be
independent and the population variance are assumed to be equal and unknown.
F-test:
F-test is used to determined whether two independent estimates of population
variance significantly differ or to establish both have come from the same population. For
carrying out the test of significance, we calculate a ration, called F-ratio. F-test is named in
honour of the great statistician R.A.Fisher. It is also called Variance Ration Test.
F-ratio is defined as follows:-
F=
where =
While calculating F-ratio, the numerator is the greater variance and denominator is the smaller
variance. So,
F = Greater Variance
Smaller Variance
Assumptions of F-distribution:-
1. The values in each group are normally distributed.
2. The variance within each group should be equal for all groups. ( )
3. The error (Variation of each value around its own group mean) should be
independent for each value.
Type I Error
The error committed by rejecting a null hypothesis when it is true, is called Type I error.
The probability of committing Type I error is denoted by (alpha).
Type II Error
The error committed by accepting a null hypothesis when it is false is called Type II error.
The probability of committing Type II error is denoted by (beta).
= Prob. (Type II error)
= Prob. (Accepting H0 when it is false)
Degree of freedom
Degree of freedom is defined as the number of independent observations which is obtained
by subtracting the number of constraints from the total number of observations.
Degree of freedom = Total number of observations Number of constraints.
= 1- .
If the calculated value of the test statistic falls in the acceptance region, we accept the null
hypothesis.
TESTIN
NG OF GIIVEN POP
PULATIO
ON MEAN
T test is used
This u to test whether thee given pop
pulation mean is true or
o not. In otther words,
this tesst is used to cheeck wheth her the difference
d between sample mean m andd
populatio
on mean iss significannt or it is only due to sampling fluctuattions. Herre we cann
apply z-test of t-test.
Procedurre:
1. Set the null hypothesis that there is no significant difference betw
ween samplee mean andd
po
opulation mean.
m
H0 : µ = µ0
H1 : µ µ0
2. Decide
D the teest criterionn.
If sam
mple is largee, apply Z-ttest
If sam
mple is smaall, but popu
ulation stand
dard deviatiion is known
n, apply z teest
If sam
mple is smaall and popu
ulation stand
dard deviatio
on is unkno
own, apply t-test.
t
3. Apply
A the formula
Z or t = Differeence
SE
= wherre Sam
mple mean
µ = Population meean
SE = Stanndard Errorr
SE is com
mputed as follows:-
fo
Z = =
SE = = = = 0.5
= = 3.6
The value of z at 5% level of significance for infinity d.f. is 1.96. As the calculated value
is more than the table value , we reject the H0. There is significant difference sample mean and
population mean.
We conclude that the sample has not come from the population with mean 26.8.
Qn. The mean life of 100 bulbs produced by a company is computed to be 1570 hours with
S.D. of 120 hours. The company claims that the average life of bulbs produced by the
company is 1600 hours. Using 5% level of significance, is the claim
acceptable?
Sol: H0 : µ = 1600
H1 : µ 1600
Since sample is large apply z-test.
Z =
SE = = = =
= = 2.5
Table value at 5% level of significance and infinity d.f. is 1.96. As the
calculated value is greater than the table value, we reject the H0. There is significant difference
between mean life of sample and mean life of population.
Companys claim is not acceptable
Qn. The price of shares of a company on the different days in a month were found to be
66,65,69,70,69,71,70,63,64 and 68. Discuss whether mean price of shares in the month is
65.
Sol. H0 : µ = 65
H1 : µ 65
Since small sample, apply t-test.
t =
= =
= = 67.5
= =
= 2.655
S.E =
= = =
= 0.885
= =
= 2.8249
Table value at 5% level of significance and 9 d.f. = 2.262. Calculated value is
more than table value.
We reject the null hypothesis.
We conclude that the mean price of share in the month is not 65/-.
Qn. The mean height obtained from a random sample of 36 children is 30 inches. The
standard deviation of the distribution of height of the population is known to be 1.5 inches.
Test the statement that the mean height of the population is 33 inches at 5% level of
significance. Also set up 99% confidence limits of the mean height of the population.
Sol. H0 : µ = 33
H1 : µ 33
Since small sample is large, apply z-test.
z =
= 30
µ = 33
SE = = = = 0.25
= = = 12
S.E. = = = = 1.875
t = =
SE = = = = = 12.06
= = = 1.24
SE = = = 60.
t = = = 3.33
Table value of t at 5% significance level and 24 d.f. = 1.711
Calculated value is greater than the table value.
We reject the null hypothesis and accept alternative hypothesis. So we conclude that
the new manufacturing process has increased the life of bulbs, i.e, µ = 200.
Z or t = =
SE is computed as follows:
It population S.D. are unknown and samples are large, then assuming
population S.D. are different,
S.E. =
It population S.D. are unknown and samples are small, then assuming
population S.D. are equal,
S.E. =
Z = =
SE =
= = =
= = 0.4
Z = =
SE =
= = =
= = 0.9349.
X Ltd. Y Ltd.
No. of bulbs used 100 100
Mean life in hours 1300 1248
Standard deviation 82 93
Using S.E. of the difference between mean, state whether there is any significant
difference in the life of the two makes.
H0 : µ 1 = µ 2
H1 : µ 1 µ2
Z =
SE =
= = =
= = 12.399.
= .
We reject the H0
So we conclude that there is significant difference in the average life of bulbs of 2 makes.
Qn. The average number of articles manufactured by two machines per day and 200 and 250
with S.D. 20 and 25 respectively on the basis of 25 days production. Can you regard both
the machines are equally efficient at 1% level of significance?.
Sol. H0 : µ 1 = µ 2
H1 : µ 1 µ2
Z =
S.E. =
Degree of freedom = n1 + n2 2
= 25 + 25 2 = 48
Table value of t at 1% level of significance and infinity d.f. = 2.576.
Calculated value is greater than the table value.
We reject the null hypothesis.
i.e, µ1 µ2
So we conclude that the machines are not equally efficient.
Qn: Given below are the gains in weights of dogs on two diets, X and Y
Diet X: 15 22 20 22 18 14 22
Diet Y : 14 24 12 20 32 21 30 20 22 25
Test at 5% level, whether the two diets differ significantly with regard to increase in
weight.
Sol: :
:
Since samples are small, apply t - teat
t=
As mean and S.D. are not given, first we have to find out the same of the 2 groups.
X d (X-20)
15 -5 25
22 2 4
20 0 0
22 2 4
18 -2 4
14 -6 36
22 2 4
d = -7 = 77
=A+
= 20 + = 20 + -7 = 19
= =
= = 3.1623
X d (X-20)
14 -6 36
24 4 8
12 -8 64
20 0 0
32 12 144
21 1 1
30 10 100
20 0 0
22 2 4
25 5 25
d = 20 = 390
=A+
= 20 + = 20 + 2 = 22
= =
= = 5.9161
S.E. =
= =
= = 2.6079
Procedure:
1. Set the null hypothesis that there is no significant difference between two standard
deviations.
:
:
2. Decide the test criterion:
If sample is large, apply Z test
If sample is sample, apply F test
3. Apply the formula:
If Z test:
Z= =
If F test:
For F test:
Sol: :
:
Since samples are large, apply Z - test.
Z=
SE =
= =
= =
= 0.5787
Z = = = 0.8640
Sol: :
:
Since samples are small, apply F-test.
F=
= = = = 13.61
= = = = 9.69
F = = 1,405
Qn: Two random sample were drawn from two normal populations and their
values are :-
A: 66 67 75 76 82 84 88 90 92
B: 64 66 74 78 82 85 87 92 93 95 97
66 -9 81 64 -18 324
67 -8 84 66 -16 256
75 0 0 74 -8 64
76 1 1 78 -4 16
82 7 49 82 0 0
84 9 81 85 3 9
88 13 169 87 5 25
90 15 225 92 10 100
92 17 289 93 11 121
d = 45 = 959 95 13 169
97 15 225
d=11 = 1309
S.D =
SD of sample A = =
= = 9.03
SD of sample B = =
= = 10.86
:
:
Quantitative Techniques for Business 115
School of Distance Education
Since sample are small, apply F-test
F=
= = = = 91.7325
= = = = 129.734
F = = 1.414
CHAPTER 10
NON-PARAMETRIC TESTS
A non-parametric test is a test which is not concerned with testing of parameters. Non-
parametric tests do not make any assumption regarding the form of the population. Therefore,
non-parametric tests are also called distribution free tests.
Following are the important non-parametric tests:-
1. Chi-square test
2. Sign test
3. Signed rank test (Wilcoxon matched pairs test)
4. Rank sum test (Mann-whitney U-test and Kruskal-Wallis H test)
5. Run test
6. Kolmogrov-Smirnor Test (K-S-test)
CHI-SQUARE TEST
The value of chi-square describes the magnitude of difference between observed
frequencies and expected frequencies under certain assumptions. value ( quantity) ranges
from zero to infinity. It is zero when the expected frequencies and observed frequencies
completely coincide. So greater the value of , greater is the discrepancy between observed and
expected frequencies.
-test is a statistical test which tests the significance of difference between observed
frequencies and corresponding theoretical frequencies of a distribution without any assumption
about the distribution of the population. This is one of the simplest and most widely used non-
parametric test in statistical work. This test was developed by Prof. Karl Pearson in 1990.
Uses of - test
The uses of chi-square test are:-
1. Useful for the test of goodness of fit:- - test can be used to test whether there is
goodness of fit between the observed frequencies and expected frequencies.
2. Useful for the test of independence of attributes:- test can be used to test whether two
attributes are associated or not.
3. Useful for the test of homogeneity:- -test is very useful t5o test whether two attributes
are homogeneous or not.
4. Useful for testing given population variance:- -test can be used for testing whether the
given population variance is acceptable on the basis of samples drawn from that
population.
As a non-parametric test, -test is mainly used to test the goodness of fit between the
observed frequencies and expected frequencies.
Procedure:-
1. Set up mull hypothesis that there is goodness of fit between observed and expected
frequencies.
4. Obtain the table value corresponding to the lord of significance and degrees of freedom.
5. Decide whether to accept or reject the null hypothesis. If the calculated value is less than
the table value, we accept the null hypothesis and conclude that there is goodness of fit. If
the calculated value is more than the table value we reject the null hypothesis and
conclude that there is no goodness of fit.
Qn:- A sample analysis of examination result of 200 students were made. It was found that 46
students had failed, 68 secured IIIrd class, 62 IInd class and the rest were placed in the Ist
class. Are these figures commensurate with the general examination results which is in
the ratio of 2 : 3: 3: 2 for various categories respectively?
Computation of value:
O E O-E
46 200 = 40 6 36 0.9000
68 200 = 40 8 64 1.0667
62 200 = 40 2 4 0.0667
= 8.4334
= 8. 4334
The table value at 5% level of significance
and degree of freedom at 3. = 7. 815
(df = n r- 1 =4 0 1 = 3)
we reject the
we conclude that the analytical figures do not commensurate with the general
examination result. In other words, there is no goodness of fit between the observed and expected
frequencies.
Qn: Test whether the accidents occur uniformity over week days on the basis of the
following information:-
Days of the week: Sun Mon Tue Wed Thu Fri Sat
No. of accidents: 11 13 14 13 15 14 18
Sol: : There is goodness of fit between observed and expected frequencies, i.e.,
accidents occur uniformly over week days.
: There is no goodness of fit between observed and expected frequencies; i.e.,
accidents do not accrue uniformly over week days
Computation of value:
O E O-E
11 14 -3 9 0.6429
13 14 -1 1 0.0714
14 14 0 0 0.0000
13 14 1 0.0714
-1
15 14 1 0.0714
1
14 14 0 0.0000
0
18 14 16 1.1429
4
= 2.0000
test is used to find out whether one or more attributes are associated or not.
Procedure:-
1. Set up null and alternative hypothesis.
: Two attributes are independent (i.e., there is no association between the
attributes)
: Two attributes are dependent (i.e., there is an association between the
attributes)
Sol: : The two attributes, election and economic status are independent.
: The attributes, election and economic status are dependent.
Computation of value:
O E O-E
90 69 21 441 6.39
= 19.26
Qn: In a sample study about the tea habit in two towns, following date are observed in a
sample of size 100 each:-
Town A:-
51 persons were male, 31 were tea drinkers and 19 were male tea drinks.
Town B :-
46 persons were male, 17 were male tea drinkers and 26 were tea drinkers.
Is there any association between sex and tea habits ?
If so, in which town it is greater?
Sol:- : The two attributes, sex and tea habits are independent.
: The two attributes sex and tea habits are dependent.
Town A:-
2 2 Contingency table of observed frequency
Sex
Male Female Total
Tea habits
Tea Drinkers 19 12 31
Total 51 49 100
Total 51 49 100
Computation of value:
O E O-E
19 16 3 9 0.5625
32 35 -3 9 0.2571
12 15 -3 9 0.6000
37 34 3 9 0.2647
= 1.6843
Sex
Male Female Total
Tea habits
Tea Drinkers 17 9 26
Total 46 54 100
Total 46 54 100
Computation of value:
O E O-E
17 12 5 25 2.083
29 34 -5 25 0.735
9 14 -5 25 1.786
45 40 5 25 0.625
5.229
The table value of at 5% level of significance for 1 degree of freedom is 3.84. As the
calculated value is more than the table value, we reject the . In other words, attributes sex and
tea habits are not independent (i.e., associated) in Town B.
d.f. = (r-1)(c-1)
Qn: From the adult population of four large cities, random samples were selected and the number
of married and unmarried men were recorded:
Cities
A B C D Total
Married
137 164 152 147 600
Single 32 57 56 35 180
Is there significant variation among the cities in the tendency of men to marry.
Sol:- : The 4 cities are homogeneous.
: The 4 cities are heterogeneous.
Computation of value:
O E O-E
32 39 -7 49 1.2564
57 51 6 36 0.7059
56 48 8 64 1.3333
35 42 -7 49 1.1667
5.8010
test can be used for testing the given population when the sample is small.
Steps:-
=
Where = Sample variance
= Population variance
= = = = 8.8999
a) When the number of matched pairs are less than or equal to 25.
Case:1
Procedure:-
1. Set up null hypothesis:
: There is no significant difference.
: There is significant difference.
2. Find the difference between each pair of values.
3. Assign ranks to the differences from the smallest to the largest without any regard to sign.
4. Then actual signs of each difference are put to the corresponding ranks.
5. Find the total of positive ranks and negative ranks.
6. Smaller value, as per steps 5 is taken as the calculated value.
7. Obtain the table value of Wilcoxons T-Table.
8. Decide whether to accept or reject the null hypothesis.
Qn: Given below is 16 pairs of values showing the performance of two machines A
and B. Test whether there is difference between the performances. Table value of
T at 5% significanterd is 25.
A: 73, 43, 47, 53, 58, 47, 52, 58, 38, 61, 56, 56, 34, 55, 65, 75
B: 51, 41, 43, 41, 47, 32, 24, 58, 43, 53, 52, 57, 44, 57, 40, 68
1 2 3 4 5
Machine Machine Difference Rank of Difference Rank with signs
A B (3) = (1) (2) (without signs) + Sign - Sign
73 51 22 13 13
43 41 2 2.5 2.5
47 43 4 2.5 4.5
53 41 12 11 11
58 47 11 10 10
47 32 15 12 12
52 24 28 15 15
58 58 0 - -
38 43 -5 6 - -6
61 53 8 8 8
56 52 4 4.5 4.5
56 57 -1 1 - -1
34 44 -10 9 - -9
55 57 -2 2.5 - -2.5
65 40 25 14 14
75 68 7 7 7
Total 101.5 -18.5
U=
CHAPTER 11
ANALYSIS OF VARIANCE
SSC =
MSC = =
6. Compute MSE
MSE = =
7. Compute F ratio:
F=
ANOVA TABLE
Source of Sum of Degree of Means square
F - Ratio
Variation Squares freedom
Between SSC C-1 F=
MSC =
Samples
Within SSE N-C
MSE =
Sample
Total SST N-1
9. Obtain table value at corresponding to the level of significance and for degree of freedom
of (C-1, N-C).
10. Decide whether to accept or reject the null hypothesis.
Qn: Given below are the yield (in Kg.) per acre for 5 trial plots of 4 varieties of
treatments.
Treatment
Plot name 1 2 3 4
A 42 48 68 80
B 50 66 52 94
C 62 68 76 78
D 34 78 64 82
E 52 70 70 66
Carry out an analysis of variance and state whether there is any significant difference in
treatments.
= (11,968+22,268+22,100+32,400) -
= 88,736 -
= 88,736 -
SSC =
= 11,520+21,780+21,780+32,000 84,500
= 87, 080 84, 500 = 2, 580
Qn: The following data relate to the yield of 4 varieties of rice each shown on 5
plots. Find whether there is significant difference between the mean yield
of these varieties.
Treatment
Plot name 1 2 3 4
P 99 103 109 104
S 99 105 97 107
T 98 95 99 106
Sol: Apply coding method. Subtract 100 from all the observations.
-1 3 9 4 1 9 81 16
1 2 3 0 1 4 9 0
3 0 7 3 9 0 49 9
-1 5 -3 7 1 25 9 49
-2 -5 -1 6 4 25 1 36
=0 =5 = 15 = 20 16 63 149 110
= (16+63+149+110) -
= 338 -
= 338 -
= 338 80 = 258
SSC =
= 0+5+45+80 80
= 50
As the calculated value is less than the table value, we accept the null hypothesis.
2. Compute SST
SST = Sum of squares of all observations -
3. Compute SSC
SSC =
4. Compute SSR
SSR =
5. Compute SSE
6. Compute MSC
MSC = =
7. Compute MSR
MSR = =
8. Compute MSE
MSE = =
Fc =
Fr =
Varieties X Y Z
A 10 9 8
B 7 7 6
C 8 5 4
D 5 4 4
Carry two-way analysis of variance.
Sol:
X Y Z
Varieties Total Total
A( 10 9 8 27 100 81 64 245
B( ) 7 7 6 20 49 49 36 134
C( ) 8 5 4 17 64 25 16 105
D( ) 5 4 4 13 25 16 16 57
= 541 -
= 541 -
SSC =
= 225+156.25+121 494.083
= 502.25 494.083 = 8.167
SSR =
= 243+133.333+96.333+56.333 494.083
= 34.916
Machine Type
Workers A B C D
1 44 38 47 36
2 46 40 52 43
3 34 36 44 32
4 43 38 46 33
5 38 42 49 39
(a) Test whether the mean productivity is the same for the different machine types.
(b) Test whether the 5 workers differ with respect to mean productivity.
Let us apply coding method. Let us subtract 40 from all the observations.
A B C D
Workers Total Total
1( 4 -2 7 -4 5 16 4 49 16 85
2( ) 6 0 12 3 21 36 0 144 9 189
3( ) -6 -4 4 -8 -14 36 16 16 64 132
4( ) 3 -2 6 -7 0 9 4 36 49 98
5( ) -2 2 9 -1 8 4 4 81 1 90
= 594 -
SSC =
=
1794
= - 20 = 358.8-20 = 338.8
5
X1 2 X2 2 X3 2 X4 2 X5 2 T2
SSR =
N1 N2 N3 N4 N5 N