Unit 3 Fod
Unit 3 Fod
UNIT III
DESCRIBING RELATIONSHIPS
COMPILED BY,
VERIFIED BY
Correlation
Scatter plots
correlation coefficient for quantitative data
computational formula for correlation coefficient
Regression –regression line
least squares regression line
Standard error of estimate
interpretation of r2
multiple regression equations
regression towards the mean
LIST OF IMPORTANT QUESTIONS
UNIT III
DESCRIBING RELATIONSHIPS
PART – A
1. What do you mean by Correlation?
2. What do you mean by correlation coefficient?
3. Write down the Uses of correlations.
4. What is Multiple Correlation ?
5. State in each case whether there is
(a) Positive Correlation
(b) Negative Correlation
(c) No Correlation
Sl.No Particulars
1 Price of commodity and its demand
2 Yield of crop and amount of rainfall
3 No of fruits eaten and hungry of a person
4 No of units produced and fixed cost per unit
5 No of girls in the class and marks of boys
6 Ages of Husbands and wife
7 Temperature and sale of woollen garments
8 Number of cows and milk produced
9 Weight of person and intelligence
10 Advertisement expenditure and sales volume
PART - B
4. Find the correlation coefficient between age and playing habits of the following
students using Karl Pearson’s coefficient of correlation method.
Age 15 16 17 18 19 20
Number of students 250 200 150 120 100 80
Regular Players 200 150 90 48 30 12
5. Find Karl Pearson’s coefficient of correlation between capital employed and profit
obtained from the following data.
Capital Employed (Rs. In Crore) 10 20 30 40 50 60 70 80 90 100
Profit (Rs. In Crore) 2 4 8 5 10 15 14 20 22 50
PART – A
1. What do you mean by Correlation?
Correlation is a statistical technique to ascertain the association or relationship
between two or more variables. Correlation analysis is a statistical technique to study the
degree and direction of relationship between two or more variables.
2. What do you mean by correlation coefficient?
A correlation coefficient is a statistical measure of the degree to which changes
to the value of one variable predict change to the value of another. When the fluctuation
of one variable reliably predicts a similar fluctuation in another variable, there’s often a
tendency to think that means that the change in one causes the change in the other.
3. Write down the Uses of correlations:
I. Correlation analysis helps inn deriving precisely the degree and the direction of
such relationship.
II. The effect of correlation is to reduce the range of uncertainity of our prediction.
The prediction based on correlation analysis will be more reliable and near to
reality.
III. Correlation analysis contributes to the understanding of economic behaviour,
aids in locating the critically important variables on which others depend, may
reveal to the economist the connections by which disturbances spread and
suggest to him the paths through which stabilizing farces may become effective
IV. Economic theory and business studies show relationships between variables
like price and quantity demanded advertising expenditure and sales promotion
measures etc.
V. The measure of coefficient of correlation is a relative measure of change.
Types of Correlation: Correlation is described or classified in several different
ways. Three of the most important are: I. Positive and Negative II. Simple, Partial
and Multiple III. Linear and non-linear
4. What is Multiple Correlation ?
When three or more variables are studied, it is a case of multiple correlation. For
example, in above example if study covers the relationship between student marks,
attendance of students, effectiveness of teacher, use of teaching aids etc, it is a case of
multiple correlation.
5. State in each case whether there is
a) Positive Correlation
b) Negative Correlation
c) No Correlation
Sl.No Particulars Solution
1 Price of commodity and its demand Negative
2 Yield of crop and amount of rainfall Positive
3 No of fruits eaten and hungry of a person Negative
4 No of units produced and fixed cost per unit Negative
5 No of girls in the class and marks of boys No Correlation
6 Ages of Husbands and wife Positive
7 Temperature and sale of woollen garments Negative
8 Number of cows and milk produced Positive
9 Weight of person and intelligence No Correlation
10 Advertisement expenditure and sales volume Positive
6. List out the Properties of Coefficient of Correlation.
The coefficient of correlation always lies between – 1 to +1, symbolically it can
written as – 1 ≤ r ≤ 1.
The coefficient of correlation is independent of change of origin and scale.
The coefficient of correlation is a pure number and is independent of the units of
measurement. It means if X represent say height in inches and Y represent say
weights in kgs, then the correlation coefficient will be neither in inches nor in kgs but
only a pure number.
The coefficient of correlation is the geometric mean of two regression coefficient,
symbolically 𝑟2= bxy ∗ byx
If X and Y are independent variables then coefficient of correlation is zero.
A study of measuring the relationship between associated variables, wherein one
variable is dependent on another independent variable, called as Regression. It is
developed by Sir Francis Galton in 1877 to measure the relationship of height
between parents and their children.
PART – B
1. Explain in detail about the types of Correlation.
Correlation is described or classified in several different ways. Three of the most
important are:
I. Positive and Negative
Solution:
From the observation of scatter diagram we can say that the variables are positively
correlated. In the diagram the points trend toward upward rising from the lower left hand
corner to the upper right hand corner, hence it is positive correlation. Plotted points are in
narrow band which indicates that it is a case of high degree of positive correlation.
16
14
12
Profit (Rs. in Lakhs)
10
0
0 2 4 6 8 10 12 14
Capital Employed (Rs. in Crore)
Solution:
Let us assume that advertisement expenses are variable X and sales volume are variable Y.
Calculation of Karl Pearson’s coefficient of correlation
Firm X Y x=X- y=Y- xy
x2 y2
Ẋ Ẏ
1 11 50 -3 9 -8 64 24
2 13 50 -1 1 -8 64 8
3 14 55 0 0 -3 9 0
4 16 60 2 4 2 4 4
5 16 65 2 4 7 49 14
6 15 65 1 1 7 49 7
7 15 65 1 1 7 49 7
8 14 60 0 0 2 4 0
9 13 60 -1 1 2 4 -2
10 13 50 -1 1 -8 64 8
140 580 22 360 70
∑X ∑Y ∑x2 ∑y2 ∑xy
∑xy
r= √∑x2 ∑y2
70 70 = 0.7866
= √22∗360 = 88.9944
Interpretation: From the above calculation it is very clear that there is high degree of
positive correlation i.e. r = 0.7866, between the two variables. i.e. Increase in
advertisement expenses leads to increased sales volume.
4. Find the correlation coefficient between age and playing habits of the following students
using Karl Pearson’s coefficient of correlation method.
Age 15 16 17 18 19 20
Number of students 250 200 150 120 100 80
Regular Players 200 150 90 48 30 12
Solution:
To find the correlation between age and playing habits of the students, we need to compute
the percentages of students who are having the playing habit.
Percentage of playing habits = No. of Regular Players / Total No. of Students * 100
Now, let us assume that ages of the students are variable X and percentages of playing habits are
variable Y.
∑(X−Ẋ)(Y−Ẏ) = −240
r= √∑(X−Ẋ)2 ∑(Y−Ẏ)2 √17.5∗3350
−240 = -0.9912
= 242.126
Interpretation: From the above calculation it is very clear that there is high degree of negative
correlation i.e. r = -0.9912, between the two variables of age and playing habits. i.e. Playing habits
among students decreases when their age increases.
Solution:
32,500 = 0.8519
r= 38148.3945
6. After investigation it has been found the demand for automobiles in a city depends mainly, if
not entirely, upon the number of families residing in that city. Below are the given figures for
the sales of automobiles in the five cities for the year 2019 and the number of families
residing in those cities.
City No. of Families (in lakhs): X Sale of automobiles (in ‘000):
Y
Belagavi 70 25.2
Bangalore 75 28.6
Hubli 80 30.2
Kalaburagi 60 22.3
Mangalore 90 35.4
Fit a linear regression equation of Y on X by the least square method and estimate the sales for the
year 2020 for the city Belagavi which is estimated to have 100 lakh families assuming that the
same relationship holds true.
Solution:
Therefore now we have -221.5 = -500b, this can rewritten as 500b = 221.5
Now,
b = 221.5 = 0.443
500
Estimated sales of automobiles (Y) in city Belagavi for the year 2020, where
number of families (X) are 100(in lakhs):
Y = -4.885 + 0.443X
Y = -4.885 + (0.443 * 100)
Y = -4.885 + 44.3
Y = 39.415 (‘000)
Means sales of automobiles would be 39,415 when number of families are 100,00,000
15
7. Explain in detail about method of measurement of Correlation.
16
17
18