Regression Analysis
Regression Analysis
Regression Analysis
Regression Analysis
Regression
Regression
Analysis of dependence of dependent variable on the independent variables with
an objective to predict the average value of the dependent variable
Types of Regression
2
Regression Analysis
Historical Background
The term was first introduced by Sir Francis Galton in the study of hereditary.
In a population, if you take any child’s height; it will tend towards the population average.
In other words, taller parents had taller child and shorter parents had shorter child. It was
“regression to mediocrity”.
Karl Pearson confirmed this in his study where he found, “average height of sons of tall
fathers was less than their father’s height and average height of sons of short fathers
was greater than their father’s height, thus “regressing” tall and short sons toward
average height of all men”.
Example
Suppose the yield of the crop (Y) depends linearly on two explanatory variables, viz., the
quantity of fertilizer ( X 1 ) and level of irrigation ( X 2 ) as
Y=bX +bX +
1 1 2 2
There exist the true values of β 1 and β 2 in nature but are unknown to the experimenter.
Some values on Y are recorded by providing different values to X 1 and X 2 . There exists
some relationship between Y and X 1 , X 2 which gives rise to a systematically behaved
data on Y, X 1 and X 2 . Such a relationship is unknown to the experimenter.
3
Regression Analysis
To determine the model, we move in the backward direction in the sense that the
collected data is used to determine the unknown parameters β 1 and β 2of the model.
In this sense, such an approach is termed as regression analysis.
,..., b ) + where is the random error reflecting mainly the difference in the observed
k
value of Y and the value of Y obtained through the model. The form of f ( X , X ,..., X ; b ,
1 2 k 1
4
Regression Analysis
the maximum likelihood method (needs knowledge of distribution of Y), principle of least
squares, method of moments, ridge method, principal components method etc.
Fitting of model
The estimation of unknown parameters using appropriate method provides the values of
the parameter. Substituting these values in the equation gives us a usable model. This is
termed as model fitting. Estimates of parameters b , b ,..., b in the model Y = f ( X , X
1 2 k. 1 2
,..., X ; b , b ,..., b ) + are denoted by bˆ 1, bˆ2 ,..., bˆk which gives the fitted model as Y
k 1 2 k 1
= f ( X , X ,..., X ; bˆ 1, bˆ2 ,..., bˆk) . When the value of Y is obtained for the given values
1 2 k 1
Using the chosen model(s) for the solution of the posed problem.
The determination of the explicit form of the regression equation is the ultimate
objective of regression analysis.
To determine the role of any explanatory variable in the joint relationship in any
policy formulation,
To forecast the values of the response variable for a given set of values of
explanatory variables.
Regression Correlation
Purpose Predicts the average value of one Measures the direction and strength or
variable on the basis of fixed values of degree of linear association between the
other variables. two variables.
Usage There is an asymmetry in the way the Variables are treated symmetrically, i.e.;
dependent and explanatory variables are there is no difference between the
treated. dependent and explanatory variables.
The dependent variable is assumed to Both variables are assumed to be random.
be statistical, random or stochastic (i.e.,
to have a probability distribution). The
explanatory variables, are assumed to
have fixed values (in repeated sampling).
5
Regression Analysis