Multiple Linear Regression Model - Final
Multiple Linear Regression Model - Final
… (1.4.2)
• Assume that Eq. (1.4.1) is the “true” model explaining the behavior of child mortality in relation to per capita
GNP and female literacy rate (FLR).
• But suppose we disregard FLR and estimate the simple regression as in eq. (1.4.2) would constitute a
specification error; the error here consists in omitting the variable X3, the female literacy rate.
• Now will β̃ 1 provide an unbiased estimate of the true impact of PGNP, which is given by 1 in model (1.4.1)?
• In other words, will the coefficient of PGNP in Eq. (1.4.2) provide an unbiased estimate of the true impact of
PGNP on CM, knowing that we have omitted the variable X3 (FLR) from the model?
• In absolute terms, if we compare the results of this (mis-specified) regression with the “true” multiple
regression, we will see that the results are different.
• No, as β̃ 1 is not same as 1
Allocating R2 among Regressors
• In eq (1.4) CMi = 1+ 2PGNPi + 3FLRi + i the two regressors explain the 0.7077 variation in the
e.g. of child mortality that is:
R12 = 0.7077
• In eq (1.4) when we dropped PGNP, CMi = 1 + 2FLRi + 1 ,the goodness of fit is:
R22 = 0.6696
• In eq (1.4) when we dropped FLR, CMi =1+ 2PGNPi + i ,the goodness of fit is:
R32 = 0.1662
• From the above we can say that,
• FLR & PGNP , these two variables together explains 70.77% of variation in CM
• Out of 70.77%, FLR explains 66.96 % & PGNP explains 16.62 %
• But, R22 + R32 ≠ R12 - that means we cannot allocate R2 of 0.7077 between two regressors, PGNP and FLR,
in this manner.
• Because, the two regressors are correlated, the correlation coefficient between the two being 0.2685
Adjusted R2
• From the previous example we can also observe that as the number of regressors in the model
increases, the R2 value increases.
• An important property of R2 is that it is a nondecreasing function of the number of explanatory
variables or regressors present in the model, unless the added variable is perfectly collinear with the
other regressors; as the number of regressors increases, R 2 almost invariably increases and never
decreases.
• To see this, recall the definition of R 2
• Now ∑ y2 is independent of the number of X variables in the model because it is simply ∑(Yi − Y ̄ )2.
The RSS, ∑ 2 , however, depends it is clear that as the number of X variables increases, on the number
of regressors present in the model, ∑ 2 is likely to decrease; hence R2 will increase.
• To compare two R2 terms, one must take into account the number of X variables present in the
model. R2 measures the proportion of the variation in the dependent variable accounted for by the
explanatory variable(s).
Adjusted R2
• If you do want to compare the two R2 values, we need to calculated adjusted R2.
• The term adjusted means adjusted for the df associated with the sums of squares, which is as follows:
A comparison of 2008 Yu Cen , Empirical studies of the variation This study adopts Results of this study
neural network and Jinpeng Yin in debt ratios across firms have multiple linear show that the
multiple regression used statistical models singularly regressions and artificial determinants of capital
analysis to analyze the important neural networks (ANN) structure are different in
in modeling capital determinants of capital structure models. both industries.
structure This study attempts to examine
the usefulness of ANNs as Basic statistics are The major different
analyses and predictions of capital estimated to describe determinants are
structure and to compare these each variable collected business-risk and growth
ANNs with multiple linear and t-tests are opportunities.
regression results. conducted to determine
if variables of high-tech
corporations are
different from that of
traditional corporations.
References
• Pao, H. T. (2008). A comparison of neural network and multiple
regression analysis in modeling capital structure. Expert Systems with
Applications, 35(3), 720–727.
https://doi.org/10.1016/j.eswa.2007.07.018
• Gujarati, D. N. (2015). Basic econometrics (5th ed.). McGraw-Hill
Education.
• GMK Madnani. (2024). Introduction to Econometrics, Principles and
Applications (8th ed.)
Thank you