03a.session Notes On Multiple Linear Regression Analysis

Uploaded by

nairsuraj725

Session Notes on Multiple Linear Regression

Let us look at the situation considered for the discussion. A consortium of US firms produces
raw materials that are used in Singapore. The consortium is interested in the following:
1. Predicting the level of exports from the US.
2. Understanding the relationship between US exports to Singapore and certain
variables affecting the economy of that country.
What are the advantages of doing the above?
1. Understanding the relationship will allow the consortium members to time their
marketing efforts to coincide with favourable conditions in the Singapore economy.
2. Understanding the relationship will also allow the exporters to determine
whether expansion of exports to Singapore is feasible.
3. It will also identify the significant variables that act as the main drivers of exports to
Singapore.
Variables considered in the study
• US exports to Singapore in billions of Singapore dollars (the dependent variable,
Exports),
• money supply figures in billions of Singapore dollars (variable M1),
• minimum Singapore bank lending rate in percentages (variable Lend),
• an index of local prices where the base year is 1974 (variable Price),
• the exchange rate of Singapore dollars per U.S. dollar (variable Exchange)
Why should regression be used to analyse this data? Given the objectives, regression is the
appropriate method. Regression gives one an opportunity to
1. Measure how the level of exports changes when the levels of the other drivers
change.
2. Test the significance of each driver or variable that contributes to the change in
exports.
3. Help the US consortium identify the favourable conditions.
4. Build a model that connects exports to their significant drivers and use it to
make predictions.
Assumptions associated with the linear regression analysis
1. The response and the regressor variables are linearly related
2. The residuals have mean zero
3. All residuals have constant variance
4. All residuals are uncorrelated
5. Residuals are normally distributed
6. The regressors are independent of one another (no multicollinearity)
Discussion on R codes
In order to use R as a tool for running the regression analysis, we need to install a few
packages available in R. These packages are developed by researchers and come with
various built-in functions that are used to run the analysis. For running regression analysis in
R, we install the following packages (the script also installs readxl to import the Excel file)
car- Companion to Applied Regression
https://www.rdocumentation.org/packages/car/versions/3.0-8
psych- package used for psychological, psychometric and personality research
https://www.rdocumentation.org/packages/psych/versions/1.9.12.31
Hmisc- Harrell Miscellaneous
https://www.rdocumentation.org/packages/Hmisc/versions/4.4-0
lmtest- Testing Linear Regression Models
https://www.rdocumentation.org/packages/lmtest/versions/0.9-37
lm.beta- Standardized regression coefficients for lm objects
https://www.rdocumentation.org/packages/lm.beta/versions/1.5-1/topics/lm.beta
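Since install.packages() only needs to be run once per machine, it can help to check first which of these packages are missing. The following is a minimal sketch, not part of the original notes; the helper name missing_packages is hypothetical, and the install line is left commented out because it needs internet access.

```r
# Packages named in these notes; readxl is included because the script
# uses it to import the Excel file.
pkgs <- c("readxl", "psych", "Hmisc", "lmtest", "lm.beta", "car")

# Hypothetical helper (an assumption, not from the notes): returns the
# names in 'pkgs' that are not found among the installed packages.
missing_packages <- function(pkgs) {
  setdiff(pkgs, rownames(installed.packages()))
}

to_install <- missing_packages(pkgs)
if (length(to_install) > 0) {
  message("Not installed: ", paste(to_install, collapse = ", "))
  # install.packages(to_install)   # uncomment to install the missing ones
} else {
  message("All packages are already installed.")
}
```

After this check, each package still has to be loaded with library() in every new R session.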

R-codes
setwd("F:/07.PGDM 2020/03.DAR/09.R-Codes") # Set the working directory (change this path to your own folder)
getwd() # Show the current working directory

# install.packages() needs to be run only once per machine; library() is needed in every session
install.packages("readxl") # Package for importing Excel files into R
library(readxl) # Load the package readxl
install.packages("psych")
library(psych)
install.packages("Hmisc")
library(Hmisc)
install.packages("lmtest")
library(lmtest) # Provides bptest()
install.packages("lm.beta")
library(lm.beta)
install.packages("car")
library(car) # Provides durbinWatsonTest() and vif()
exports <- read_excel(file.choose()) # Choose the Excel file interactively and import it as exports
attach(exports) # Optional: makes the columns available by name; the models below pass the data explicitly
fix(exports) # Open the data in R's data editor (any edits are saved back to exports)
View(exports) # Open the data in a read-only viewer
#Summary Statistics
# Before building the model, it is very important to understand the variables better.
# summary() reports the minimum, quartiles, median, mean and maximum of each variable
# (it does not report the mode). Describe each variable using these statistics.
summary(exports)
    Exports            M1             Lend           Price          Exchange
 Min.   :2.600   Min.   :4.900   Min.   : 7.80   Min.   :114.0   Min.   :2.040
 1st Qu.:4.200   1st Qu.:6.000   1st Qu.: 9.00   1st Qu.:146.0   1st Qu.:2.100
 Median :4.800   Median :7.000   Median :10.00   Median :151.0   Median :2.130
 Mean   :4.528   Mean   :6.909   Mean   :10.52   Mean   :147.3   Mean   :2.133
 3rd Qu.:5.100   3rd Qu.:8.100   3rd Qu.:11.60   3rd Qu.:154.0   3rd Qu.:2.160
 Max.   :5.600   Max.   :8.800   Max.   :15.00   Max.   :162.0   Max.   :2.240

#Scatter plots
# Scatter plot matrix for all the variables considered in the study
pairs(~ Exports + M1 + Lend + Price + Exchange, data = exports)
#Building the model
# "lm" means "linear model" and is used to build the model. The symbol ~ links the response
# (dependent variable) to the regressor variables (independent variables). The regressor
# variables are added to the formula using the "+" sign.
exp_lm <- lm(Exports ~ M1 + Lend + Price + Exchange, data = exports)
exp_lm # This gives the coefficient values of the model.
Coefficients:
(Intercept)           M1         Lend        Price     Exchange
  -4.015461     0.368456     0.004702     0.036511     0.267896

summary(exp_lm) # Gives the standard errors, t tests and p-values of the coefficients, R-squared and the overall F test

Coefficients:
             Estimate Std. Error t value Pr(>|t|)
(Intercept) -4.015461   2.766401  -1.452 0.151679
M1           0.368456   0.063848   5.771 2.71e-07 ***
Lend         0.004702   0.049222   0.096 0.924201
Price        0.036511   0.009326   3.915 0.000228 ***
Exchange     0.267896   1.175440   0.228 0.820465
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 0.3358 on 62 degrees of freedom
Multiple R-squared:  0.825,  Adjusted R-squared:  0.8137
F-statistic: 73.06 on 4 and 62 DF,  p-value: < 2.2e-16
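To see where the columns of this table come from, the t value, p-value and adjusted R-squared can be recomputed from the printed figures. This is a sketch that copies numbers from the output above rather than refitting the model; the variable names are illustrative, and the sample size of 67 is inferred from the 62 residual degrees of freedom plus 5 estimated coefficients.

```r
# Values copied from the summary(exp_lm) output (M1 row and model fit lines).
estimate  <- 0.368456
std_error <- 0.063848
df_resid  <- 62      # residual degrees of freedom
r_squared <- 0.825
n <- 67              # inferred: 62 residual df + 5 estimated coefficients
k <- 4               # number of regressors

# t value = estimate divided by its standard error
t_value <- estimate / std_error

# Two-sided p-value from the t distribution with the residual df
p_value <- 2 * pt(-abs(t_value), df = df_resid)

# Adjusted R-squared penalises R-squared for the number of regressors
adj_r_squared <- 1 - (1 - r_squared) * (n - 1) / (n - k - 1)

print(c(t_value = t_value, p_value = p_value, adj_r_squared = adj_r_squared))
```

The recomputed t value and adjusted R-squared agree with the printed 5.771 and 0.8137 up to rounding of the inputs.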

# Rebuilding the model after dropping the insignificant variables
# Lend (p = 0.924) and Exchange (p = 0.820) are not significant in the first model,
# so we rebuild the model with M1 and Price only.
exp_lm2 <- lm(Exports ~ M1 + Price, data = exports)
exp_lm2
summary(exp_lm2)
Coefficients:
             Estimate Std. Error t value Pr(>|t|)
(Intercept) -3.422957   0.540853  -6.329 2.75e-08 ***
M1           0.361417   0.039246   9.209 2.45e-13 ***
Price        0.037033   0.004094   9.046 4.70e-13 ***
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 0.3306 on 64 degrees of freedom
Multiple R-squared:  0.8248,  Adjusted R-squared:  0.8193
F-statistic: 150.7 on 2 and 64 DF,  p-value: < 2.2e-16
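Dropping variables one p-value at a time can be backed up with a single test that compares the reduced model with the full one. The sketch below uses simulated data (the exports file is not available here), so the variable names x1, x2, x3 and all numbers are assumptions made for illustration only.

```r
# Simulated stand-in for the exports data: x3 has no effect on y by construction.
set.seed(1)
n <- 67
sim <- data.frame(x1 = rnorm(n), x2 = rnorm(n), x3 = rnorm(n))
sim$y <- 1 + 0.5 * sim$x1 + 0.3 * sim$x2 + rnorm(n, sd = 0.3)

full    <- lm(y ~ x1 + x2 + x3, data = sim)   # model with all regressors
reduced <- lm(y ~ x1 + x2, data = sim)        # model after dropping x3

# Partial F test: does the full model fit significantly better than the reduced one?
cmp <- anova(reduced, full)
print(cmp)
```

With the real data the corresponding call would be anova(exp_lm2, exp_lm); a large p-value would support dropping Lend and Exchange together.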

#Testing the assumptions of the model

#1. Average residual is zero. This indicates that the contribution from the unknown variables
# is on average negligible.
mean(exp_lm2$residuals) # Check the assumption. With an intercept in the model, least squares makes this zero up to rounding error.
[1] -2.531693e-18

#2. Variance of residual is constant

bptest(exp_lm2) # Breusch-Pagan test is used to check this assumption.
studentized Breusch-Pagan test

data: exp_lm2
BP = 3.0888, df = 2, p-value = 0.2134
# The p-value is above 0.05, so there is no evidence against constant variance.

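#3. Errors are uncorrelated

durbinWatsonTest(exp_lm2) # We use the Durbin-Watson test to check this assumption
 lag Autocorrelation D-W Statistic p-value
   1      -0.3188038      2.576484   0.024
 Alternative hypothesis: rho != 0
# The p-value is below 0.05, so at the 5% level the residuals show (negative) first-order autocorrelation.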
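The Durbin-Watson statistic itself is simple to compute from the residuals: it is the sum of squared successive differences divided by the sum of squared residuals, and it is roughly 2(1 - r), where r is the lag-1 autocorrelation. The sketch below shows this on simulated data; the data and variable names are assumptions, not the exports data.

```r
# Simulated regression with independent errors (illustration only).
set.seed(2)
n <- 67
x <- rnorm(n)
y <- 2 + 0.5 * x + rnorm(n)
fit <- lm(y ~ x)
e <- residuals(fit)

# Durbin-Watson statistic: squared successive differences over squared residuals
dw <- sum(diff(e)^2) / sum(e^2)

# Lag-1 autocorrelation of the residuals and the usual approximation 2 * (1 - r1)
r1 <- sum(e[-1] * e[-n]) / sum(e^2)
dw_approx <- 2 * (1 - r1)

print(c(dw = dw, dw_approx = dw_approx, r1 = r1))
```

Applied to the output above, 2 * (1 + 0.3188) is about 2.64, close to the reported D-W statistic of 2.58. Values near 2 indicate no autocorrelation, and values well above 2 indicate negative autocorrelation.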
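#4. Normality of the residuals
shapiro.test(exp_lm2$residuals) # Shapiro-Wilk test is used to check this assumption
Shapiro-Wilk normality test

data: exp_lm2$residuals
W = 0.96227, p-value = 0.03998
# The p-value is just below 0.05, so normality of the residuals is rejected at the 5% level.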

#5. All the regressors are independent

vif(exp_lm2) # Variance inflation factor is used to check this assumption
      M1    Price
1.249779 1.249779
# As a rule of thumb, a VIF value above 10 indicates a problem of multicollinearity.
# Here both values are close to 1, so multicollinearity is not a concern.
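The two VIF values are identical because the model has only two regressors: the VIF of a regressor is 1 / (1 - R²), where R² comes from regressing it on the other regressors, and with two regressors that R² is simply their squared correlation. The sketch below works backwards from the printed VIF; the variable names are illustrative.

```r
# Value printed by vif(exp_lm2) for both M1 and Price.
vif_reported <- 1.249779

# VIF = 1 / (1 - R^2)  =>  R^2 = 1 - 1 / VIF
r_squared_between <- 1 - 1 / vif_reported

# With two regressors this R^2 is the squared correlation between M1 and Price;
# the sign of the correlation cannot be recovered from the VIF.
abs_correlation <- sqrt(r_squared_between)

print(c(r_squared = r_squared_between, abs_correlation = abs_correlation))
```

So M1 and Price share about 20% of their variance (a correlation of about 0.45 in absolute value), which is far from the level at which coefficient estimates become unstable.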
# Coefficients of the final model
exp_lm2$coefficients
(Intercept)          M1       Price
-3.42295723  0.36141732  0.03703264
# Confidence intervals for the regression coefficients
confint(exp_lm2, level = 0.95)

                  2.5 %      97.5 %
(Intercept) -4.50343606 -2.34247841
M1           0.28301385  0.43982079
Price        0.02885435  0.04521092
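Each interval above is the estimate plus or minus a t quantile times the standard error. The sketch below reproduces the M1 interval from the figures printed by summary(exp_lm2); the variable names are illustrative.

```r
# Values copied from summary(exp_lm2) for M1.
estimate  <- 0.361417
std_error <- 0.039246
df_resid  <- 64                     # residual degrees of freedom of exp_lm2

# 97.5% quantile of the t distribution gives a two-sided 95% interval
t_crit <- qt(0.975, df = df_resid)

lower <- estimate - t_crit * std_error
upper <- estimate + t_crit * std_error

print(c(lower = lower, upper = upper))
```

The result agrees with the confint() row for M1 (0.28301385 to 0.43982079) up to rounding of the inputs. Since the interval excludes zero, M1 is significant at the 5% level, consistent with its p-value.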

predict(exp_lm2, interval = "confidence") # Fitted values with 95% confidence intervals for the mean response at each observed data point

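# Prediction Intervals
new1 <- data.frame(M1 = c(5.3, 5.4, 5.5), Price = c(118, 119, 120)) # New values of the regressors
new1
help("predict") # Documentation for predict()
exp_lm2
predict(exp_lm2, new1) # Point predictions of Exports for the new values
predict(exp_lm2, new1, interval = "prediction") # Point predictions with 95% prediction intervals
detach(exports)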
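A point prediction is just the fitted equation evaluated at the new values: Exports = intercept + b1 * M1 + b2 * Price. The sketch below does this by hand with the coefficients printed for exp_lm2, so it runs without the data file; the matrix X and the name pred are illustrative.

```r
# Coefficients of exp_lm2 as printed above: (Intercept), M1, Price.
coefs <- c(-3.42295723, 0.36141732, 0.03703264)

# The same new values of the regressors used in the notes.
new1 <- data.frame(M1 = c(5.3, 5.4, 5.5), Price = c(118, 119, 120))

# Design matrix: a column of ones for the intercept, then the regressors.
X <- cbind(1, as.matrix(new1))

# Predicted exports (billions of Singapore dollars) for each row of new1.
pred <- as.vector(X %*% coefs)
print(round(pred, 4))
```

These values should match predict(exp_lm2, new1) up to rounding. The prediction interval adds uncertainty around each of them that this hand calculation does not show.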
