0% found this document useful (0 votes)

3 views8 pages

Deliverytime 3

Uploaded by

KamalSilvas

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views8 pages

Deliverytime 3

Uploaded by

KamalSilvas

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Multiple Linera Regression on Delivery Time data

Delivery Time Data

Predicting the amount of time required by the route driver to service the vending machines in an outlet.
Response: delivery time (y) Predictors: number of cases of product stocked (x1) distance walked by the
route driver (x2).

library(readxl)
DeliveryTime <- read_excel("DeliveryTime.xlsx")

colnames(DeliveryTime)<-c("Time","NumberofCases", "Distance")
summary(DeliveryTime)

## Time NumberofCases Distance

## Min. : 8.00 Min. : 2.00 Min. : 36.0
## 1st Qu.:13.75 1st Qu.: 4.00 1st Qu.: 150.0
## Median :18.11 Median : 7.00 Median : 330.0
## Mean :22.38 Mean : 8.76 Mean : 409.3
## 3rd Qu.:21.50 3rd Qu.:10.00 3rd Qu.: 605.0
## Max. :79.24 Max. :30.00 Max. :1460.0

pairs(DeliveryTime)

1
5 10 15 20 25 30

70
50
Time

30
10
25

NumberofCases
15
5

1000
Distance

400
0
10 20 30 40 50 60 70 80 0 200 600 1000 1400

Scatter plots y vs x1 and y vs x2 shows linear relationships. Addition, x1 vs x2 plot also shows linear
relationship, resulting multicollinearity.
If there is only one (or a few) dominant regressor, or if the regressors operate nearly independently, the matrix
of scatterplots is most useful. However, when several important regressors are themselves interrelated, then
these scatter diagrams can be very misleading.

cor(DeliveryTime)

## Time NumberofCases Distance

## Time 1.0000000 0.9646146 0.8916701
## NumberofCases 0.9646146 1.0000000 0.8242150
## Distance 0.8916701 0.8242150 1.0000000

cor.test(as.numeric(DeliveryTime$Distance),DeliveryTime$Time)

##
## Pearson’s product-moment correlation
##
## data: as.numeric(DeliveryTime$Distance) and DeliveryTime$Time
## t = 9.4465, df = 23, p-value = 2.214e-09
## alternative hypothesis: true correlation is not equal to 0
## 95 percent confidence interval:
## 0.7666503 0.9515461
## sample estimates:

2
## cor
## 0.8916701

cor.test(DeliveryTime$NumberofCases,DeliveryTime$Time)

##
## Pearson’s product-moment correlation
##
## data: DeliveryTime$NumberofCases and DeliveryTime$Time
## t = 17.546, df = 23, p-value = 8.22e-15
## alternative hypothesis: true correlation is not equal to 0
## 95 percent confidence interval:
## 0.9202275 0.9845031
## sample estimates:
## cor
## 0.9646146

Both predictor variables have significant and strong positive linear correlations with the response.
Obtain the 3D visual as below.

Del_lm2<-lm(Time~NumberofCases+Distance,data=DeliveryTime)

sp<-scatterplot3d(DeliveryTime$NumberofCases,DeliveryTime$Distance,DeliveryTime$Time,xlab = "No. of Case

sp$plane3d(Del_lm2, lty.box = "solid")#,

80
60
Delivary time

Distance

1500
20

1000
500
0
0

0 5 10 15 20 25 30

No. of Cases

3
Modeling
form of the linear model:
y = β0 + β1 x 1 + β2 x 2 + ϵ

colnames(DeliveryTime)

## [1] "Time" "NumberofCases" "Distance"

First model with the simple linear model using NumberofCases only.

Del_lm1<-lm(Time~NumberofCases,data=DeliveryTime)

Now add the second variable Distance, to the multiple linear regression model.

Del_lm2<-lm(Time~NumberofCases+Distance,data=DeliveryTime)

To get the anova table,

anova(Del_lm2)

## Analysis of Variance Table

##
## Response: Time
## Df Sum Sq Mean Sq F value Pr(>F)
## NumberofCases 1 5382.4 5382.4 506.619 < 2.2e-16 ***
## Distance 1 168.4 168.4 15.851 0.0006312 ***
## Residuals 22 233.7 10.6
## ---
## Signif. codes: 0 ’***’ 0.001 ’**’ 0.01 ’*’ 0.05 ’.’ 0.1 ’ ’ 1

In the R output anova table shows SS for seperate predictor. But we need overall SS. Let’s obtain it as
below.
Null hypothesis: All the regression coefficients are zero. beta_1=beta2=0 Alternative hypothesis: at least
one regression coefficient(beta_1 or beta_2) is non zero
SSR = 5382.4+168.4=5550 with df=2 SSRes=233.7 with df=25-2-1=22 Fstatistic=(5550/2)/(233.7/22)=261.27
Critical Value of F=F(2,22)(alpha=0.05)=3.44

qf(0.95,2,22)

## [1] 3.443357

Fstatitic is greater than the critical value, therefore, we reject the null hypothesis and conclude that atleast
one regression coefficient is non zero. therefore the regression is significant.
H0 : β1 = β2 = 0 is rejected based on the F-test since pvalue~=0.
Since the P value of the F statistic is very small, we conclude that delivery time is related to delivery volume
and/or distance. However, this does not necessarily imply that the relationship found is an appropriate
one for predicting delivery time as a function of volume and distance. Further tests of model adequacy are
required.

4
summary(Del_lm2)

##
## Call:
## lm(formula = Time ~ NumberofCases + Distance, data = DeliveryTime)
##
## Residuals:
## Min 1Q Median 3Q Max
## -5.7880 -0.6629 0.4364 1.1566 7.4197
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 2.341231 1.096730 2.135 0.044170 *
## NumberofCases 1.615907 0.170735 9.464 3.25e-09 ***
## Distance 0.014385 0.003613 3.981 0.000631 ***
## ---
## Signif. codes: 0 ’***’ 0.001 ’**’ 0.01 ’*’ 0.05 ’.’ 0.1 ’ ’ 1
##
## Residual standard error: 3.259 on 22 degrees of freedom
## Multiple R-squared: 0.9596, Adjusted R-squared: 0.9559
## F-statistic: 261.2 on 2 and 22 DF, p-value: 4.687e-16

In the bottom of the Summary of the model has the overall F-statistic as well.
Further,
R2 for the multiple regression model for the delivery time data as R2 = 0.96, or 96.0%.
Compare the R squared for single predictor and multiple predictor models.

summary(Del_lm1)

##
## Call:
## lm(formula = Time ~ NumberofCases, data = DeliveryTime)
##
## Residuals:
## Min 1Q Median 3Q Max
## -7.5811 -1.8739 -0.3493 2.1807 10.6342
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 3.321 1.371 2.422 0.0237 *
## NumberofCases 2.176 0.124 17.546 8.22e-15 ***
## ---
## Signif. codes: 0 ’***’ 0.001 ’**’ 0.01 ’*’ 0.05 ’.’ 0.1 ’ ’ 1
##
## Residual standard error: 4.181 on 23 degrees of freedom
## Multiple R-squared: 0.9305, Adjusted R-squared: 0.9275
## F-statistic: 307.8 on 1 and 23 DF, p-value: 8.22e-15

2
RAdj = 0.956 . (95.6%) for the two - variable model, while for the simple linear regression model with only
2
x1 (cases), RAdj = 0.930 . , or 93%. Therefore, we would conclude that adding x2 (distance) to the model

5
did result in a meaningful reduction of total variability. This implies having distance and no of cases both
in the model is better than having only no. of cases in the model.
Also comment on the significance of single predictors using the t-test given in the summary.
H0 : β2 = 0 is rejected based on the t-test since pvalue is 0.000631. Hence, conclude that the regressor x2
(distance) contributes significantly to the model given that x1 (cases) is also in the model.
here, tstatistic=0.014385/0.003613=3.98 critical value=t(n-k-1)(0.05/2)=2.073873

qt(0.975,22)

## [1] 2.073873

#plot(Del_lm2)
confint.lm(Del_lm2)

## 2.5 % 97.5 %
## (Intercept) 0.066751987 4.61571030
## NumberofCases 1.261824662 1.96998976
## Distance 0.006891745 0.02187791

extra - sum - of - squares method.

measuring the contribution of xj as if it were the last variable added to the model.

anova(Del_lm1)

## Analysis of Variance Table

##
## Response: Time
## Df Sum Sq Mean Sq F value Pr(>F)
## NumberofCases 1 5382.4 5382.4 307.85 8.22e-15 ***
## Residuals 23 402.1 17.5
## ---
## Signif. codes: 0 ’***’ 0.001 ’**’ 0.01 ’*’ 0.05 ’.’ 0.1 ’ ’ 1

anova(Del_lm2)

6
## Analysis of Variance Table
##
## Response: Time
## Df Sum Sq Mean Sq F value Pr(>F)
## NumberofCases 1 5382.4 5382.4 506.619 < 2.2e-16 ***
## Distance 1 168.4 168.4 15.851 0.0006312 ***
## Residuals 22 233.7 10.6
## ---
## Signif. codes: 0 ’***’ 0.001 ’**’ 0.01 ’*’ 0.05 ’.’ 0.1 ’ ’ 1

anova(Del_lm1,Del_lm2)

## Analysis of Variance Table

##
## Model 1: Time ~ NumberofCases
## Model 2: Time ~ NumberofCases + Distance
## Res.Df RSS Df Sum of Sq F Pr(>F)
## 1 23 402.13
## 2 22 233.73 1 168.4 15.851 0.0006312 ***
## ---
## Signif. codes: 0 ’***’ 0.001 ’**’ 0.01 ’*’ 0.05 ’.’ 0.1 ’ ’ 1

confidence interval for regression coefficients

# compute 95% confidence interval for coefficients in 'linear_model'

confint(Del_lm2,level = 0.95)

## 2.5 % 97.5 %
## (Intercept) 0.066751987 4.61571030
## NumberofCases 1.261824662 1.96998976
## Distance 0.006891745 0.02187791

# compute 95% bonferroni confidence intervals

confint(Del_lm2, level = 1 -0.05/length(coef(Del_lm2)))

## 0.833 % 99.167 %
## (Intercept) -0.500628740 5.1830910
## NumberofCases 1.173496917 2.0583175
## Distance 0.005022556 0.0237471

CI Estimation of the Mean Response

predict(Del_lm2, data.frame(NumberofCases=8, Distance=275), interval = "confidence", level = 0.95)

## fit lwr upr

## 1 19.22432 17.6539 20.79474

7
Prediction interval for the new observation

predict(Del_lm2, data.frame(NumberofCases=8, Distance=275), interval = "prediction", level = 0.95)

## fit lwr upr

## 1 19.22432 12.28456 26.16407

Solution Manual For Microeconometrics
59% (22)
Solution Manual For Microeconometrics
785 pages
Ib Economics Textbook PDF PDF Free
100% (1)
Ib Economics Textbook PDF PDF Free
698 pages
Forecasting APICS
No ratings yet
Forecasting APICS
52 pages
Sample Size Calculations
100% (1)
Sample Size Calculations
5 pages
Multivariate Data Analysis Joseph F. Hair Jr. William C. Black Barry J. Babin Rolph E. Anderson Seventh Edition
0% (1)
Multivariate Data Analysis Joseph F. Hair Jr. William C. Black Barry J. Babin Rolph E. Anderson Seventh Edition
7 pages
Minitab DOE Tutorial PDF
100% (3)
Minitab DOE Tutorial PDF
32 pages
Importance of Service Quality in Customer Satisfaction
No ratings yet
Importance of Service Quality in Customer Satisfaction
12 pages
Project PPT (ASHA V S)
100% (1)
Project PPT (ASHA V S)
18 pages
Tga Manual 2011 10 14
100% (1)
Tga Manual 2011 10 14
280 pages
Forecasting: EM6113-Engineering Management Techniques
No ratings yet
Forecasting: EM6113-Engineering Management Techniques
40 pages
Collins Maydew and Weiss 1997
No ratings yet
Collins Maydew and Weiss 1997
29 pages
5 Errors, Survey Adjustments, and Precision of Observations and Adjus Tments
No ratings yet
5 Errors, Survey Adjustments, and Precision of Observations and Adjus Tments
16 pages
LGT2425 Lecture 3 Part II (Notes)
No ratings yet
LGT2425 Lecture 3 Part II (Notes)
55 pages
Tests For Mean and Proportion
No ratings yet
Tests For Mean and Proportion
34 pages
Metodologi Penelitian - Bab 5
No ratings yet
Metodologi Penelitian - Bab 5
29 pages
FMD PRACTICAL FILE
No ratings yet
FMD PRACTICAL FILE
61 pages
MJC - The STJM Command
No ratings yet
MJC - The STJM Command
38 pages
09 Chapter 4 & 5
No ratings yet
09 Chapter 4 & 5
20 pages
Ewan
No ratings yet
Ewan
144 pages
An Introduction To Regression Analysis
No ratings yet
An Introduction To Regression Analysis
34 pages
Course 1-AI NEP Notes (1) - 2
No ratings yet
Course 1-AI NEP Notes (1) - 2
54 pages
Advanced - Linear Regression
No ratings yet
Advanced - Linear Regression
57 pages
AIJJS Print A Study On Credit Facility Offered by Bank and Its Impact On Manufacturing Industry
No ratings yet
AIJJS Print A Study On Credit Facility Offered by Bank and Its Impact On Manufacturing Industry
10 pages
(ENGDAT2) Exercise 3
No ratings yet
(ENGDAT2) Exercise 3
10 pages
A Study On Employee Attrition: Inevitable Yet Manageable: Dr.B.Latha Lavanya
No ratings yet
A Study On Employee Attrition: Inevitable Yet Manageable: Dr.B.Latha Lavanya
13 pages
Residual Analysis Section 2 RM&a Nonlinear Heteroscadasticity September 2019
No ratings yet
Residual Analysis Section 2 RM&a Nonlinear Heteroscadasticity September 2019
14 pages
Week 5 Multiple Regression: Busa3500 Statistics For Business Ii Piedmont College
No ratings yet
Week 5 Multiple Regression: Busa3500 Statistics For Business Ii Piedmont College
57 pages
H-311 Linear Regression Analysis With R
100% (1)
H-311 Linear Regression Analysis With R
71 pages
Levine Smume7 Bonus Ch08
No ratings yet
Levine Smume7 Bonus Ch08
11 pages
Karakterisitik Responden Berdasarkan Umur Dan Jenis Kelamin: Hasil Analisis Spss
No ratings yet
Karakterisitik Responden Berdasarkan Umur Dan Jenis Kelamin: Hasil Analisis Spss
4 pages
Week 2
No ratings yet
Week 2
66 pages
Simple Regression Model Fitting
No ratings yet
Simple Regression Model Fitting
5 pages
Stepwise Reg
No ratings yet
Stepwise Reg
31 pages
Lecture Note 02 - Annuities
No ratings yet
Lecture Note 02 - Annuities
22 pages
Lecture Note 05 (#)
No ratings yet
Lecture Note 05 (#)
6 pages
Lecture Note 01 (Edited) (#)
No ratings yet
Lecture Note 01 (Edited) (#)
5 pages
1 Regression
No ratings yet
1 Regression
2 pages
05 Linear Regression 2
No ratings yet
05 Linear Regression 2
71 pages
Module01.1 LinearRegression
No ratings yet
Module01.1 LinearRegression
32 pages
Team Nerds Report (Bus485) (Section 5)
No ratings yet
Team Nerds Report (Bus485) (Section 5)
15 pages
MLR Probs
No ratings yet
MLR Probs
45 pages
Stat Modelling Assignment 5
No ratings yet
Stat Modelling Assignment 5
12 pages
Ap Stats Cheat Sheet
No ratings yet
Ap Stats Cheat Sheet
1 page
Multiple Linear Regression
100% (1)
Multiple Linear Regression
14 pages
5.multiple Regression
No ratings yet
5.multiple Regression
17 pages
Probability and Statistics Part 6 Regression
No ratings yet
Probability and Statistics Part 6 Regression
47 pages
Multiple Regression PDF
No ratings yet
Multiple Regression PDF
19 pages
Wic 5 MLR & Anova
No ratings yet
Wic 5 MLR & Anova
10 pages
Regression (Class 16-17)
No ratings yet
Regression (Class 16-17)
39 pages
NguyenChiA6 Memo
No ratings yet
NguyenChiA6 Memo
4 pages
Demand Management Plan Wild Dog Coffee Company
No ratings yet
Demand Management Plan Wild Dog Coffee Company
16 pages
Final Exam Excel Statistics
No ratings yet
Final Exam Excel Statistics
33 pages
Econometrics CRT M2: Regression Model Evaluation
No ratings yet
Econometrics CRT M2: Regression Model Evaluation
7 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
11 pages
Lab-3: Regression Analysis and Modeling Name: Uid No. Objective
No ratings yet
Lab-3: Regression Analysis and Modeling Name: Uid No. Objective
9 pages
Regression Illustrations PDF
No ratings yet
Regression Illustrations PDF
5 pages
Study On Employee Engagement - With Reference To Middle and Junior Level Management Employees at Manufacturing Industry, Chennai, Tamilnadu
No ratings yet
Study On Employee Engagement - With Reference To Middle and Junior Level Management Employees at Manufacturing Industry, Chennai, Tamilnadu
54 pages
Chapter 3 MLR
No ratings yet
Chapter 3 MLR
40 pages
Module01 LinearRegression
No ratings yet
Module01 LinearRegression
41 pages
Topic 7-Regression Analysis
No ratings yet
Topic 7-Regression Analysis
56 pages
Lecture 01
No ratings yet
Lecture 01
41 pages
Lecture 2 Vectors
No ratings yet
Lecture 2 Vectors
43 pages
Chap01-3 (Autosaved)
No ratings yet
Chap01-3 (Autosaved)
51 pages
Day 6 Session 2 MLR
No ratings yet
Day 6 Session 2 MLR
16 pages
Lecture Note 08 - Root Finding
No ratings yet
Lecture Note 08 - Root Finding
30 pages
Lecsson 03 - Matrix - Plotting - 2
No ratings yet
Lecsson 03 - Matrix - Plotting - 2
23 pages
Lab-5-1-Regression and Multiple Regression
100% (2)
Lab-5-1-Regression and Multiple Regression
8 pages
MS Excel Linear & Multiple Regression
No ratings yet
MS Excel Linear & Multiple Regression
8 pages
Lecture Note - 2023
No ratings yet
Lecture Note - 2023
25 pages
Predictive Modeling-Handouts
No ratings yet
Predictive Modeling-Handouts
11 pages
MS - Excel - Linear - & - Multiple - Regression Office 2007
No ratings yet
MS - Excel - Linear - & - Multiple - Regression Office 2007
7 pages
Note 18
No ratings yet
Note 18
15 pages
Isye4031 Regression and Forecasting Practice Problems 2 Fall 2014
No ratings yet
Isye4031 Regression and Forecasting Practice Problems 2 Fall 2014
5 pages
Jurnal Akurasi Uav Dem Tampa GCP
No ratings yet
Jurnal Akurasi Uav Dem Tampa GCP
28 pages
Lab 5
No ratings yet
Lab 5
6 pages
Arun 27072021 Predictive Modeling PDF
No ratings yet
Arun 27072021 Predictive Modeling PDF
33 pages
Note 23
No ratings yet
Note 23
30 pages
Abhinav Vijay - 55
No ratings yet
Abhinav Vijay - 55
4 pages
Shiohama AlexandrovSpaces
No ratings yet
Shiohama AlexandrovSpaces
73 pages
Modelling Survival Data in Medical Research, 4th Edition Latest Edition Download
95% (20)
Modelling Survival Data in Medical Research, 4th Edition Latest Edition Download
16 pages
Shelf-Life FDA Ovais
100% (2)
Shelf-Life FDA Ovais
8 pages
11 Bda
No ratings yet
11 Bda
25 pages
S Al Com Mat Modelpaper2014
No ratings yet
S Al Com Mat Modelpaper2014
16 pages
W7 - CH13 - Practice Questions For Regression Analysis
No ratings yet
W7 - CH13 - Practice Questions For Regression Analysis
3 pages
Lecture - 06 - Processor Design
No ratings yet
Lecture - 06 - Processor Design
32 pages
Notes 610
No ratings yet
Notes 610
209 pages
Regression Analysis Notes
No ratings yet
Regression Analysis Notes
6 pages
Lecture - 09 - Introduction To Threads
No ratings yet
Lecture - 09 - Introduction To Threads
20 pages
Lecture - 05 - Memory Organization
No ratings yet
Lecture - 05 - Memory Organization
18 pages
Linear Model
No ratings yet
Linear Model
10 pages
Uji Regresi - Dewi Matius
No ratings yet
Uji Regresi - Dewi Matius
5 pages
Chapter 3 Multivariate Linear Regression
No ratings yet
Chapter 3 Multivariate Linear Regression
16 pages
Demand Estimation Worksheet
No ratings yet
Demand Estimation Worksheet
13 pages
Central Province Combined Maths 2020 Last Term Test
No ratings yet
Central Province Combined Maths 2020 Last Term Test
17 pages
Unit 3
No ratings yet
Unit 3
24 pages
Unit5 R
No ratings yet
Unit5 R
5 pages
A2 Copy 2
No ratings yet
A2 Copy 2
8 pages
Travaux Pratiques Science de Données TP3
No ratings yet
Travaux Pratiques Science de Données TP3
5 pages
Regression Modelli NG Assignment
No ratings yet
Regression Modelli NG Assignment
3 pages
LINEAR MODELS Cheatsheet
No ratings yet
LINEAR MODELS Cheatsheet
14 pages
DMV Unit 3 PPT - RSK - 250419 - 125620 Jfhuehiwhu
No ratings yet
DMV Unit 3 PPT - RSK - 250419 - 125620 Jfhuehiwhu
89 pages
Formula Sheet
No ratings yet
Formula Sheet
2 pages
Business Analytics C-2
No ratings yet
Business Analytics C-2
7 pages
07 Multiple Regression Analysis PDF
No ratings yet
07 Multiple Regression Analysis PDF
26 pages
Lecture 10
No ratings yet
Lecture 10
5 pages
Basic Mathematics. Explained Easy | For Beginners
From Everand
Basic Mathematics. Explained Easy | For Beginners
ExaGrecation
No ratings yet
A Friendly Introduction to MATLAB Programming
From Everand
A Friendly Introduction to MATLAB Programming
Orhan Gazi
No ratings yet
Instruction for Using a Slide Rule
From Everand
Instruction for Using a Slide Rule
W. Stanley
No ratings yet

Deliverytime 3

Uploaded by

Deliverytime 3

Uploaded by

Multiple Linera Regression on Delivery Time data

Delivery Time Data

## Time NumberofCases Distance

## Time NumberofCases Distance

sp<-scatterplot3d(DeliveryTime$NumberofCases,DeliveryTime$Distance,DeliveryTime$Time,xlab = "No. of Case

sp$plane3d(Del_lm2, lty.box = "solid")#,

## [1] "Time" "NumberofCases" "Distance"

To get the anova table,

## Analysis of Variance Table

extra - sum - of - squares method.

## Analysis of Variance Table

## Analysis of Variance Table

confidence interval for regression coefficients

# compute 95% confidence interval for coefficients in 'linear_model'

# compute 95% bonferroni confidence intervals

CI Estimation of the Mean Response

predict(Del_lm2, data.frame(NumberofCases=8, Distance=275), interval = "confidence", level = 0.95)

## fit lwr upr

predict(Del_lm2, data.frame(NumberofCases=8, Distance=275), interval = "prediction", level = 0.95)

## fit lwr upr

You might also like