
ETC3550/ETC5550

Applied forecasting

Ch7. Regression models


OTexts.org/fpp3/
Outline

1 The linear model with time series
2 Some useful predictors for linear models
3 Residual diagnostics
4 Selecting predictors and forecast evaluation
5 Forecasting with regression
6 Matrix formulation
7 Correlation, causation and forecasting
1 The linear model with time series
Multiple regression and forecasting

yt = β0 + β1 x1,t + β2 x2,t + · · · + βk xk,t + εt

yt is the variable we want to predict: the “response” variable.
Each xj,t is numerical and is called a “predictor”. They are usually assumed to be known for all past and future times.
The coefficients β1, . . . , βk measure the effect of each predictor after taking account of the effect of all other predictors in the model. That is, the coefficients measure the marginal effects.
εt is a white noise error term.
Example: US consumption expenditure

[Quarterly time plots of changes in US Consumption, Income, Production, Savings and Unemployment.]
Example: US consumption expenditure
[Scatterplot matrix of the five series. Pairwise correlations:]

              Income      Production   Savings      Unemployment
Consumption   0.384***    0.529***     −0.257***    −0.527***
Income                    0.269***     0.720***     −0.224**
Production                             −0.059       −0.768***
Savings                                              0.106
Example: US consumption expenditure
fit_consMR <- us_change %>%
  model(lm = TSLM(Consumption ~ Income + Production + Unemployment + Savings))
report(fit_consMR)

## Series: Consumption
## Model: TSLM
##
## Residuals:
## Min 1Q Median 3Q Max
## -0.906 -0.158 -0.036 0.136 1.155
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 0.25311 0.03447 7.34 5.7e-12 ***
## Income 0.74058 0.04012 18.46 < 2e-16 ***
## Production 0.04717 0.02314 2.04 0.043 *
## Unemployment -0.17469 0.09551 -1.83 0.069 .
## Savings -0.05289 0.00292 -18.09 < 2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 0.31 on 193 degrees of freedom
## Multiple R-squared: 0.768, Adjusted R-squared: 0.763
Example: US consumption expenditure

[Time plot of actual (Data) and fitted values of the percent change in US consumption expenditure.]
Example: US consumption expenditure

[Scatterplot of fitted (predicted) values against actual values of the percentage change in US consumption expenditure.]
Example: US consumption expenditure

fit_consMR %>% gg_tsresiduals()


[Residual diagnostic plots: time plot of the innovation residuals, ACF of the residuals, and histogram of the residuals.]
2 Some useful predictors for linear models
Trend

Linear trend
xt = t,   t = 1, 2, . . . , T
Strong assumption that trend will continue.
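A minimal sketch of fitting a linear trend (hypothetical tsibble my_ts with a numeric column y); the trend() special generates the predictor xt = t:

my_ts %>%
  model(TSLM(y ~ trend()))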
Nonlinear trend

Piecewise linear trend with bend at τ

$$x_{1,t} = t, \qquad x_{2,t} = \begin{cases} 0 & t < \tau \\ t - \tau & t \ge \tau \end{cases}$$

Quadratic or higher order trend

$$x_{1,t} = t, \quad x_{2,t} = t^2, \quad \dots$$

NOT RECOMMENDED!
Dummy variables
If a categorical variable takes
only two values (e.g., ‘Yes’ or
‘No’), then an equivalent
numerical variable can be
constructed taking value 1 if
yes and 0 if no. This is called
a dummy variable.

Dummy variables
If there are more than two
categories, then the variable
can be coded using several
dummy variables (one fewer
than the total number of
categories).

Beware of the dummy variable trap!

Using one dummy for each category gives too many dummy
variables!
The regression will then be singular and inestimable.
Either omit the constant, or omit the dummy for one category.
The coefficients of the dummies are relative to the omitted
category.

Uses of dummy variables

Seasonal dummies
For quarterly data: use 3 dummies
For monthly data: use 11 dummies
For daily data: use 6 dummies
What to do with weekly data?

Outliers
If there is an outlier, you can use a dummy variable to remove its effect.

Public holidays
For daily data: if it is a public holiday, dummy=1, otherwise dummy=0 (see the sketch below).
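A minimal sketch of a public-holiday dummy, assuming a hypothetical daily tsibble daily_sales (index Date, response Sales) and a hypothetical vector of holiday dates holiday_dates:

daily_sales %>%
  mutate(Holiday = as.integer(Date %in% holiday_dates)) %>%  # dummy = 1 on a public holiday, 0 otherwise
  model(TSLM(Sales ~ trend() + season() + Holiday))          # season() adds the seasonal dummies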
Beer production revisited

[Time plot of Australian quarterly beer production (megalitres).]

Regression model
yt = β0 + β1 t + β2 d2,t + β3 d3,t + β4 d4,t + εt
di,t = 1 if t is quarter i and 0 otherwise.
Beer production revisited
fit_beer <- recent_production %>% model(TSLM(Beer ~ trend() + season()))
report(fit_beer)

## Series: Beer
## Model: TSLM
##
## Residuals:
## Min 1Q Median 3Q Max
## -42.9 -7.6 -0.5 8.0 21.8
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 441.8004 3.7335 118.33 < 2e-16 ***
## trend() -0.3403 0.0666 -5.11 2.7e-06 ***
## season()year2 -34.6597 3.9683 -8.73 9.1e-13 ***
## season()year3 -17.8216 4.0225 -4.43 3.4e-05 ***
## season()year4 72.7964 4.0230 18.09 < 2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 12.2 on 69 degrees of freedom
Beer production revisited
augment(fit_beer) %>%
  ggplot(aes(x = Quarter)) +
  geom_line(aes(y = Beer, colour = "Data")) +
  geom_line(aes(y = .fitted, colour = "Fitted")) +
  labs(y = "Megalitres", title = "Australian quarterly beer production") +
  scale_colour_manual(values = c(Data = "black", Fitted = "#D55E00"))

[Time plot of actual (Data) and fitted values of Australian quarterly beer production.]
Beer production revisited
augment(fit_beer) %>%
  ggplot(aes(x = Beer, y = .fitted, colour = factor(quarter(Quarter)))) +
  geom_point() +
  labs(y = "Fitted", x = "Actual values", title = "Quarterly beer production") +
  scale_colour_brewer(palette = "Dark2", name = "Quarter") +
  geom_abline(intercept = 0, slope = 1)

[Scatterplot of fitted against actual quarterly beer production, coloured by quarter, with a 45° reference line.]
Beer production revisited

fit_beer %>% gg_tsresiduals()


[Residual diagnostic plots for the beer regression: time plot of the innovation residuals, ACF of the residuals, and histogram of the residuals.]
Beer production revisited

fit_beer %>% forecast %>% autoplot(recent_production)

[Forecasts of Australian quarterly beer production from the regression model, with 80% and 95% prediction intervals.]
Fourier series

Periodic seasonality can be handled using pairs of Fourier terms:

$$s_k(t) = \sin\!\left(\frac{2\pi k t}{m}\right), \qquad c_k(t) = \cos\!\left(\frac{2\pi k t}{m}\right)$$

$$y_t = a + bt + \sum_{k=1}^{K} \left[\alpha_k s_k(t) + \beta_k c_k(t)\right] + \varepsilon_t$$

Every periodic function can be approximated by sums of sin and cos terms for large enough K.
Choose K by minimizing the AICc.
Called “harmonic regression”.
TSLM(y ~ trend() + fourier(K))
Harmonic regression: beer production
fourier_beer <- recent_production %>% model(TSLM(Beer ~ trend() + fourier(K=2)))
report(fourier_beer)

## Series: Beer
## Model: TSLM
##
## Residuals:
## Min 1Q Median 3Q Max
## -42.9 -7.6 -0.5 8.0 21.8
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 446.8792 2.8732 155.53 < 2e-16 ***
## trend() -0.3403 0.0666 -5.11 2.7e-06 ***
## fourier(K = 2)C1_4 8.9108 2.0112 4.43 3.4e-05 ***
## fourier(K = 2)S1_4 -53.7281 2.0112 -26.71 < 2e-16 ***
## fourier(K = 2)C2_4 -13.9896 1.4226 -9.83 9.3e-15 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 12.2 on 69 degrees of freedom
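Note: for quarterly data (m = 4), the K = 2 Fourier terms are sin(2πt/4), cos(2πt/4) and cos(πt); the term sin(πt) is identically zero for integer t and is dropped, which is why only three Fourier coefficients appear above. These three terms span the same space as the three quarterly dummies, so the fit is identical to the dummy-variable model (same residual quantiles and residual standard error of 12.2).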
Harmonic regression: eating-out expenditure
aus_cafe <- aus_retail %>%
  filter(
    Industry == "Cafes, restaurants and takeaway food services",
    year(Month) %in% 2004:2018
  ) %>%
  summarise(Turnover = sum(Turnover))
aus_cafe %>% autoplot(Turnover)

[Monthly time plot of total turnover of Australian cafes, restaurants and takeaway food services, 2004–2018.]
Harmonic regression: eating-out expenditure
fit <- aus_cafe %>%
  model(
    K1 = TSLM(log(Turnover) ~ trend() + fourier(K = 1)),
    K2 = TSLM(log(Turnover) ~ trend() + fourier(K = 2)),
    K3 = TSLM(log(Turnover) ~ trend() + fourier(K = 3)),
    K4 = TSLM(log(Turnover) ~ trend() + fourier(K = 4)),
    K5 = TSLM(log(Turnover) ~ trend() + fourier(K = 5)),
    K6 = TSLM(log(Turnover) ~ trend() + fourier(K = 6))
  )
glance(fit) %>% select(.model, r_squared, adj_r_squared, AICc)

## # A tibble: 6 x 4
## .model r_squared adj_r_squared AICc
## <chr> <dbl> <dbl> <dbl>
## 1 K1 0.962 0.962 -1085.
## 2 K2 0.966 0.965 -1099.
## 3 K3 0.976 0.975 -1160.
## 4 K4 0.980 0.979 -1183.
## 5 K5 0.985 0.984 -1234.
## 6 K6 0.985 0.984 -1232.
Harmonic regression: eating-out expenditure

[Forecast plots of the log-transformed TSLM with trend() + fourier(K) for K = 1, . . . , 6, each with 80% and 95% prediction intervals. AICc values: −1085 (K = 1), −1099 (K = 2), −1160 (K = 3), −1183 (K = 4), −1234 (K = 5), −1232 (K = 6).]
Intervention variables

Spikes
Equivalent to a dummy variable for handling an outlier.

Steps
Variable takes value 0 before the intervention and 1 afterwards.

Change of slope
Variables take values 0 before the intervention and values {1, 2, 3, . . . } afterwards.
Holidays

For monthly data


Christmas: always in December, so part of the monthly seasonal effect.
Easter: use a dummy variable vt = 1 if any part of Easter is in that month, vt = 0 otherwise.
Ramadan and Chinese New Year similar.
Distributed lags

Lagged values of a predictor.


Example: x is advertising which has a delayed effect

x1 = advertising for previous month;


x2 = advertising for two months previously;
...
xm = advertising for m months previously.
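A minimal sketch of constructing distributed-lag predictors, assuming a hypothetical monthly tsibble sales_data with columns Sales and Adverts:

sales_data %>%
  mutate(
    Adverts_lag1 = lag(Adverts, 1),   # advertising one month earlier
    Adverts_lag2 = lag(Adverts, 2),   # advertising two months earlier
    Adverts_lag3 = lag(Adverts, 3)
  ) %>%
  filter(!is.na(Adverts_lag3)) %>%    # drop the leading rows with missing lags
  model(TSLM(Sales ~ Adverts + Adverts_lag1 + Adverts_lag2 + Adverts_lag3))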
Example: Boston marathon winning times
marathon <- boston_marathon %>%
  filter(Event == "Men's open division") %>%
  select(-Event) %>%
  mutate(Minutes = as.numeric(Time) / 60)
marathon %>% autoplot(Minutes) + labs(y = "Winning times in minutes")

[Annual time plot of Boston marathon winning times in minutes.]
Example: Boston marathon winning times
fit_trends <- marathon %>%
  model(
    # Linear trend
    linear = TSLM(Minutes ~ trend()),
    # Exponential trend
    exponential = TSLM(log(Minutes) ~ trend()),
    # Piecewise linear trend
    piecewise = TSLM(Minutes ~ trend(knots = c(1940, 1980)))
  )

fit_trends

## # A mable: 1 x 3
## linear exponential piecewise
## <model> <model> <model>
## 1 <TSLM> <TSLM> <TSLM>
Example: Boston marathon winning times
fit_trends %>% forecast(h=10) %>% autoplot(marathon)

[Forecasts of Boston marathon winning times from the linear, exponential and piecewise trend models, with 95% prediction intervals.]
Example: Boston marathon winning times
fit_trends %>%
  select(piecewise) %>%
  gg_tsresiduals()

[Residual diagnostic plots for the piecewise trend model: time plot of the innovation residuals, ACF of the residuals, and histogram of the residuals.]
3 Residual diagnostics
Multiple regression and forecasting

For forecasting purposes, we require the following assumptions:

εt are uncorrelated and zero mean
εt are uncorrelated with each xj,t.

It is useful to also have εt ∼ N(0, σ²) when producing prediction intervals or doing statistical tests.
Residual plots

Useful for spotting outliers and whether the linear model was appropriate.
Scatterplot of residuals εt against each predictor xj,t.
Scatterplot of residuals against the fitted values ŷt.
Expect to see scatterplots resembling a horizontal band with no values too far from the band and no patterns such as curvature or increasing spread.
Residual patterns

If a plot of the residuals vs any predictor in the model shows a pattern, then the relationship is nonlinear.
If a plot of the residuals vs any predictor not in the model shows a pattern, then the predictor should be added to the model.
If a plot of the residuals vs fitted values shows a pattern, then there is heteroscedasticity in the errors. (Could try a transformation.)
A sketch of these residual plots follows below.
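A minimal sketch of these plots for the US consumption model fitted earlier (fit_consMR): residuals against fitted values, and residuals against one predictor (Income):

augment(fit_consMR) %>%
  ggplot(aes(x = .fitted, y = .resid)) +
  geom_point() +
  labs(x = "Fitted values", y = "Residuals")

us_change %>%
  left_join(residuals(fit_consMR), by = "Quarter") %>%
  ggplot(aes(x = Income, y = .resid)) +
  geom_point() +
  labs(y = "Residuals")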
4 Selecting predictors and forecast evaluation
Comparing regression models

Computer output for regression will always give the R² value. This is a useful summary of the model.
It is equal to the square of the correlation between y and ŷ.
It is often called the “coefficient of determination”.
It can also be calculated as follows:

$$R^2 = \frac{\sum (\hat{y}_t - \bar{y})^2}{\sum (y_t - \bar{y})^2}$$

It is the proportion of variance accounted for (explained) by the predictors.
Comparing regression models
However . . .
R² does not allow for “degrees of freedom”.
Adding any variable tends to increase the value of R², even if that variable is irrelevant.

To overcome this problem, we can use adjusted R²:

$$\bar{R}^2 = 1 - (1 - R^2)\frac{T-1}{T-k-1}$$

where k = no. of predictors and T = no. of observations.

Maximizing R̄² is equivalent to minimizing σ̂²:

$$\hat{\sigma}^2 = \frac{1}{T-k-1}\sum_{t=1}^{T} \varepsilon_t^2$$
Akaike’s Information Criterion

AIC = −2 log(L) + 2(k + 2)

where L is the likelihood and k is the number of predictors in the model.

AIC penalizes terms more heavily than R̄².
Minimizing the AIC is asymptotically equivalent to minimizing MSE via leave-one-out cross-validation (for any linear regression).
Corrected AIC

For small values of T, the AIC tends to select too many predictors, and so a bias-corrected version of the AIC has been developed:

$$\text{AIC}_{\text{c}} = \text{AIC} + \frac{2(k+2)(k+3)}{T-k-3}$$

As with the AIC, the AICc should be minimized.
Bayesian Information Criterion

BIC = −2 log(L) + (k + 2) log(T)

where L is the likelihood and k is the number of predictors in the model.

BIC penalizes terms more heavily than AIC.
Also called SBIC and SC.
Minimizing BIC is asymptotically equivalent to leave-v-out cross-validation when v = T[1 − 1/(log(T) − 1)].
Leave-one-out cross-validation

For regression, leave-one-out cross-validation is faster and more efficient than time-series cross-validation.
Select one observation for the test set, and use the remaining observations in the training set. Compute the error on the test observation.
Repeat using each possible observation as the test set.
Compute the accuracy measure over all errors.
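For a linear regression, the CV statistic does not require refitting the model T times; assuming the standard hat-matrix shortcut, it can be computed from a single fit as

$$\text{CV} = \frac{1}{T}\sum_{t=1}^{T}\left[\frac{e_t}{1-h_t}\right]^2,$$

where e_t is the residual from the full fit and h_t is the t-th diagonal element of the hat matrix H = X(X′X)⁻¹X′.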
Cross-validation

Traditional evaluation
[Diagram: a single split of the series into training data followed by test data.]

Time series cross-validation
[Diagram: a sequence of growing training sets, each followed by a single test observation (h = 1).]

Leave-one-out cross-validation
[Diagram: each observation in turn is held out as the test set (h = 1), with all remaining observations used for training.]

CV = MSE on test sets.
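The measures above (adjusted R², CV, AIC, AICc, BIC) can be extracted in one step from a fitted TSLM; a minimal sketch using the US consumption model fitted earlier:

glance(fit_consMR) %>%
  select(adj_r_squared, CV, AIC, AICc, BIC)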
Choosing regression variables

Best subsets regression
Fit all possible regression models using one or more of the predictors.
Choose the best model based on one of the measures of predictive ability (CV, AIC, AICc).

Warning!
If there are a large number of predictors, this is not possible.
For example, 44 predictors leads to 18 trillion possible models!
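Checking the arithmetic: with 44 predictors there are 2^44 − 1 = 17,592,186,044,415 ≈ 1.8 × 10^13 models using at least one predictor, i.e. roughly 18 trillion.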
Choosing regression variables

Backwards stepwise regression
Start with a model containing all variables.
Try subtracting one variable at a time. Keep the model if it has lower CV or AICc.
Iterate until no further improvement.

Notes
Stepwise regression is not guaranteed to lead to the best possible model.
Inference on coefficients of final model will be wrong.
5 Forecasting with regression
Ex-ante versus ex-post forecasts

Ex ante forecasts are made using only information available in advance.
  → require forecasts of predictors.
Ex post forecasts are made using later information on the predictors.
  → useful for studying behaviour of forecasting models.
Trend, seasonal and calendar variables are all known in advance, so these don’t need to be forecast.
Scenario based forecasting

Assumes possible scenarios for the predictor variables


Prediction intervals for scenario based forecasts do not include
the uncertainty associated with the future values of the
predictor variables.

Building a predictive regression model

If getting forecasts of predictors is difficult, you can use lagged predictors instead:
yt = β0 + β1 x1,t−h + · · · + βk xk,t−h + εt
A different model for each forecast horizon h (a sketch follows below).
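A minimal sketch for h = 4 quarters ahead, using the us_change data from earlier with Income and Savings as example predictors, each lagged by the forecast horizon:

us_change %>%
  mutate(
    Income_lag4  = lag(Income, 4),
    Savings_lag4 = lag(Savings, 4)
  ) %>%
  filter(!is.na(Income_lag4)) %>%                        # drop the first h rows with missing lags
  model(TSLM(Consumption ~ Income_lag4 + Savings_lag4))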
US Consumption

fit_consBest <- us_change %>%
  model(
    TSLM(Consumption ~ Income + Savings + Unemployment)
  )

future_scenarios <- scenarios(
  Increase = new_data(us_change, 4) %>%
    mutate(Income = 1, Savings = 0.5, Unemployment = 0),
  Decrease = new_data(us_change, 4) %>%
    mutate(Income = -1, Savings = -0.5, Unemployment = 0),
  names_to = "Scenario"
)

fc <- forecast(fit_consBest, new_data = future_scenarios)
US Consumption
us_change %>% autoplot(Consumption) +
  labs(y = "% change in US consumption") +
  autolayer(fc) +
  labs(title = "US consumption", y = "% change")

[Time plot of the percentage change in US consumption with forecasts under the Increase and Decrease scenarios, showing 80% and 95% prediction intervals.]
6 Matrix formulation
Matrix formulation

yt = β0 + β1 x1,t + β2 x2,t + · · · + βk xk,t + εt

Let y = (y1, . . . , yT)′, ε = (ε1, . . . , εT)′, β = (β0, β1, . . . , βk)′ and

$$X = \begin{bmatrix}
1 & x_{1,1} & x_{2,1} & \dots & x_{k,1} \\
1 & x_{1,2} & x_{2,2} & \dots & x_{k,2} \\
\vdots & \vdots & \vdots & & \vdots \\
1 & x_{1,T} & x_{2,T} & \dots & x_{k,T}
\end{bmatrix}.$$

Then
y = Xβ + ε.
Matrix formulation

Least squares estimation

Minimize: (y − Xβ)′(y − Xβ)

Differentiating with respect to β gives

$$\hat{\beta} = (X'X)^{-1}X'y$$

(The “normal equation”.)

$$\hat{\sigma}^2 = \frac{1}{T-k-1}(y - X\hat{\beta})'(y - X\hat{\beta})$$

Note: If you fall for the dummy variable trap, (X′X) is a singular matrix.
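A minimal sketch of the normal equation in base R, assuming a hypothetical data frame df with response y and predictors x1 and x2:

X <- model.matrix(~ x1 + x2, data = df)     # design matrix with an intercept column
y <- df$y
beta_hat <- solve(t(X) %*% X, t(X) %*% y)   # (X'X)^{-1} X'y
cbind(beta_hat, coef(lm(y ~ x1 + x2, data = df)))   # agrees with lm()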
Likelihood

If the errors are iid and normally distributed, then
y ∼ N(Xβ, σ²I).

So the likelihood is

$$L = \frac{1}{\sigma^T(2\pi)^{T/2}}\exp\left(-\frac{1}{2\sigma^2}(y - X\beta)'(y - X\beta)\right)$$

which is maximized when (y − Xβ)′(y − Xβ) is minimized.

So MLE = OLS.
Multiple regression forecasts

Optimal forecasts

$$\hat{y}^* = \text{E}(y^* \mid y, X, x^*) = x^*\hat{\beta} = x^*(X'X)^{-1}X'y$$

where x* is a row vector containing the values of the predictors for the forecasts (in the same format as X).

Forecast variance

$$\text{Var}(y^* \mid X, x^*) = \sigma^2\left[1 + x^*(X'X)^{-1}(x^*)'\right]$$

This ignores any errors in x*.
95% prediction intervals assuming normal errors:

$$\hat{y}^* \pm 1.96\sqrt{\text{Var}(y^* \mid X, x^*)}$$

A sketch of these calculations follows below.
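Continuing the base R sketch from the matrix formulation (hypothetical X, y and beta_hat), a point forecast and approximate 95% prediction interval for a new row of predictor values xstar:

xstar  <- c(1, 0.5, 2)                       # hypothetical values: intercept, x1 = 0.5, x2 = 2
sigma2 <- sum((y - X %*% beta_hat)^2) / (nrow(X) - ncol(X))   # sigma-hat^2 with T - k - 1 denominator
yhat   <- drop(xstar %*% beta_hat)                            # point forecast x* beta-hat
v      <- sigma2 * (1 + drop(xstar %*% solve(t(X) %*% X) %*% xstar))   # forecast variance
yhat + c(-1.96, 1.96) * sqrt(v)              # ignores any error in xstar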
7 Correlation, causation and forecasting
Correlation is not causation

When x is useful for predicting y, it is not necessarily causing y.


e.g., predict number of drownings y using number of ice-creams
sold x.
Correlations are useful for forecasting, even when there is no
causality.
Better models usually involve causal relationships (e.g.,
temperature x and people z to predict drownings y).

Multicollinearity

In regression analysis, multicollinearity occurs when:


Two predictors are highly correlated (i.e., the correlation
between them is close to ±1).
A linear combination of some of the predictors is highly
correlated with another predictor.
A linear combination of one subset of predictors is highly
correlated with a linear combination of another subset of
predictors.

Multicollinearity

If multicollinearity exists. . .
the numerical estimates of coefficients may be wrong (worse in
Excel than in a statistics package)
don’t rely on the p-values to determine significance.
there is no problem with model predictions provided the
predictors used for forecasting are within the range used for
fitting.
omitting variables can help.
combining variables can help.