Reading 2 Time-Series Analysis - Answers
Reading 2 Time-Series Analysis - Answers
An analyst wants to model quarterly sales data using an autoregressive model. She has
found that an AR(1) model with a seasonal lag has significant slope coefficients. She also
finds that when a second and third seasonal lag are added to the model, all slope
coefficients are significant too. Based on this, the best model to use would most likely be an:
A) ARCH(1).
B) AR(1) model with no seasonal lags.
C) AR(1) model with 3 seasonal lags.
Explanation
She has found that all the slope coefficients are significant in the model xt = b0 + b1xt– 1 +
b2xt– 4 + et. She then finds that all the slope coefficients are significant in the model xt = b0
+ b1xt– 1 + b2xt– 2 + b3xt– 3 + b4xt– 4 + et. Thus, the final model should be used rather than
any other model that uses a subset of the regressors.
A) AR(2).
B) AR(1).
C) AR(12).
Explanation
The b1xt − 1 and b2xt − 2 lag terms make this an autoregressive model of order p = 2 with a
seasonal lag. The b3xt −12 term is a seasonal term which does not transform the model to
AR(12).
can be used to test for a unit root, which exists if the slope coefficient equals
A)
one.
B) cannot be used to test for a unit root.
can be used to test for a unit root, which exists if the slope coefficient is less
C)
than one.
Explanation
If you estimate the following model xt = b0 + b1 × xt-1 + et and get b1 = 1, then the process
has a unit root and is nonstationary.
Consider the estimated AR(2) model, xt = 2.5 + 3.0 xt-1 + 1.5 xt-2 + εt t=1,2,...50. Making a
prediction for values of x for 1 ≤ t ≤ 50 is referred to as:
A) an in-sample forecast.
B) an out-of-sample forecast.
C) requires more information to answer the question.
Explanation
An in-sample (a.k.a. within-sample) forecast is made within the bounds of the data used to
estimate the model. An out-of-sample forecast is for values of the independent variable
that are outside of those used to estimate the model.
Explanation
This is an autoregressive model (i.e., lagged dependent variable as independent variables)
of order p=4 (that is, 4 lags).
test autocorrelations of the residuals for a simple trend model, and specify the
A)
number of significant lags.
estimate an autoregressive model (for example, an AR(1) model), calculate the
B) autocorrelations for the model's residuals, test whether the autocorrelations
are different from zero, and add an AR lag for each significant autocorrelation.
estimate an autoregressive model (e.g., an AR(1) model), calculate the
autocorrelations for the model's residuals, test whether the autocorrelations
C)
are different from zero, and revise the model if there are significant
autocorrelations.
Explanation
The procedure is iterative: continually test for autocorrelations in the residuals and stop
adding lags when the autocorrelations of the residuals are eliminated. Even if several of
the residuals exhibit autocorrelation, the lags should be added one at a time.
Which of the following is least likely a consequence of a model containing ARCH(1) errors?
The:
Explanation
The presence of autoregressive conditional heteroskedasticity (ARCH) indicates that the
variance of the error terms is not constant. This is a violation of the regression
assumptions upon which time series models are based. The addition of another lag
variable to a model is not a means for correcting for ARCH (1) errors.
The primary concern when deciding upon a time series sample period is which of the
following factors?
Explanation
There will always be a tradeoff between the increase statistical reliability of a longer time
period and the increased stability of estimated regression coefficients with shorter time
periods. Therefore, the underlying economic environment should be the deciding factor
when selecting a time series sample period.
Autoregressive Model
Regression Statistics
R-squared 0.767
Observations 64
Autocorrelation of Residuals
Quarter Observation
Explanation
None of these are statically significant, so we can conclude that there is no evidence of
autocorrelation in the residuals, and therefore the AR model is properly specified.
What is the forecast for the gross margin in the first quarter of 2004?
A) 0.246.
B) 0.250.
C) 0.256.
Explanation
The forecast for the following quarter is 0.155 + 0.240(0.240) + 0.168(0.260) = 0.256.
Explanation
Supposing the time series is actually a random walk, which of the following approaches
would be appropriate prior to using an autoregressive model?
Explanation
First differencing often transforms a random walk into a covariance stationary time series
which can then be fitted using autoregressive models. ARCH is a type of AR model where
the residuals exhibit conditional heteroskedasticity and is not an approach to convert a
random walk into a covariance stationary time series. Taking natural log is recommended
for a time series with an exponential growth prior to fitting a trend model.
Explanation
The slope is the change in the dependent variable per unit of time. The intercept is the
estimate of the value of the dependent variable before the time series begins. The
disturbance term should be independent and identically distributed. There is no reason to
expect the disturbance term to be mean-reverting, and if the residuals are autocorrelated,
the research should correct for that problem.
B) (Salest - Sales t-1)= b0 + b1 (Sales t-1 - Sales t-2) + b2 (Sales t-4 - Sales t-5) + εt.
Explanation
Trend models can be useful tools in the evaluation of a time series of data. However, there
are limitations to their usage. Trend models are not appropriate when which of the following
violations of the linear regression assumptions is present?
A) Model misspecification.
B) Serial correlation.
C) Heteroskedasticity.
Explanation
One of the primary assumptions of linear regression is that the residual terms are not
correlated with each other. If serial correlation, also called autocorrelation, is present,
then trend models are not an appropriate analysis tool.
A time series that has a unit root can be transformed into a time series without a unit root
through:
A) first differencing.
B) mean reversion.
C) calculating moving average of the residuals.
Explanation
First differencing a series that has a unit root creates a time series that does not have a
unit root.
A time series x that is a random walk with a drift is best described as:
A) xt = b0 + b1 xt − 1.
B) xt = xt − 1 + εt.
C) xt = b0 + b1xt − 1 + εt.
Explanation
The best estimate of random walk for period t is the value of the series at (t − 1). If the
random walk has a drift component, this drift is added to the previous period's value of
the time series to produce the forecast.
A) revise the model to include at least another lag of the dependent variable.
B) switch models to a moving average model.
C) alter the model to an ARCH model.
Explanation
She should estimate an AR(4) model, and then re-examine the autocorrelations of the
residuals.
A) Yes, because the intercept and the lag coefficient are significant.
B) Yes, because most of the residual autocorrelations are negative.
C) No, because several of the residual autocorrelations are significant.
Explanation
At a 5% level of significance, the critical t-value is 1.98. Since the absolute values of several
of the residual autocorrelation's t-statistics exceed 1.98, it can be concluded that
significant serial correlation exists and the model should be respecified. The next logical
step is to estimate an AR(2) model, then test the associated residuals for autocorrelation.
If no serial correlation is detected, seasonality and ARCH behavior should be tested.
mean-reverting model to analyze the data because the time series pattern is
A)
covariance stationary.
B) linear model to analyze the data because the mean appears to be constant.
log-linear model to analyze the data because it is likely to exhibit a compound
C)
growth trend.
Explanation
A log-linear model is more appropriate when analyzing data that is growing at a compound
rate. Sales are a classic example of a type of data series that normally exhibits compound
growth.
(Salest - Sales t-1) = 30 + 1.25 (Sales t-1 - Sales t-2) + 1.1 (Sales t-4 - Sales t-5) t=1,2,.. T
t Period Sales
T 2000.2 $2,000
A) $2,270.00.
B) $2,625.00.
C) $1,730.00.
Explanation
Note that since we are forecasting 2000.3, the numbering of the "t" column has changed.
Alexis Popov, CFA, wants to estimate how sales have grown from one quarter to the next on
average. The most direct way for Popov to estimate this would be:
Explanation
If the goal is to simply estimate the dollar change from one period to the next, the most
direct way is to estimate xt = b0 + b1 × (Trend) + et, where Trend is simply 1, 2, 3, ....T. The
model predicts a change by the value b1 from one period to the next.
The regression results from fitting an AR(1) model to the first-differences in enrollment
growth rates at a large university includes a Durbin-Watson statistic of 1.58. The number of
quarterly observations in the time series is 60. At 5% significance, the critical values for the
Durbin-Watson statistic are dl = 1.55 and du = 1.62. Which of the following is the most
accurate interpretation of the DW statistic for the model?
B) Since dl < DW < du, the results of the DW test are inconclusive.
C) The Durbin-Watson statistic cannot be used with AR(1) models.
Explanation
The Durbin-Watson statistic is not useful when testing for serial correlation in an
autoregressive model where one of the independent variables is a lagged value of the
dependent variable. The existence of serial correlation in an AR model is determined by
examining the autocorrelations of the residuals.
The main reason why financial and time series intrinsically exhibit some form of
nonstationarity is that:
Explanation
Because all financial and time series relationships are dynamic, regression coefficients can
vary widely from period to period. Therefore, financial and time series will always exhibit
some amount of instability or nonstationarity.
Explanation
The Dickey-Fuller test estimates the equation (xt – xt-1) = b0 + (b1 - 1) * xt-1 + et and tests if
H0: (b1 – 1) = 0. Using a modified t-test, if it is found that (b1– 1) is not significantly
different from zero, then it is concluded that b1 must be equal to 1.0 and the series has a
unit root.
Troy Dillard, CFA, has estimated the following equation using semiannual data: xt = 44 +
0.1×xt– 1 – 0.25×xt– 2 - 0.15×xt– 3 + et. Given the data in the table below, what is Dillard's best
forecast of the second half of 2007?
Time Value
2003: I 31
2003: II 31
2004: I 33
2004: II 33
2005: I 36
2005: II 35
2006: I 32
2006: II 33
A) 33.74.
B) 34.05.
C) 34.36.
Explanation
To get the answer, Dillard must first make the forecast for 2007:I
E[x2007:I] = 34.05
Then, use this forecast in the equation for the first lag:
E[x2007:II] = 34.36
Which of the following statements regarding the instability of time-series models is most
accurate? Models estimated with:
A) shorter time series are usually more stable than those with longer time series.
a greater number of independent variables are usually more stable than those
B)
with a smaller number.
C) longer time series are usually more stable than those with shorter time series.
Explanation
Those models with a shorter time series are usually more stable because there is less
opportunity for variance in the estimated regression coefficients between the different
time periods.
Explanation
The t-statistics are all very small, indicating that none of the autocorrelations are
significantly different than zero. Based on these results, the model appears to be
appropriately specified. The error terms, however, should still be checked for
heteroskedasticity.
Which of the following is NOT a requirement for a series to be covariance stationary? The:
A) expected value of the time series is constant over time.
B) covariance of the time series with itself (lead or lag) must be constant.
C) time series must have a positive trend.
Explanation
For a time series to be covariance stationary: 1) the series must have an expected value
that is constant and finite in all periods, 2) the series must have a variance that is constant
and finite in all periods, and 3) the covariance of the time series with itself for a fixed
number of periods in the past or future must be constant and finite in all periods.
The table below shows the autocorrelations of the lagged residuals for quarterly theater
ticket sales that were estimated using the AR(1) model: ln(salest) = b0 + b1(ln salest − 1) + et.
Assuming the critical t-statistic at 5% significance is 2.0, which of the following is the most
likely conclusion about the appropriateness of the model? The time series:
A) contains seasonality.
B) contains ARCH (1) errors.
C) would be more appropriately described with an MA(4) model.
Explanation
The time series contains seasonality as indicated by the strong and significant
autocorrelation of the lag-4 residual.
Seasonal Lagged
Change in Lagged Change in
Warranty Change in
Quarter Warranty Warranty
Expense Warranty
Expense yt Expense yt-1
Expense yt-4
2002.1 103
2002.2 52 –51
Winston submits the following results to his supervisor. The first is the estimation of a trend
model for the period 2002:1 to 2004:4. The model is below. The standard errors are in
parentheses.
(14.37) (1.97)
R-squared = 16.2%
Winston also submits the following results for an autoregressive model on the differences in
the expense over the period 2004:to 2004:4. The model is below where "y" represents the
change in expense as defined in the table above. The standard errors are in parentheses.
R-squared = 99.98%
After receiving the output, Collier's supervisor asks him to compute moving averages of the
sales data.
Collier's supervisors would probably not want to use the results from the trend model for all
of the following reasons EXCEPT:
A) the model is a linear trend model and log-linear models are always superior.
B) the slope coefficient is not significant.
it does not give insights into the underlying dynamics of the movement of the
C)
dependent variable.
Explanation
Linear trend models are not always inferior to log-linear models. To determine which
specification is better would require more analysis such as a graph of the data over time.
As for the other possible answers, Collier can see that the slope coefficient is not
significant because the t-statistic is 1.37=2.7/1.97. Also, regressing a variable on a simple
time trend only describes the movement over time, and does not address the underlying
dynamics of the dependent variable.
yt = −0.9 − 0.23* yt −1 + et
R-squared = 78.3%
(0.823) (0.0222)
A) 1.16.
B) −0.73.
C) 0.77.
Explanation
Based on the autoregressive model, expected warranty expense in the first quarter of 2005
will be closest to:
A) $51 million.
B) $60 million.
C) $65 million.
Explanation
Substituting the 1-period lagged data from 2004.4 and the 4-period lagged data from
2004.1 into the model formula, change in warranty expense is predicted to be higher than
2004.4.
A) Yes, because the coefficient on yt–4 is large compared to its standard error.
No, because the slope coefficients in the autoregressive model have opposite
C)
signs.
Explanation
The t-statistic is equal to 0.83/0.0186 = 44.62, which is greater than the critical t-value (5%
LOS, 2-tailed, dof = 4) = 2.78
Jason Cranwell, CFA, has hypothesized that sales of luxury cars have grown at a constant
rate over the past 15 years.
b0 0.4563
b1 0.6874
R-squared 0.7548
F 12.63
Observations 180
A) 1.26.
B) 1.46.
C) 1.66.
Explanation
Cranwell is aware that the Dickey Fuller test can be used to discover whether a model has a
unit root. He is also aware that the test would use a revised set of critical t-values. What
would it mean to Bert to reject the null of the Dickey Fuller test (Ho: g = 0) ?
Explanation
The null hypothesis of g = 0 actually means that b1 – 1 = 0 , meaning that b1 = 1. Since we
have rejected the null, we can conclude that the model has no unit root.
Cranwell would also like to test for serial correlation in his AR(1) model. To do this, Cranwell
should:
A) use the provided Durbin Watson statistic and compare it to a critical value.
B) use a t-test on the residual autocorrelations over several lags.
determine if the series has a finite and constant covariance between leading
C)
and lagged terms of itself.
Explanation
To test for serial correlation in an AR model, test for the significance of residual
autocorrelations over different lags. The goal is for all t-statistics to lack statistical
significance. The Durbin-Watson test is used with trend models; it is not appropriate for
testing for serial correlation of the error terms in an autoregressive model. Constant and
finite unconditional variance is not an indicator of serial correlation but rather is one of
the requirements of covariance stationarity.
When using the root mean squared error (RMSE) criterion to evaluate the predictive power
of the model, which of the following is the most appropriate statement?
A) Use the model with the highest RMSE calculated using the in-sample data.
B) Use the model with the lowest RMSE calculated using the out-of-sample data.
C) Use the model with the lowest RMSE calculated using the in-sample data.
Explanation
RMSE is a measure of error hence the lower the better. It should be calculated on the out-
of-sample data i.e. the data not directly used in the development of the model. This
measure thus indicates the predictive power of our model.
Frank Batchelder and Miriam Yenkin are analysts for Bishop Econometrics. Batchelder and
Yenkin are discussing the models they use to forecast changes in China's GDP and how they
can compare the forecasting accuracy of each model. Batchelder states, "The root mean
squared error (RMSE) criterion is typically used to evaluate the in-sample forecast accuracy
of autoregressive models." Yenkin replies, "If we use the RMSE criterion, the model with the
largest RMSE is the one we should judge as the most accurate."
Explanation
The root mean squared error (RMSE) criterion is used to compare the accuracy of
autoregressive models in forecasting out-of-sample values (not in-sample values).
Batchelder is incorrect. Out-of-sample forecast accuracy is important because the future is
always out of sample, and therefore out-of-sample performance of a model is critical for
evaluating real world performance.
Yenkin is also incorrect. The RMSE criterion takes the square root of the average squared
errors from each model. The model with the smallest RMSE is judged the most accurate.
Albert Morris, CFA, is evaluating the results of an estimation of the number of wireless
phone minutes used on a quarterly basis within the territory of Car-tel International, Inc.
Some of the information is presented below (in billions of minutes):
Total 27 10,315.051
The variance of the residuals from one time period within the time series is not dependent
on the variance of the residuals in another time period.
Morris also models the monthly revenue of Car-tel using data over 96 monthly observations.
The model is shown below:
The value for WPM this period is 544 billion. Using the results of the model, the forecast
Wireless Phone Minutes three periods in the future is:
A) 691.30.
B) 586.35.
C) 683.18.
Explanation
Explanation
The model is specified as an AR Model, but there is no seasonal lag. No moving averages
are employed in the estimation of the model.
A) 381.29 million.
B) 8.83 million.
C) 43.2 million.
Explanation
b0 43.2
MRL = = = 381.29 million
1 − b1 1 − 0.8867
Morris concludes that the current price of Car-tel stock is consistent with single stage
constant growth model (with g=3%). Based on this information, the sales model is most
likely:
Explanation
If constant growth rate is an appropriate model for Car-tel, its dividends (as well as
earnings and revenues) will grow at a constant rate. In such a case, the time series needs
to be adjusted by taking the natural log of the time series. Taking the natural log of the
time series would lead to a series that exhibits a constant amount of growth (and still not
stationary). The final step would be to first difference the transformed series to make it
covariance stationary. First differencing would remove the trending component of a
covariance non-stationary time series but would not be appropriate for transforming an
exponentially growing time series.
Bill Johnson, CFA, has prepared data concerning revenues from sales of winter clothing
made by Polar Corporation. This data is presented (in $ millions) in the following table:
2013.1 182
2013.2 74 −108
2013.3 78 4 −108
2014.3 90 11 −115 4
Explanation
Johnson will use the table to forecast values using an autoregressive model for periods in
succession since each successive forecast relies on the forecast for the preceding period.
The seasonal lag is introduced to account for seasonal variations in the observed data.
The value that Johnson should enter in the table in place of "w" is:
A) 164.
B) −115.
C) −48.
Explanation
The seasonal lagged change in sales shows the change in sales from the period 4 quarters
before the current period. Sales in the year 2013 quarter 4 increased $164 million over the
prior period.
Imagine that Johnson prepares a change-in-sales regression analysis model with seasonality,
which includes the following:
Coefficients
Intercept −6.032
Lag 1 0.017
Lag 4 0.983
Based on the model, expected sales in the first quarter of 2015 will be closest to:
A) 190.
B) 210.
C) 155.
Explanation
Substituting the 1-period lagged data from 2014.4 and the 4-period lagged data from
2014.1 into the model formula, change in sales is predicted to be −6.032 + (0.017 × 170) +
(0.983 × −48) = −50.326. Expected sales are 260 + (−50.326) = 209.674.
Explanation
Johnson's model transforms raw sales data by first differencing it and then modeling
change in sales. This is most likely an adjustment to make the data stationary for use in an
AR model.
To test for covariance-stationarity in the data, Johnson would most likely use a:
A) Durbin-Watson test.
B) t-test.
C) Dickey-Fuller test.
Explanation
The Dickey-Fuller test for unit roots could be used to test whether the data is covariance
non-stationarity. The Durbin-Watson test is used for detecting serial correlation in the
residuals of trend models but cannot be used in AR models. A t-test is used to test for
residual autocorrelation in AR models.
invalid standard errors of regression coefficients, but statistical tests will still be
A)
valid.
B) invalid standard errors of regression coefficients and invalid statistical tests.
invalid estimates of regression coefficients, but the standard errors will still be
C)
valid.
Explanation
Suppose you estimate the following model of residuals from an autoregressive model:
If the residual at time t is 0.9, the forecasted variance for time t+1 is:
A) 0.790.
B) 0.736.
C) 0.850.
Explanation
The variance at t = t + 1 is 0.25 + [0.60 (0.9)2] = 0.25 + 0.486 = 0.736. See also, ARCH
models.
Explanation
The one-period forecast of a random walk model without drift is E(xt+1) = E(xt + et ) = xt + 0,
so the forecast is simply xt = 2.2. For a random walk process, the variance changes with
the value of the observation. However, the error term et = xt - xt-1 is not autocorrelated.
Housing industry analyst Elaine Smith has been assigned the task of forecasting housing
foreclosures. Specifically, Smith is asked to forecast the percentage of outstanding
mortgages that will be foreclosed upon in the coming quarter. Smith decides to employ
multiple linear regression and time series analysis.
Besides constructing a forecast for the foreclosure percentage, Smith wants to address the
following two questions:
Smith contends that adjustable rate mortgages often are used by higher risk borrowers and
that their homes are at higher risk of foreclosure. Therefore, Smith decides to use short-
term interest rates as one of the independent variables to test Research Question 1.
Smith uses quarterly data over the past 5 years to derive her regression. Smith's regression
equation is provided in Exhibit 1:
Exhibit 1: Foreclosure Share Regression Equation
where:
= the quarterly change in the 1-year Treasury bill rate (e.g., ΔINT = 2 for a
ΔINT
two percentage point increase in interest rates)
STIM = 1 for quarters in which a Federal fiscal stimulus package was in place
Regression 3 15 5.0000
Error 16 5 0.3125
Total 19 20
Smith expresses the following concerns about the test statistics derived in her regression:
Before completing her analysis, Smith runs a regression of the changes in foreclosure share
on its lagged value. The following regression results and autocorrelations were derived using
quarterly data over the past 5 years ( Exhibit 4 and Exhibit 5, respectively):
1 0.05 0.22
2 -0.35 -1.53
3 0.25 1.09
4 0.10 0.44
The most appropriate interpretation from the foreclosure share regression equation model
is:
Explanation
Based on her regression results in Exhibit 2, using a 5% level of significance, Smith should
conclude that:
Explanation
The appropriate test statistic for tests of significance on individual slope coefficient
estimates is the t-statistic, which is provided in Exhibit 2 for each regression coefficient
estimate. The reported t-statistic equals -2.10 for the STIM slope estimate and equals 2.35
for the CRISIS slope estimate. The critical t-statistic for the 5% significance level equals
2.12 (16 degrees of freedom, 5% level of significance).
Therefore, the slope estimate for STIM is not statistically significant (the reported t-
statistic, -2.10, is not large enough). In contrast, the slope estimate for CRISIS is statistically
significant (the reported t-statistic, 2.35, exceeds the 5% significance level critical value).
A) 0.16.
B) 0.53.
C) 0.56.
Explanation
The formula for the Standard Error of the Estimate (SEE) is:
SSE 5
SEE = √ = √ = 0.56
n - k - 1 16
The SEE equals the standard deviation of the regression residuals. A low SEE implies a high
R2.
Explanation
A) Smith is correct on both the forecast and the mean reverting level.
Smith is correct on the mean-reverting level for forecast of change in
B)
foreclosure share only.
Smith is correct on the two-step ahead forecast for change in foreclosure share
C)
only.
Explanation
Forecasts are derived by substituting the appropriate value for the period t-1 lagged value.
So, the one-step ahead forecast equals 0.30%. The two-step ahead (%) forecast is derived
by substituting 0.30 into the equation.
Assume for this question that Smith finds that the foreclosure share series has a unit root.
Under these conditions, she can most reliably regress foreclosure share against the change
in interest rates (ΔINT) if:
Explanation
The error terms in the regressions for choices A, B, and C will be nonstationary. Therefore,
some of the regression assumptions will be violated and the regression results are
unreliable. If, however, both series are nonstationary (which will happen if each has unit
root), but cointegrated, then the error term will be covariance stationary and the
regression results are reliable.
Suppose you estimate the following model of residuals from an autoregressive model:
If the residual at time t is 2.0, the forecasted variance for time t+1 is:
A) 2.0.
B) 3.6.
C) 3.2.
Explanation
Barry Phillips, CFA, has the following time series observations from earliest to latest: (5, 6, 5,
7, 6, 6, 8, 8, 9, 11). Phillips transforms the series so that he will estimate an autoregressive
process on the following data (1, -1, 2, -1, 0, 2, 0, 1, 2). The transformation Phillips employed
is called:
A) first differencing.
B) beta drift.
C) moving average.
Explanation
Phillips obviously first differenced the data because the 1=6-5, -1=5-6, .... 1 = 9 - 8, 2 = 11 -
9.
Barry Phillips, CFA, has estimated an AR(1) relationship (xt = b0 + b1 × xt-1 + et) and got the
following result: xt+1 = 0.5 + 1.0xt + et. Phillips should:
C) not first difference the data because b1 − b0 = 1.0 − 0.5 = 0.5 < 1.
Explanation
The condition b1 = 1 means that the series has a unit root and is not stationary. The
correct way to transform the data in such an instance is to first difference the data.
Given an AR(1) process represented by xt+1 = b0 + b1×xt + et, the process would not be a
random walk if:
B) E(et)=0.
C) b1 = 1.
Explanation
For a random walk, the long-run mean is undefined. The slope coefficient is one, b1=1, and
that is what makes the long-run mean undefined: mean = b0/(1-b1).
A) 1.88.
B) 4.14.
C) 6.69.
Explanation
Wellington's out-of-sample forecast of LN(xt) is 1.9 = 1.4 + 0.02 × 25, and e1.9 = 6.69. (Six
years of quarterly observations, at 4 per year, takes us up to t = 24. The first time period
after that is t = 25.)
David Brice, CFA, has used an AR(1) model to forecast the next period's interest rate to be
0.08. The AR(1) has a positive slope coefficient. If the interest rate is a mean reverting
process with an unconditional mean, a.k.a., mean reverting level, equal to 0.09, then which
of the following could be his forecast for two periods ahead?
A) 0.072.
B) 0.113.
C) 0.081.
Explanation
As Brice makes more distant forecasts, each forecast will be closer to the unconditional
mean. So, the two period forecast would be between 0.08 and 0.09, and 0.081 is the only
possible answer.
Yolanda Seerveld is an analyst studying the growth of sales of a new restaurant chain called
Very Vegan. The increase in the public's awareness of healthful eating habits has had a very
positive effect on Very Vegan's business. Seerveld has gathered quarterly data for the
restaurant's sales for the past three years. Over the twelve periods, sales grew from $17.2
million in the first quarter to $106.3 million in the last quarter. Because Very Vegan has
experienced growth of more than 500% over the three years, the Seerveld suspects an
exponential growth model may be more appropriate than a simple linear trend model.
However, she begins by estimating the simple linear trend model:
(sales)t = α + β × (Trend)t + εt
Regression Statistics
Multiple R 0.952640
R2 0.907523
Adjusted R2 0.898275
Observations 12
ANOVA
df SS
Regression 1 6495.203
Residual 10 661.8659
Total 11 7157.069
Regression Statistics
Multiple R 0.952028
R2 0.906357
Adjusted R2 0.896992
Standard Error 0.166686
Observations 12
ANOVA
df SS
Regression 1 2.6892
Residual 10 0.2778
Total 11 2.9670
Seerveld compares the results based upon the output statistics and conducts two-tailed
tests at a 5% level of significance. One concern is the possible problem of autocorrelation,
and Seerveld makes an assessment based upon the first-order autocorrelation coefficient of
the residuals that is listed in each set of output. Another concern is the stationarity of the
data. Finally, the analyst composes a forecast based on each equation for the quarter
following the end of the sample.
A) The simple trend regression is not, but the log-linear trend regression is.
B) Yes, both are significant.
C) The simple trend regression is, but not the log-linear trend regression.
Explanation
The respective t-statistics are 6.7400 / 0.6803 = 9.9074 and 0.1371 / 0.0140 = 9.7929. For
10 degrees of freedom, the critical t-value for a two-tailed test at a 5% level of significance
is 2.228, so both slope coefficients are statistically significant.
With respect to the possible problems of autocorrelation and nonstationarity, using the log-
linear transformation appears to have:
Explanation
The fact that there is a significant trend for both equations indicates that the data is not
stationary in either case. As for autocorrelation, the analyst really cannot test it using the
Durbin-Watson test because there are fewer than 15 observations, which is the lower limit
of the DW table. Looking at the first-order autocorrelation coefficient, however, we see
that it increased (in absolute value terms) for the log-linear equation. If anything,
therefore, the problem became more severe.
Using the simple linear trend model, the forecast of sales for Very Vegan for the first out-of-
sample period is:
A) $97.6 million.
B) $113.0 million.
C) $123.0 million.
Explanation
A) $109.4 million.
B) $117.0 million.
C) $121.2 million.
Explanation
Barry Phillips, CFA, is analyzing quarterly data. He has estimated an AR(1) relationship (xt =
b0 + b1 × xt-1 + et) and wants to test for seasonality. To do this he would want to see if which
of the following statistics is significantly different from zero?
A) Correlation(et, et-5).
B) Correlation(et, et-4).
C) Correlation(et, et-1).
Explanation
Although seasonality can make the other correlations significant, the focus should be on
correlation(et, et-4) because the 4th lag is the value that corresponds to the same season
as the predicted variable in the analysis of quarterly data.
To qualify as a covariance stationary process, which of the following does not have to be
true?
Explanation
If a series is covariance stationary then the unconditional mean is constant across periods.
The unconditional mean or expected value is the same from period to period: E[xt] =
E[xt+1]. The covariance between any two observations equal distance apart will be equal,
e.g., the t and t-2 observations with the t and t+2 observations. The one relationship that
does not have to be true is the covariance between the t and t-1 observations equaling
that of the t and t-2 observations.
Modeling the trend in a time series of a variable that grows at a constant rate with
continuous compounding is best done with:
Explanation
The log-linear transformation of a series that grows at a constant rate with continuous
compounding (exponential growth) will cause the transformed series to be linear.
Suppose that the time series designated as Y is mean reverting. If Yt+1 = 0.2 + 0.6 Yt, the best
prediction of Yt+1 is:
A) 0.5.
B) 0.8.
C) 0.3.
Explanation
The prediction is Yt+1 = b0 / (1-b1) = 0.2 / (1-0.6) = 0.5
The data below yields the following AR(1) specification: xt = 0.9 – 0.55xt-1 + Et , and the
indicated fitted values and residuals.
1 1 - -
2 -1 0.35 -1.35
3 2 1.45 0.55
4 -1 -0.2 -0.8
5 0 1.45 -1.45
6 2 0.9 1.1
7 0 -0.2 0.2
8 1 0.9 0.1
9 2 0.35 1.65
The following sets of data are ordered from earliest to latest. To test for ARCH, the
researcher should regress:
Explanation
The test for ARCH is based on a regression of the squared residuals on their lagged values.
The squared residuals are (1.8225, 0.3025, 0.64, 2.1025, 1.21, 0.04, 0.01, 2.7225). So,
(1.8225, 0.3025, 0.64, 2.1025, 1.21, 0.04, 0.01) is regressed on (0.3025, 0.64, 2.1025, 1.21,
0.04, 0.01, 2.7225). If coefficient a1 in:
2 2
ε̂ = a0 + a1 ε̂ + μ
t t-1 t
Consider the estimated model xt = -6.0 + 1.1 xt-1 + 0.3 xt-2 + εt that is estimated over 50
periods. The value of the time series for the 49th observation is 20 and the value of the time
series for the 50th observation is 22. What is the forecast for the 51st observation?
A) 23.
B) 24.2.
C) 30.2.
Explanation
Explanation
The goal of a time series model is to identify factors that can be predicted. Seasonality in a
time series refers to patterns that repeat at regular intervals. When a time series exhibits
seasonality, seasonal lags should be included in the model in order to increase its
predictive ability.
(Salest - Sales t-1)= 100 - 1.5 (Sales t-1 - Sales t-2) + 1.2 (Sales t-4 - Sales t-5) t=1,2,.. T
t Period Sales
T 2000.2 $1,000
A) $730.00.
B) $1,430.00.
C) $1,730.00.
Explanation
The table below shows the autocorrelations of the lagged residuals for the first differences
of the natural logarithm of quarterly motorcycle sales that were fit to the AR(1) model: (ln
salest − ln salest − 1) = b0 + b1(ln salest − 1 − ln salest − 2) + εt. The critical t-statistic at 5%
significance is 2.0, which means that there is significant autocorrelation for the lag-4
residual, indicating the presence of seasonality. Assuming the time series is covariance
stationary, which of the following models is most likely to CORRECT for this apparent
seasonality?
Explanation
Which of the following statements regarding a mean reverting time series is least accurate?
Explanation
If the current value of the time series is above the mean reverting level, the prediction is
that the time series will decrease; if the current value of the time series is below the mean
reverting level, the prediction is that the time series will increase.
Alexis Popov, CFA, is analyzing monthly data. Popov has estimated the model xt = b0 + b1 ×
xt-1 + b2 × xt-2 + et. The researcher finds that the residuals have a significant ARCH process.
The best solution to this is to:
Explanation
If the residuals have an ARCH process, then the correct remedy is generalized least
squares which will allow Popov to better interpret the results.
Explanation
The root mean squared error (RMSE) criterion is used to compare the accuracy of
autoregressive models in forecasting out-of-sample values. To determine which model will
more accurately forecast future values, we calculate the square root of the mean squared
error. The model with the smallest RMSE is the preferred model. The RMSE for Model 1 is
√10.429 = 3.23, while the RMSE for Model 2 is √11.642 = 3.41. Since Model 1 has the lowest
RMSE, that is the one Zox should conclude is the most accurate.
Suppose that the following time-series model is found to have a unit root:
Explanation
Estimation with first differences requires calculating the change in the variable from
period to period.
Rhonda Wilson, CFA, is analyzing sales data for the TUV Corp, a current equity holding in her
portfolio. She observes that sales for TUV Corp. have grown at a steadily increasing rate over
the past ten years due to the successful introduction of some new products. Wilson
anticipates that TUV will continue this pattern of success. Which of the following models is
most appropriate in her analysis of sales for TUV Corp?
A log-linear trend model, because the data series can be graphed using a
A)
straight, upward-sloping line.
A linear trend model, because the data series is equally distributed above and
B)
below the line and the mean is constant.
A log-linear trend model, because the data series exhibits a predictable,
C)
exponential growth trend.
Explanation
The log-linear trend model is the preferred method for a data series that exhibits a trend
or for which the residuals are predictable. In this example, sales grew at an exponential, or
increasing rate, rather than a steady rate.
Vikas Rathod, an enrolled candidate for the CFA Level II examination, has decided to perform
a calendar test to examine whether there is any abnormal return associated with
investments and disinvestments made in blue-chip stocks on particular days of the week. As
a proxy for blue-chips, he has decided to use the S&P 500 index. The analysis will involve the
use of dummy variables and is based on the past 780 trading days. Here are selected
findings of his study:
RSS 0.0039
SSE 0.9534
SST 0.9573
R-squared 0.004
SEE 0.035
Jessica Jones, CFA, a friend of Rathod, overhears that he is interested in regression analysis
and warns him that whenever heteroskedasticity is present in multiple regression this could
undermine the regression results. She mentions that one easy way to spot conditional
heteroskedasticity is through a scatter plot, but she adds that there is a more formal test.
Unfortunately, she can't quite remember its name. Jessica believes that heteroskedasticity
can be rectified using White-corrected standard errors. Her son Jonathan who has also taken
part in the discussion, hears this comment and argues that White correction would typically
reduce the number of Type I errors in financial data.
A) Four.
B) Six.
C) Five.
Explanation
There are 5 trading days in a week, but we should use (n − 1) or 4 dummies in order to
ensure no violations of regression analysis occur.
Explanation
The omitted variable is represented by the intercept. So, if we have four variables to
represent Monday through Thursday, the intercept would represent returns on Friday.
Explanation
Explanation
Are Jessica and her son Jonathan, correct in terms of the method used to correct for
heteroskedasticity and the likely effects?
A) Neither is correct.
B) Both are correct.
C) One is correct.
Explanation
Jessica is correct. White-corrected standard errors are also known as robust standard
errors. Jonathan is correct because White-corrected errors are higher than the biased
errors leading to lower computed t-statistics and therefore less frequent rejection of the
Null Hypothesis (remember incorrectly rejecting a true Null is Type I error).
Assuming the a1 term of an ARCH(1) model is significant, the following can be forecast:
Explanation
A Model is ARCH(1) if the coefficient a1 is significant. It will allow for the estimation of the
variance of the error term.
Which of the following statements regarding time series analysis is least accurate?
Explanation
Explanation
Forecasts in autoregressive models are made using the chain-rule, such that the earlier
forecasts are made first. Each later forecast depends on these earlier forecasts.
The regression results from fitting an AR(1) to a monthly time series are presented below.
What is the mean-reverting level for the model?
A) 0.6151.
B) 7.3220.
C) 1.6258.
Explanation
Consider the estimated model xt = −6.0 + 1.1 xt − 1 + 0.3 xt − 2 + εt that is estimated over 50
periods. The value of the time series for the 49th observation is 20 and the value of the time
series for the 50th observation is 22. What is the forecast for the 52nd observation?
A) 42.
B) 24.2.
C) 27.22.
Explanation
Alexis Popov, CFA, has estimated the following specification: xt = b0 + b1 × xt-1 + et. Which of
the following would most likely lead Popov to want to change the model's specification?
B) b0 < 0.
Explanation
If correlation(et, et-2) is not zero, then the model suffers from 2nd order serial correlation.
Popov may wish to try an AR(2) model. Both of the other conditions are acceptable in an
AR(1) model.
Bert Smithers, CFA, is a sell-side analyst who has been asked to look at the luxury car sector.
He has hypothesized that sales of luxury cars have grown at a constant rate over the past 15
years.
Exhibit 1
b0 0.4563
b1 0.6874
R-squared 0.7548
Durbin-Watson 1.23
F 12.63
Observations 15
If his assumption about a constant is correct, which of the following models is most
appropriate for modeling these data?
Explanation
Whenever the rate of change is constant over time, the appropriate model is a log- linear
trend model. A is a linear trend model and C is an autoregressive model.
A) 1.66.
B) 1.26.
C) 1.46.
Explanation
Bert is aware that the Dickey Fuller test can be used to discover whether a model has a unit
root. He is also aware that the test would use a revised set of critical t-values. What would it
mean to Bert to reject the null of the Dickey Fuller test (Ho: g = 0)?
A) There is a unit root and the model cannot be used in its current form.
B) There is a unit root but the model can be used if covariance-stationary.
C) There is no unit root.
Explanation
The null hypothesis of g = 0 actually means that b1 – 1 = 0, this will be the case if b1 = 1.
Since we have rejected the null, we can conclude that the model has no unit root.
Bert would also like to test for serial correlation in his AR(1) model. How could this be done?
Explanation
To test for serial correlation in an AR model, test for the significance of residual
autocorrelations over different lags. The goal is for all t-statistics to lack statistical
significance. A is only used for trend models and C is one of the requirements of
covariance stationarity.
When using the root mean squared error (RMSE) criterion to evaluate the predictive power
of the model, which of the following is the most appropriate statement?
A) Use the model with the lowest RMSE calculated using the in-sample data.
B) Use the model with the lowest RMSE calculated using the out-of-sample data.
C) Use the model with the highest RMSE calculated using the in-sample data.
Explanation
RMSE, or root of the mean squared error, is a measure similar to the SEE from multiple
regression. The lower, the better. It should be calculated on the out-of-sample data (i.e.,
the data not directly used in the development of the model) as this will be a better test of
the relevance and predictive power of the model going forward. This measure thus
indicates the predictive power of our model.
Bert would like to use his AR(1) model to forecast future sales of luxury automobiles. What is
the annualized growth rate between today and 20X3?
A) 11%.
B) 12%.
C) 10%.
Explanation
To get the 20X2 value, plug today's value of 1.05 into the model:
The annualized return between 20X1 and 20X3 is, therefore, (1.27 / 1.05)0.5 – 1 = 9.87%.
Explanation
Covariance stationarity requires that the expected value and the variance of the time
series be constant over time.
1 + 0.1×xt– 4 + et. Given the data in the table below, what is Dillard's best estimate of the first
quarter of 2007?
Time Value
2005: I 62
2005: II 62
2005: III 66
2005: IV 66
2006: I 72
2006: II 70
2006: III 64
2006: IV 66
A) 66.60.
B) 66.40.
C) 67.20.
Explanation
To get the answer, Dillard will use the data for 2006: IV and 2006: I, xt– 1 = 66 and xt– 4 = 72
respectively:
E[x2007:I] = 67.20
Which of the following statements regarding unit roots in a time series is least accurate?
Explanation
The presence of a unit root means that the least squares regression procedure that we
have been using to estimate an AR(1) model cannot be used without transforming the data
first.
A time series with a unit root will follow a random walk process. Since a time series that
follows a random walk is not covariance stationary, modeling such a time series in an AR
model can lead to incorrect statistical conclusions, and decisions made on the basis of
these conclusions may be wrong. Unit roots are most likely to occur in time series that
trend over time or have a seasonal element.