Exam 2
Exam 2
Elaborate two advantages of using probit or logit over linear regression models for binary
outcomes. (5p)
Two key advantages of using probit or logit models over linear regression models for binary
outcomes are:
1. Bounded Predictions:
Linear regression can produce predicted probabilities outside the [0, 1] range, which is
not appropriate for binary outcomes. For example, a linear regression model might
predict probabilities less than 0 or greater than 1, which doesn't make sense in the
context of binary outcomes (e.g., the probability of an event occurring or not). In
contrast, both the logit and probit models transform the output through a link function
(logistic or normal cumulative distribution function, respectively), ensuring that the
predicted probabilities are always between 0 and 1, making them more suitable for
binary data.
In linear regression, R2R^2R2 represents the proportion of the variance in the dependent
variable that is explained by the independent variables. It ranges from 0 to 1, with a value of 1
indicating that the model explains all of the variability in the dependent variable and a value of 0
meaning that the model explains none. This interpretation works well because linear regression
assumes a continuous outcome and measures how well the model fits the data.
For binary outcomes, such as those modeled with logit or probit regression, the pseudo
R2R^2R2 is an approximation of model fit but does not have the same interpretation as the
standard R2R^2R2. The pseudo R2R^2R2 is often based on different calculations that involve
likelihood functions rather than variance explained. Common pseudo R2R^2R2 measures
include:
• Cox and Snell's R2R^2R2, Nagelkerke's R2R^2R2, etc.: These are also variations that
aim to measure the goodness of fit, but like McFadden's, they do not have the same
straightforward interpretation as the standard R2R^2R2.
• Pseudo R2R^2R2 values are usually smaller and don't offer the same percentage-based
interpretation (e.g., you can't say a pseudo R2R^2R2 of 0.2 explains 20% of the
"variance" in a binary outcome).
Thus, the interpretation of pseudo R2R^2R2 is more about model fit in terms of likelihood rather
than variance explanation, which is why it should not be directly compared to standard
R2R^2R2.
a) Elaborate two advantages of using panel data compared to cross-section data or time-series
data. Explain the advantages with an empirical research example. (8p)
Panel data has several advantages over cross-section and time-series data. Two key advantages
are:
Panel data allows researchers to analyze changes within entities over time, making it easier to
study causality and dynamic relationships. It provides the opportunity to assess lagged effects,
path dependency, and state dependence, which are difficult to capture with pure cross-
sectional or time-series data.
b) Write the linear squares dummy variable (LSDV) regression for an individual fixed effect
model. (4p)
c) Describe the purpose of the Hausman test in panel data analysis (2p) What is its null
hypothesis? (2p) Explain verbally how the test helps in choosing between different panel data
models. (4p) (You do not need to write the test statistics of the Hausman test)
3. Consider a financial literacy program on savings behavior across different regions. The
program is implemented in some regions but not others, and within these regions, it targets
households with children. (20 points in total)
a) Design a difference-in-differences (DD) strategy for evaluating the impact of the financial
literacy program on savings behavior. (Note: To get full points, you need to specify the DD
estimator either based on conditional expectations of the outcome variable or in a regression.)
(4p)
To evaluate the impact of the financial literacy program on savings behavior, we can use a
difference-in-differences (DD) approach, which compares changes in savings behavior over
time between treated and untreated groups.
• Treated group: Households with children in regions where the financial literacy
program was implemented.
• Control group: Households with children in regions where the program was not
implemented.
• The key assumption is that both groups would have followed a similar savings trend in
the absence of the program (parallel trends assumption).
2. Time Periods
For the DiD estimator to provide valid causal inferences, two key assumptions must hold:
The most critical assumption in DiD is that, in the absence of the financial literacy program,
savings behavior in treated and control groups would have followed similar trends over
time. This means that any observed difference in savings between the groups after the program
must be due to the program itself and not pre-existing differences in savings trajectories.
• The assumption requires that, before the program was introduced, savings trends
among households with children in treated regions were evolving similarly to those in
control regions.
• If treated regions had faster-growing savings rates before the program (e.g., due to better
financial infrastructure), then the estimated impact of the program could be overstated.
The DiD method assumes that no other events or policies differentially affected the treated
group compared to the control group during the study period. If external shocks impact only
the treated group, the estimated program effect may be biased.
• Suppose the government introduced a separate tax incentive for savings that only
applies to households in treated regions. This policy, rather than the financial literacy
program, could be responsible for increased savings.
• To address this, researchers should control for major regional economic shocks or
conduct robustness checks, such as using additional control groups.
Conclusion:
If these assumptions hold, the DiD estimator will capture the true causal effect of the
financial literacy program on savings behavior. However, violations of these assumptions can
lead to biased estimates, and researchers must check them carefully through placebo tests,
pre-trend analysis, and robustness checks.
c) Design a difference-in-differences-in-differences (DDD) strategy for this problem. (Note: To
get full points, you need to specify the DD estimator either based on conditional expectations of
the outcome variable or in a regression. (8p)
• Treated group: Households with children in regions where the program was
implemented.
• Control group 1 (DiD control): Households with children in regions without the
program.
• Control group 2 (DDD control): Households without children in both treated and
control regions.
This additional comparison group helps account for other regional factors that might
influence savings behavior, such as regional economic shocks or policy changes affecting all
households in treated regions.
4. Why Use DDD Instead of DiD?
• The DDD approach accounts for regional shocks that might impact savings in both
treated and control regions.
• It also helps control for differences between households with and without children,
isolating the program’s effect more cleanly.
• If a general savings trend affected both households with and without children, DDD
would remove this bias by comparing changes in savings across both groups and
regions.
Conclusion:
d) What bias can be avoided by the DDD strategy but not by the DD strategy? (2p)
Bias Avoided by the DDD Strategy but Not by the DD Strategy (2p)
The DDD strategy helps eliminate biases arising from time-varying, region-specific shocks
that could affect both treated and control regions differently in a DD design. Specifically, DDD
accounts for:
o The DDD strategy removes this bias by comparing savings trends among
households with and without children across both treated and control
regions.
Conclusion:
The DDD strategy eliminates biases from confounding time-varying shocks specific to
treated regions and differential trends in household savings behavior that could distort the
true effect of the financial literacy program.
b) What is the null hypothesis implied by the restricted model? (2p) How to test this null
hypothesis? (2p)
C)How to estimate the risk premium for each factor? (4p)
d)Is the risk premium estimated in c) equivalent to the risk premium estimated from the cross-
sectional regression approach? Why? (4p)
Great question! The risk premium estimated in the approach I outlined earlier (based on time-
series regressions for each asset) is not exactly equivalent to the risk premium estimated using
a cross-sectional regression approach. Here’s why:
Time-Series Approach:
In the time-series approach, you're estimating the relationship between the returns of each
asset and the risk factors over time. The process involves:
1. Regression of Asset Returns on Risk Factors: For each asset, you regress its excess
return (return minus the risk-free rate) on the excess returns of the risk factors.
2. Risk Premiums: The risk premiums for the factors are typically estimated as the
average excess returns of the factors over the period you are analyzing.
In this method, you are focusing on the historical time-series data to estimate how much the
factors explain the variation in the asset returns over time.
Key Differences:
o In the time-series approach, the risk premium for each factor is an average
excess return over time, considering how the factor returns evolve.
o In the cross-sectional approach, the risk premium represents how much return
is required by investors for taking on exposure to a factor at a given point in
time.
Equivalence?
The risk premiums estimated in both approaches could be similar if factor returns are stable
over time and if the factor exposures (betas) of the assets are consistent across the sample
period. However, in practice, the two methods can yield different results:
• The cross-sectional regression approach gives a snapshot of the factor prices (risk
premiums) at a single point in time.
• The time-series regression approach gives the historical average of the risk premiums
over the time period considered.
Thus, while both approaches estimate the compensation investors demand for exposure to risk
factors, the time-series method focuses on how the returns of factors correlate with asset
returns over time, whereas the cross-sectional method focuses on the factor's pricing at a
given point in time across assets.
In summary, the risk premium from the time-series regression approach is not identical to that
from the cross-sectional regression approach, because they are estimating risk premiums in
different contexts (historical average vs. cross-sectional pricing). However, under certain
conditions (such as stable factor returns and exposures), the two approaches could yield
similar estimates.
b) Write the formula for GJR-GARCH(1,1). (5p) Which stylized feature of financial time-series
data can be captured by GJR-GARCH in addition to those by GARCH? (3p) How is this feature
parametrized in GJR-GARCH? (3p)