Econometrics Cheat Sheet
Econometrics Cheat Sheet
Experimental Data from randomized experiment Cov (y, x) 5. V ar (u|x1 , . . . , xk ) = V ar (u) = σ 2 (Homoskedasticity)
β̂1 = V
The way we answer yes/no questions about our population Level-log y log(x) ∆y ≈ (β1 /100) [1%∆x] Standard error: V ar
using a sample of data. e.g. “Does increasing public school Log-level log(y) x %∆y ≈ (100β1 ) ∆x s
spending increase student achievement?” Log-log log(y) log(x) %∆y ≈ β1 %∆x σ̂ 2
se(β̂j ) = , j = 1, . . . , k
null hypothesis (H0 ) Typically, H0 : θ = 0 Quadratic y x + x2 ∆y = (β1 + 2β2 x) ∆x SSTj (1 − Rj2 )
alt. hypothesis (Ha ) Typically, H0 : θ 6= 0 Note: DV = dependent variable; RHS = right hand side
significance level (α) Tolerance for making Type I error; Classical Linear Model (CLM)
(e.g. 10%, 5%, or 1%) Multiple Regression Model Add a 6th assumption to Gauss-Markov:
test statistic (T ) Some function of the sample of data Multiple regression is more useful than simple regression
critical value (c) Value of T such that reject H0 if |T | > c;
because we can more plausibly estimate ceteris paribus 6. u is distributed N 0, σ 2
c depends on α; relationships (i.e. E (u|x) = E (u) is more plausible)
c depends on if 1- or 2-sided test Need this to know what the exact distribution of β̂j is
p-value Largest α at which fail to reject H0 ; y = β0 + β1 x 1 + · · · + βk x k + u
reject H0 if p < α β̂1 , . . . , β̂k : partial effect of each of the x’s on y • If A(6) fails, need asymptotics to test β’s
Simple Regression Model • Then, interpret distr. of β̂j as asymptotic (not exact)
Regression is useful because we can estimate a ceteris paribus β̂0 = y − β̂1 x1 − · · · − β̂k xk
relationship between some variable x and our outcome y V
How to resolve E (u|x) 6= 0 • Can also use with more than 2 time periods Binary dependent variables
How can we get unbiased β̂j ’s when E (u|x) 6= 0? • δ̂1 has same interpretation, different math formula Three options for estimation when y is binary (0/1):
Can get causal effects from pooled cross sectional data s where φ (·) is the standard normal pdf
2
σu
A nat. experiment divides units into treatment, control groups θ =1− 2
σu + T σa2 Interpreting logit/probit parameter estimates
β̂RE → β̂F E as θ → 1 • β’s that come from logit/probit 6= β’s from LPM
yit = β0 + δ0 d2t + β1 dTit + δ1 d2t × dTit + uit
β̂RE → β̂P OLS as θ → 0 • But, sign is same
where
• In logit/probit, we have
• d2t = dummy for being in time period 2
RE assumes E (a|x) = 0 ∂p (x)
• dTit = dummy for being in the treatment group = βj g (xβ)
∂xj
• δ̂1 = difference in differences Cluster-robust SEs
• Serial correlation of νit is a problem where g (xβ) is the first derivative of G (xβ)
δ̂1 = y treat,2 − y control,2 − y treat,1 − y control,1
• Use cluster-robust SEs ∂p(x)
• In LPM, we have ∂xj
= βj
Extensions: • These correct for serial corr. and heterosk.
• Can also include x’s in the model • Cluster at the unit level Layout by Winston Chang, http://wch.github.io/latexsheet/