Regression Discontinuity Designs in Stata

Matias D. Cattaneo
University of Michigan

July 30, 2015

Overview

Main goal: learn about treatment effect of policy or intervention.

If treatment randomization available, easy to estimate treatment effects.

If treatment randomization not available, turn to observational studies.

  - Instrumental variables.
  - Selection on observables.

Regression discontinuity (RD) designs.

  - Simple and objective. Requires little information, if design available.
  - Might be viewed as a local randomized trial.
  - Easy to falsify, easy to interpret.
  - Careful: very local!

Overview of RD packages
https://sites.google.com/site/rdpackages

rdrobust package: estimation, inference and graphical presentation using local polynomials, partitioning, and spacings estimators.
  - rdrobust: RD inference (point estimation and CI; classic, bias-corrected, robust).
  - rdbwselect: bandwidth or window selection (IK, CV, CCT).
  - rdplot: plots data (with optimal block length).

rddensity package: discontinuity in density test at cutoff (a.k.a. manipulation testing) using novel local polynomial density estimator.
  - rddensity: manipulation testing using local polynomial density estimation.
  - rdbwdensity: bandwidth or window selection.

rdlocrand package: covariate balance, binomial tests, randomization inference methods (window selection & inference).
  - rdrandinf: inference using randomization inference methods.
  - rdwinselect: falsification testing and window selection.
  - rdsensitivity: treatment effect models over grid of windows, CI inversion.
  - rdrbounds: Rosenbaum bounds.

Randomized Control Trials

Notation: $(Y_i(0), Y_i(1), X_i)$, $i = 1, 2, \dots, n$.

Treatment: $T_i \in \{0, 1\}$, $T_i$ independent of $(Y_i(0), Y_i(1), X_i)$.

Data: $(Y_i, T_i, X_i)$, $i = 1, 2, \dots, n$, with

$$Y_i = \begin{cases} Y_i(0) & \text{if } T_i = 0 \\ Y_i(1) & \text{if } T_i = 1 \end{cases}$$

Average Treatment Effect:

$$\tau_{ATE} = E[Y_i(1) - Y_i(0)] = E[Y_i \mid T_i = 1] - E[Y_i \mid T_i = 0]$$

Experimental Design.
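The identity above says that, under randomization, a simple difference in observed group means recovers the ATE. A minimal Python sketch on simulated data follows; the data-generating process, the function names and the true effect of 2.0 are all hypothetical choices made for illustration, not part of the slides.

```python
import random
import statistics


def difference_in_means(y, t):
    """Estimate the ATE as mean(Y | T=1) - mean(Y | T=0)."""
    treated = [yi for yi, ti in zip(y, t) if ti == 1]
    control = [yi for yi, ti in zip(y, t) if ti == 0]
    return statistics.fmean(treated) - statistics.fmean(control)


def simulate_rct(n, true_effect, seed=0):
    """Hypothetical data: T_i is a fair coin flip, independent of (Y_i(0), Y_i(1), X_i)."""
    rng = random.Random(seed)
    y, t = [], []
    for _ in range(n):
        x = rng.gauss(0.0, 1.0)                    # covariate X_i
        y0 = 1.0 + 0.5 * x + rng.gauss(0.0, 1.0)   # potential outcome Y_i(0)
        y1 = y0 + true_effect                      # potential outcome Y_i(1)
        ti = 1 if rng.random() < 0.5 else 0        # randomized treatment
        y.append(y1 if ti == 1 else y0)            # only one outcome is observed
        t.append(ti)
    return y, t


y, t = simulate_rct(n=20000, true_effect=2.0)
ate_hat = difference_in_means(y, t)
print(f"estimated ATE: {ate_hat:.3f} (simulated truth: 2.0)")
```

Because treatment is assigned independently of the potential outcomes, no adjustment for $X_i$ is needed for the estimate to be close to the simulated effect.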

Sharp RD design

Notation: $(Y_i(0), Y_i(1), X_i)$, $i = 1, 2, \dots, n$, $X_i$ continuous.

Treatment: $T_i \in \{0, 1\}$, $T_i = 1(X_i \geq \bar{x})$.

Data: $(Y_i, T_i, X_i)$, $i = 1, 2, \dots, n$, with

$$Y_i = \begin{cases} Y_i(0) & \text{if } T_i = 0 \\ Y_i(1) & \text{if } T_i = 1 \end{cases}$$

Average Treatment Effect at the cutoff:

$$\tau_{SRD} = E[Y_i(1) - Y_i(0) \mid X_i = \bar{x}] = \lim_{x \downarrow \bar{x}} E[Y_i \mid X_i = x] - \lim_{x \uparrow \bar{x}} E[Y_i \mid X_i = x]$$

Quasi-Experimental Design: local randomization (more later).

[Figures: four plots of the outcome variable (Y) against the assignment variable (R); the last plot is annotated "Local Random Assignment".]

Empirical Illustration: Cattaneo, Frandsen & Titiunik (2015, JCI)

Problem: incumbency advantage (U.S. Senate).

Data:
  - $Y_i$ = election outcome.
  - $T_i$ = whether incumbent.
  - $X_i$ = vote share previous election ($\bar{x} = 0$).
  - $Z_i$ = covariates (demvoteshlag1, demvoteshlag2, dopen, etc.).

Potential outcomes:
  - $Y_i(0)$ = election outcome if had not been incumbent.
  - $Y_i(1)$ = election outcome if had been incumbent.

Causal Inference:

$$Y_i(0) \neq Y_i \mid T_i = 0 \qquad \text{and} \qquad Y_i(1) \neq Y_i \mid T_i = 1$$

Graphical and Falsification Methods

Always plot data: main advantage of RD designs!

Plot regression functions to assess treatment effect and validity.

Plot density of $X_i$ for assessing validity; test for continuity at cutoff and elsewhere.

Important: also use estimators that do not smooth out the data.

RD Plots (Calonico, Cattaneo & Titiunik, JASA):

  - Two ingredients: (i) smoothed global polynomial fit & (ii) binned discontinuous local-means fit.
  - Two goals: (i) detection of discontinuities, & (ii) representation of variability.
  - Two tuning parameters:
      * Global polynomial degree ($k_n$).
      * Location (ES or QS) and number of bins ($J_n$).
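The binned local-means ingredient of an RD plot can be sketched in a few lines of Python. This is only an illustration under stated assumptions: bins are evenly spaced (the ES option), the number of bins is fixed by hand at a hypothetical value of 10 per side rather than chosen by a data-driven rule, the global polynomial fit is omitted, and the data are simulated with a jump of 1.0 at the cutoff.

```python
import random


def binned_means(x, y, cutoff, n_bins, side):
    """Evenly spaced bins on one side of the cutoff; returns (bin midpoint, mean of y) pairs."""
    if side == "left":
        pts = [(xi, yi) for xi, yi in zip(x, y) if xi < cutoff]
        lo, hi = min(xi for xi, _ in pts), cutoff
    else:
        pts = [(xi, yi) for xi, yi in zip(x, y) if xi >= cutoff]
        lo, hi = cutoff, max(xi for xi, _ in pts)
    width = (hi - lo) / n_bins
    sums = [0.0] * n_bins
    counts = [0] * n_bins
    for xi, yi in pts:
        j = min(int((xi - lo) / width), n_bins - 1)  # clamp the right edge into the last bin
        sums[j] += yi
        counts[j] += 1
    # Bins never straddle the cutoff, so the local means stay discontinuous there.
    return [(lo + (j + 0.5) * width, sums[j] / counts[j])
            for j in range(n_bins) if counts[j] > 0]


# Hypothetical data: linear trend plus a jump of 1.0 at the cutoff 0.
rng = random.Random(1)
x = [rng.uniform(-1.0, 1.0) for _ in range(4000)]
y = [0.5 * xi + (1.0 if xi >= 0.0 else 0.0) + rng.gauss(0.0, 0.3) for xi in x]

left = binned_means(x, y, cutoff=0.0, n_bins=10, side="left")
right = binned_means(x, y, cutoff=0.0, n_bins=10, side="right")
gap = right[0][1] - left[-1][1]
print(f"gap between the two bins adjacent to the cutoff: {gap:.2f}")
```

Plotting the pairs in `left` and `right` as dots gives the unsmoothed part of the figure; the spread of the dots around any fitted curve is what conveys the variability of the data.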

Manipulation Tests & Covariate Balance and Placebo Tests

Density tests near cutoff:

  - Idea: distribution of running variable should be similar at either side of cutoff.
  - Method 1: Histograms & Binomial count test.
  - Method 2: Density Estimator at boundary.
      * Pre-binned local polynomial method: McCrary (2008).
      * New tuning-parameter-free method: Cattaneo, Jansson and Ma (2015).
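Method 1 can be illustrated with an exact binomial test on the counts just below and just above the cutoff. The sketch below assumes a success probability of 0.5 under the null of no sorting and a hand-picked window half-width; the function names and the toy data are hypothetical, and the exact computation is only meant for the small counts typical of narrow windows.

```python
from math import comb


def binomial_two_sided_pvalue(k, n, p=0.5):
    """Exact two-sided p-value: total probability of outcomes no more likely than k."""
    probs = [comb(n, j) * p**j * (1.0 - p) ** (n - j) for j in range(n + 1)]
    observed = probs[k]
    return min(1.0, sum(q for q in probs if q <= observed * (1.0 + 1e-9)))


def count_test(x, cutoff, half_width):
    """Count observations on each side of the cutoff inside the window and test for balance."""
    n_left = sum(1 for xi in x if cutoff - half_width <= xi < cutoff)
    n_right = sum(1 for xi in x if cutoff <= xi <= cutoff + half_width)
    p_value = binomial_two_sided_pvalue(n_right, n_left + n_right)
    return n_left, n_right, p_value


# Hypothetical running variable with 12 observations just below and 28 just above the cutoff.
x = [-0.004 * i for i in range(1, 13)] + [0.0015 * i for i in range(28)]
n_left, n_right, p_value = count_test(x, cutoff=0.0, half_width=0.05)
print(n_left, n_right, round(p_value, 4))
```

A small p-value here would suggest that units are more numerous on one side of the cutoff than chance alone would explain, which is the pattern manipulation of the running variable tends to produce.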

Placebo tests on pre-determined/exogenous covariates.

  - Idea: zero RD treatment effect for pre-determined/exogenous covariates.
  - Methods: global polynomial, local polynomial, randomization-based.

Placebo tests on outcomes.

  - Idea: zero RD treatment effect for outcome at values other than cutoff.
  - Methods: global polynomial, local polynomial, randomization-based.

Estimation and Inference Methods

Global polynomial approach (not recommended).

Robust local polynomial inference methods.

  - Bandwidth selection.
  - Bias-correction.
  - Confidence intervals.

Local randomization and randomization inference methods.

  - Window selection.
  - Estimation and Inference methods.
  - Falsification, sensitivity and related methods.

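Conventional Local-polynomial Approach

Idea: approximate regression functions for control and treatment units locally.

Local-linear estimator (w/ weights $K(\cdot)$):

$$-h_n \leq X_i < \bar{x}: \quad Y_i = \alpha_- + (X_i - \bar{x})\,\beta_- + \varepsilon_{-,i}$$

$$\bar{x} \leq X_i \leq h_n: \quad Y_i = \alpha_+ + (X_i - \bar{x})\,\beta_+ + \varepsilon_{+,i}$$

  - Treatment effect (at the cutoff): $\hat\tau_{SRD} = \hat\alpha_+ - \hat\alpha_-$

Can be estimated using linear models (w/ weights $K(\cdot)$):

$$Y_i = \alpha + \tau_{SRD}\, T_i + (X_i - \bar{x})\,\beta_1 + T_i\,(X_i - \bar{x})\,\gamma_1 + \varepsilon_i, \qquad -h_n \leq X_i \leq h_n$$

Once $h_n$ chosen, inference is standard: weighted linear models.

  - Details coming up next.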
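A minimal Python sketch of the local-linear point estimate follows. It fits a weighted line on each side of the cutoff and takes the difference of the two intercepts, which gives the same estimate as the pooled weighted regression with an interaction term. The triangular kernel, the bandwidth of 0.3 and the simulated data (true jump 1.5) are assumptions made for the example only.

```python
import random


def triangular_kernel(u):
    """K(u) = max(0, 1 - |u|); an assumed kernel choice."""
    return max(0.0, 1.0 - abs(u))


def weighted_line(d, y, w):
    """Weighted least squares of y on (1, d); returns (intercept, slope)."""
    s0 = sum(w)
    s1 = sum(wi * di for wi, di in zip(w, d))
    s2 = sum(wi * di * di for wi, di in zip(w, d))
    t0 = sum(wi * yi for wi, yi in zip(w, y))
    t1 = sum(wi * di * yi for wi, di, yi in zip(w, d, y))
    det = s0 * s2 - s1 * s1
    return (s2 * t0 - s1 * t1) / det, (s0 * t1 - s1 * t0) / det


def sharp_rd_local_linear(x, y, cutoff, h):
    """Difference of the right and left intercepts at the cutoff."""
    sides = {"left": ([], [], []), "right": ([], [], [])}
    for xi, yi in zip(x, y):
        wi = triangular_kernel((xi - cutoff) / h)
        if wi <= 0.0:
            continue  # outside the bandwidth
        d, ys, ws = sides["right" if xi >= cutoff else "left"]
        d.append(xi - cutoff)
        ys.append(yi)
        ws.append(wi)
    alpha_minus, _ = weighted_line(*sides["left"])
    alpha_plus, _ = weighted_line(*sides["right"])
    return alpha_plus - alpha_minus


# Hypothetical data: smooth regression function plus a jump of 1.5 at the cutoff 0.
rng = random.Random(2)
x = [rng.uniform(-1.0, 1.0) for _ in range(5000)]
y = [0.5 + 0.8 * xi + 0.3 * xi * xi + (1.5 if xi >= 0.0 else 0.0) + rng.gauss(0.0, 0.2)
     for xi in x]

tau_hat = sharp_rd_local_linear(x, y, cutoff=0.0, h=0.3)
print(f"local-linear RD estimate: {tau_hat:.3f} (simulated truth: 1.5)")
```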

Conventional Local-polynomial Approach
How to choose $h_n$?

Imbens & Kalyanaraman (2012, ReStud): optimal plug-in,

$$\hat h_{IK} = \hat C_{IK} \cdot n^{-1/5}$$

Calonico, Cattaneo & Titiunik (2014, ECMA): refinement of IK,

$$\hat h_{CCT} = \hat C_{CCT} \cdot n^{-1/5}$$

Ludwig & Miller (2007, QJE): cross-validation,

$$\hat h_{CV} = \arg\min_{h} \sum_{i=1}^{n} w(X_i)\,\big(Y_i - \hat\mu_1(X_i; h)\big)^2$$

Key idea: trade off bias and variance of $\hat\tau_{SRD}(h_n)$. Heuristically:

$$\uparrow \text{Bias}(\hat\tau_{SRD}) \implies \downarrow \hat h \qquad \text{and} \qquad \uparrow \text{Var}(\hat\tau_{SRD}) \implies \uparrow \hat h$$

Local-Polynomial Methods: Bandwidth Selection
Two main methods: plug-in & cross-validation. Both MSE-optimal in some sense.

Imbens & Kalyanaraman (2012, ReStud): propose MSE-optimal rule,

$$h_{MSE} = C_{MSE}^{1/5} \cdot n^{-1/5}, \qquad C_{MSE} = C(K) \cdot \frac{\text{Var}(\hat\tau_{SRD})}{\text{Bias}(\hat\tau_{SRD})^2}$$

  - IK implementation: first-generation plug-in rule.
  - CCT implementation: second-generation plug-in rule.
  - They differ in the way $\text{Var}(\hat\tau_{SRD})$ and $\text{Bias}(\hat\tau_{SRD})$ are estimated.

Imbens & Kalyanaraman (2012, ReStud): discuss cross-validation approach,

$$\hat h_{CV} = \arg\min_{h > 0} CV_\delta(h), \qquad CV_\delta(h) = \sum_{i=1}^{n} 1\big(X_{-,[\delta]} \leq X_i \leq X_{+,[\delta]}\big)\,\big(Y_i - \hat\mu(X_i; h)\big)^2,$$

where
  - $\hat\mu_{+,p}(x; h)$ and $\hat\mu_{-,p}(x; h)$ are local polynomial estimates.
  - $\delta \in (0, 1)$; $X_{-,[\delta]}$ and $X_{+,[\delta]}$ denote the $\delta$-th quantile of $\{X_i : X_i < \bar{x}\}$ and $\{X_i : X_i \geq \bar{x}\}$.
  - Our implementation uses $\delta = 0.5$; but this is a tuning parameter!
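The cross-validation criterion can be sketched as follows. This is an illustration under several assumptions that are not in the slides: each $Y_i$ is predicted by a one-sided local-linear fit that uses only observations farther from the cutoff than $X_i$ (so every prediction mimics a boundary estimate), the kernel is triangular, $\delta = 0.5$ is hard-coded through medians, and the bandwidth is picked from a small hypothetical grid.

```python
import math
import random
import statistics


def boundary_prediction(x, y, x0, h, use_left):
    """Local-linear prediction at x0 using only points strictly on one side of x0."""
    s0 = s1 = s2 = t0 = t1 = 0.0
    for xj, yj in zip(x, y):
        d = xj - x0
        on_side = (-h < d < 0.0) if use_left else (0.0 < d < h)
        if not on_side:
            continue
        w = 1.0 - abs(d) / h  # triangular weight (assumed kernel)
        s0 += w
        s1 += w * d
        s2 += w * d * d
        t0 += w * yj
        t1 += w * d * yj
    det = s0 * s2 - s1 * s1
    if det <= 1e-12:
        return None  # too few neighbours to fit a line
    return (s2 * t0 - s1 * t1) / det  # intercept = fitted value at x0


def cv_criterion(x, y, cutoff, h):
    """Sum of squared prediction errors over the observations between the two side medians."""
    lower = statistics.median([xi for xi in x if xi < cutoff])
    upper = statistics.median([xi for xi in x if xi >= cutoff])
    total = 0.0
    for xi, yi in zip(x, y):
        if not (lower <= xi <= upper):
            continue
        pred = boundary_prediction(x, y, xi, h, use_left=(xi < cutoff))
        if pred is None:
            return float("inf")
        total += (yi - pred) ** 2
    return total


# Hypothetical data with strong curvature, so very wide bandwidths should predict poorly.
rng = random.Random(3)
x = [rng.uniform(-1.0, 1.0) for _ in range(400)]
y = [math.sin(4.0 * xi) + (1.0 if xi >= 0.0 else 0.0) + rng.gauss(0.0, 0.1) for xi in x]

grid = [0.1, 0.2, 0.3, 0.5, 0.8]
scores = {h: cv_criterion(x, y, cutoff=0.0, h=h) for h in grid}
h_cv = min(grid, key=lambda h: scores[h])
print("cross-validated bandwidth:", h_cv)
```

Restricting the sum to observations near the cutoff is what makes the criterion target fit quality where the RD estimate is actually formed, rather than over the whole support of $X_i$.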
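
Conventional Approach to RD

Local-linear estimator (w/ weights $K(\cdot)$):

$$-h_n \leq X_i < \bar{x}: \quad Y_i = \alpha_- + (X_i - \bar{x})\,\beta_- + \varepsilon_{-,i}$$

$$\bar{x} \leq X_i \leq h_n: \quad Y_i = \alpha_+ + (X_i - \bar{x})\,\beta_+ + \varepsilon_{+,i}$$

  - Treatment effect (at the cutoff): $\hat\tau_{SRD} = \hat\alpha_+ - \hat\alpha_-$

Construct usual t-test. For $H_0: \tau_{SRD} = 0$,

$$\hat T(h_n) = \frac{\hat\tau_{SRD}}{\sqrt{\hat V_n}} = \frac{\hat\alpha_+ - \hat\alpha_-}{\sqrt{\hat V_{+,n} + \hat V_{-,n}}} \rightarrow_d N(0, 1)$$

95% Confidence interval:

$$\hat I(h_n) = \left[\, \hat\tau_{SRD} \pm 1.96 \sqrt{\hat V_n} \,\right]$$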
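The t-statistic and interval above can be assembled from the two one-sided fits, as in the Python sketch below. The slides do not say how $\hat V_{+,n}$ and $\hat V_{-,n}$ are computed; the heteroskedasticity-robust (HC0) sandwich variance of each weighted intercept used here is an assumption, as are the triangular kernel, the bandwidth and the simulated data.

```python
import math
import random


def side_fit(d, y, w):
    """Weighted least squares of y on (1, d); returns the intercept and its HC0 variance."""
    s0 = sum(w)
    s1 = sum(wi * di for wi, di in zip(w, d))
    s2 = sum(wi * di * di for wi, di in zip(w, d))
    t0 = sum(wi * yi for wi, yi in zip(w, y))
    t1 = sum(wi * di * yi for wi, di, yi in zip(w, d, y))
    det = s0 * s2 - s1 * s1
    a = (s2 * t0 - s1 * t1) / det
    b = (s0 * t1 - s1 * t0) / det
    m0 = m1 = m2 = 0.0
    for di, yi, wi in zip(d, y, w):
        g = (wi * (yi - a - b * di)) ** 2  # squared weighted residual
        m0 += g
        m1 += g * di
        m2 += g * di * di
    # [0, 0] element of (X'WX)^{-1} X'W diag(e^2) W X (X'WX)^{-1}
    var_a = (s2 * s2 * m0 - 2.0 * s1 * s2 * m1 + s1 * s1 * m2) / det**2
    return a, var_a


def conventional_rd(x, y, cutoff, h, z=1.96):
    left = ([], [], [])
    right = ([], [], [])
    for xi, yi in zip(x, y):
        wi = 1.0 - abs(xi - cutoff) / h  # triangular kernel weight (assumed)
        if wi <= 0.0:
            continue
        d, ys, ws = right if xi >= cutoff else left
        d.append(xi - cutoff)
        ys.append(yi)
        ws.append(wi)
    a_minus, v_minus = side_fit(*left)
    a_plus, v_plus = side_fit(*right)
    tau = a_plus - a_minus
    v = v_plus + v_minus  # the two sides use disjoint samples
    se = math.sqrt(v)
    return tau, v, tau / se, (tau - z * se, tau + z * se)


# Hypothetical data with a jump of 1.5 at the cutoff 0.
rng = random.Random(4)
x = [rng.uniform(-1.0, 1.0) for _ in range(5000)]
y = [0.5 + 0.8 * xi + (1.5 if xi >= 0.0 else 0.0) + rng.gauss(0.0, 0.2) for xi in x]

tau_hat, v_hat, t_stat, ci = conventional_rd(x, y, cutoff=0.0, h=0.3)
print(f"tau = {tau_hat:.3f}, t = {t_stat:.1f}, 95% CI = ({ci[0]:.3f}, {ci[1]:.3f})")
```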

Bias-Correction Approach to RD

Note well: for usual t-test,

$$\hat T(h_{MSE}) = \frac{\hat\tau_{SRD}}{\sqrt{\hat V_n}} \rightarrow_d N(B, 1) \neq N(0, 1), \qquad B > 0$$

  - Bias $B$ in RD estimator captures curvature of regression functions.

Undersmoothing/Small Bias Approach: choose smaller $h_n$... Perhaps $\hat h_n = 0.5 \cdot \hat h_{IK}$?

$\implies$ No clear guidance & power loss!

Bias-correction Approach:

$$\hat T^{bc}(h_n, b_n) = \frac{\hat\tau_{SRD} - \hat B_n}{\sqrt{\hat V_n}} \rightarrow_d N(0, 1)$$

$\implies$ 95% Confidence Interval: $\hat I^{bc}(h_n, b_n) = \left[\, (\hat\tau_{SRD} - \hat B_n) \pm 1.96 \sqrt{\hat V_n} \,\right]$

How to choose $b_n$? Same ideas as before... $\hat b_n = \hat C \cdot n^{-1/7}$
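
Robust Bias-Correction Approach to RD
Recall:

$$\hat T(h_n) = \frac{\hat\tau_{SRD}}{\sqrt{\hat V_n}} \rightarrow_d N(0, 1) \qquad \text{and} \qquad \hat T^{bc}(h_n, b_n) = \frac{\hat\tau_{SRD} - \hat B_n}{\sqrt{\hat V_n}} \rightarrow_d N(0, 1)$$

  - $\hat B_n$ is constructed to estimate leading bias $B$.

Robust approach:

$$\hat T^{bc}(h_n, b_n) = \frac{\hat\tau_{SRD} - \hat B_n}{\sqrt{\hat V_n}} = \underbrace{\frac{\hat\tau_{SRD} - B_n}{\sqrt{\hat V_n}}}_{\rightarrow_d\, N(0, 1)} + \underbrace{\frac{B_n - \hat B_n}{\sqrt{\hat V_n}}}_{\rightarrow_d\, N(0, \gamma)}$$

Robust bias-corrected t-test:

$$\hat T^{rbc}(h_n, b_n) = \frac{\hat\tau_{SRD} - \hat B_n}{\sqrt{\hat V_n + \hat W_n}} = \frac{\hat\tau_{SRD} - \hat B_n}{\sqrt{\hat V_n^{bc}}} \rightarrow_d N(0, 1)$$

$\implies$ 95% Confidence Interval:

$$\hat I^{rbc}(h_n, b_n) = \left[\, (\hat\tau_{SRD} - \hat B_n) \pm 1.96 \sqrt{\hat V_n^{bc}} \,\right], \qquad \hat V_n^{bc} = \hat V_n + \hat W_n$$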
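
Local-Polynomial Methods: Robust Inference

Approach 1: Undersmoothing/Small Bias.

$$\hat I(h_n) = \left[\, \hat\tau_{SRD} \pm 1.96 \sqrt{\hat V_n} \,\right]$$

Approach 2: Bias correction (not recommended).

$$\hat I^{bc}(h_n, b_n) = \left[\, (\hat\tau_{SRD} - \hat B_n) \pm 1.96 \sqrt{\hat V_n} \,\right]$$

Approach 3: Robust Bias correction.

$$\hat I^{rbc}(h_n, b_n) = \left[\, (\hat\tau_{SRD} - \hat B_n) \pm 1.96 \sqrt{\hat V_n + \hat W_n} \,\right]$$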
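To see how the three intervals relate, the short Python sketch below computes them side by side from given ingredients. It does not estimate anything: the point estimate, bias estimate and the two variance terms are made-up numbers, and the function name is hypothetical.

```python
from math import sqrt


def rd_intervals(tau_hat, bias_hat, v_hat, w_hat, z=1.96):
    """Return the conventional, bias-corrected and robust bias-corrected intervals."""
    se = sqrt(v_hat)
    se_rbc = sqrt(v_hat + w_hat)  # extra term accounts for estimating the bias
    centered = tau_hat - bias_hat
    return {
        "conventional": (tau_hat - z * se, tau_hat + z * se),
        "bias_corrected": (centered - z * se, centered + z * se),
        "robust_bias_corrected": (centered - z * se_rbc, centered + z * se_rbc),
    }


# Made-up inputs, for illustration only.
intervals = rd_intervals(tau_hat=7.4, bias_hat=0.6, v_hat=2.25, w_hat=1.0)
for name, (lo, hi) in intervals.items():
    print(f"{name:>22}: [{lo:.2f}, {hi:.2f}]")
```

Approaches 2 and 3 share the same center; the robust interval is wider only because its standard error adds $\hat W_n$ to $\hat V_n$.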

Local-randomization approach and finite-sample inference

Popular approach: local-polynomial methods.

  - Approximates regression function and relies on continuity assumptions.
  - Requires: choosing weights, bandwidth and polynomial order.

Alternative approach: local-randomization + randomization-inference.

  - Gives an alternative that can be used as a robustness check.
  - Key assumption: exists window $W = [-h_n, h_n]$ around cutoff ($-h_n < \bar{x} < h_n$) where

    $T_i$ independent of $(Y_i(0), Y_i(1))$ (for all $X_i \in W$)

  - In words: treatment is randomly assigned within $W$.
  - Good news: if plausible, then RCT ideas/methods apply.
  - Not-so-good news: most plausible for very small windows (very few observations).
  - One solution: employ small window but use randomization-inference methods.
  - Requires: choosing randomization rule, window and statistic.

Local-randomization approach and finite-sample inference

Recall key assumption: exists $W = [-h_n, h_n]$ around cutoff ($-h_n < \bar{x} < h_n$) where

$T_i$ independent of $(Y_i(0), Y_i(1))$ (for all $X_i \in W$)

How to choose window?

  - Use balance tests on pre-determined/exogenous covariates.
  - Very intuitive, easy to implement.

How to conduct inference? Use randomization-inference methods.

  1. Choose statistic of interest. E.g., t-stat for difference-in-means.
  2. Choose randomization rule. E.g., number of treatments and controls given.
  3. Compute finite-sample distribution of statistics by permuting treatment assignments.
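The three steps can be sketched in Python as a permutation test inside a fixed window. The choices here are assumptions for illustration: the statistic is the plain difference in means (not the t-statistic), the randomization rule keeps the number of treated and control units fixed, the permutation distribution is approximated by random draws rather than enumerated, and the window half-width and data are hypothetical.

```python
import random
import statistics


def diff_in_means(y, t):
    treated = [yi for yi, ti in zip(y, t) if ti == 1]
    control = [yi for yi, ti in zip(y, t) if ti == 0]
    return statistics.fmean(treated) - statistics.fmean(control)


def randomization_test(x, y, cutoff, half_width, n_draws=2000, seed=0):
    """Permutation p-value for the sharp null of no effect, using units inside the window only."""
    inside = [(xi, yi) for xi, yi in zip(x, y) if abs(xi - cutoff) <= half_width]
    y_w = [yi for _, yi in inside]
    t_w = [1 if xi >= cutoff else 0 for xi, _ in inside]
    observed = diff_in_means(y_w, t_w)

    rng = random.Random(seed)
    labels = list(t_w)
    extreme = 0
    for _ in range(n_draws):
        rng.shuffle(labels)  # keeps the number of treated and control units fixed
        if abs(diff_in_means(y_w, labels)) >= abs(observed):
            extreme += 1
    p_value = (extreme + 1) / (n_draws + 1)  # Monte Carlo p-value, never exactly zero
    return observed, p_value


# Hypothetical window with 20 units on each side of the cutoff and a true effect of 2.0.
rng = random.Random(7)
x = [-0.05 + 0.1 * (i + 0.5) / 40 for i in range(40)]
y = [(2.0 if xi >= 0.0 else 0.0) + rng.gauss(0.0, 0.5) for xi in x]

observed, p_value = randomization_test(x, y, cutoff=0.0, half_width=0.05)
print(f"difference in means: {observed:.2f}, randomization p-value: {p_value:.4f}")
```

Because the reference distribution comes from reshuffling the observed assignments, the test does not rely on large-sample approximations, which is the reason it suits windows with very few observations.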

Local-randomization approach and finite-sample inference

Do not forget to validate & falsify the empirical strategy.

  1. Plot data to make sure local-randomization is plausible.
  2. Conduct placebo tests (e.g., use pre-intervention outcomes or other covariates not used to select $W$).
  3. Do sensitivity analysis.

See Cattaneo, Frandsen and Titiunik (2015) for introduction.

See Cattaneo, Titiunik and Vazquez-Bare (2015) for further results and implementation.
