Discussion Paper Series: Face Masks Considerably Reduce COVID-19 Cases in Germany: A Synthetic Control Method Approach
Discussion Paper Series: Face Masks Considerably Reduce COVID-19 Cases in Germany: A Synthetic Control Method Approach
Timo Mitze
Reinhold Kosfeld
Johannes Rode
Klaus Wälde
JUNE 2020
JUNE 2020
Any opinions expressed in this paper are those of the author(s) and not those of IZA. Research published in this series may
include views on policy, but IZA takes no institutional policy positions. The IZA research network is committed to the IZA
Guiding Principles of Research Integrity.
The IZA Institute of Labor Economics is an independent economic research institute that conducts research in labor economics
and offers evidence-based policy advice on labor market issues. Supported by the Deutsche Post Foundation, IZA runs the
world’s largest network of economists, whose research aims to provide answers to the global labor market challenges of our
time. Our key objective is to build bridges between academic research, policymakers and society.
IZA Discussion Papers often represent preliminary work and are circulated to encourage discussion. Citation of such a paper
should account for its provisional character. A revised version may be available directly from the author.
ISSN: 2365-9793
ABSTRACT
Face Masks Considerably Reduce
COVID-19 Cases in Germany:
A Synthetic Control Method Approach1
We use the synthetic control method to analyze the effect of face masks on the spread of
Covid-19 in Germany. Our identification approach exploits regional variation in the point
in time when face masks became compulsory. Depending on the region we analyse, we
find that face masks reduced the cumulative number of registered Covid-19 cases between
2.3% and 13% over a period of 10 days after they became compulsory. Assessing the
credibility of the various estimates, we conclude that face masks reduce the daily growth
rate of reported infections by around 40%.
Corresponding author:
Klaus Wälde
Gutenberg School of Management and Economics
Johannes Gutenberg Universität Mainz
Jakob-Welder-Weg 4
D-55131 Mainz
Germany
E-mail: [email protected]
1
Klaus Wälde has been acting as an IZA Visiting Research Fellow since March 2020.
2
This is similar to the setup in Abadie et al. (2010), who study the effect of an increase in the tobacco tax in
California. The tobacco tax was decided upon by the state government.
3
The main channel through which masks reduce transmission of SARS-CoV-2 is the reduction in aerosols and
droplets, as argued by Prather et al. (2020).
April 27
Saarland, Baden Württemberg, Rheinland-Palatine, Bavaria, Lower
April 22 Saxony, Brandenburg, Bremen, Hamburg, Hessia, Mecklenburg-
Saxony-Anhalt Western Pomerania, Northrhine-Westphalia, Berlin ( public transport)
April 20 April 24
Saxony Thuringia April 29
Schleswig-Holstein, Berlin (shopping malls)
Figure 1: The timing of mandatory mask wearing in federal states (top) and individual regions (below)
SCM identifies synthetic control groups for the treated unit(s) to build a counterfactual. In our
case, we need to find a group of regions that have followed the same Covid-19 trend as treated
units before mandatory masks in the latter. This control group would then most likely have had
the same behavior as treated unit(s) in the absence of the mask obligation. We can then use
this group to ‘synthesize’ the treated unit and conduct causal inference. The synthetic control
group is thereby constructed as an estimated weighted average of all regions in which masks
did not become compulsory earlier on. Historical realizations of the outcome variable and
several other predictor variables that are relevant in determining outcome levels allow us to
generate the associated weights, which result from minimizing a pre-treatment prediction
error function (see Abadie and Gardeazabal, 2003, Abadie et al., 2010 and Abadie, 2019 for
methodical details).
Data. We use the official German statistics on reported Covid-19 cases from the Robert Koch
Institute (RKI, 2020). The RKI collects the data from local health authorities and provides
updates on a daily basis. Using these data (available via API), we build a balanced panel for 401
NUTS Level 3 regions and 95 days spanning the period from January 28 to May 1, 2020 (38,095
observations). We use the cumulative number of registered Covid-19 cases in each district as
main outcome variable. 5 We estimate overall effects for this variable together with
disaggregated effects by age groups (persons aged 15-34 years, 35-59 years and 60+ years). As
an alternative outcome variable, we also use the cumulative incidence rate. Table 1 shows
summary statistics for both variables for our sample period.
Table 1 also presents our other predictor variables. We focus on factors that are likely to
describe the regional number and dynamics of reported Covid-19 cases. Obviously, past values
4
Friedson et al. (2020) employ the SCM to estimate the effect of the shelter-in-place order for California in the
development of Covid-19. The authors find inter alia that around 1600 deaths from Covid-19 were avoided by this
measure during the first four weeks.
5
We are aware of the existence of hidden infections. As it appears plausible to assume that they are proportional
to observed infections across regions, we do not believe that they affect our results. We chose the date of
reporting (as opposed to date of infections) because not all reported infections include information about the date
of infection.
Table 1: Summary Statistics of Covid-19 indicators (outcome variables) and predictors characterizing the
regional demographic structure and basic health care system
Mean S.D. Min. Max.
PANEL A: Data on registered Covid-19 cases
[1] Newly registered cases per day 4.13 10.66 0 310
[2] Cumulative number of cases 120.86 289.07 0 5795
[3] Cum. cases [2] per 100,000 inhabitants 59.87 106.80 0 1,530.32
PANEL B: Regional demographic structure and local health care system
Population density (inhabitants/km2) 534.79 702.40 36.13 4,686.17
Population share of highly educated* individuals (in %) 13.07 6.20 5.59 42.93
Share of females in population (in %) 50.59 0.64 48.39 52.74
Average age of females in population (in years) 45.86 2.11 40.70 52.12
Average age of males in population (in years) 43.17 1.83 38.80 48.20
Old-age dependency ratio (persons aged 65 years and 34.34 5.46 22.40 53.98
above per 100 of population age 15-64)
Young-age dependency ratio (persons aged 14 years 20.54 1.44 15.08 24.68
and below per 100 of population age 15-64)
Physicians per 10,000 of population 14.58 4.41 7.33 30.48
Pharmacies per 100,000 of population 27.01 4.90 18.15 51.68
Settlement type (categorial variable$) 2.59 1.04 1 4
Notes: * = International Standard Classification of Education (ISCED) Level 6 and above; $ = categories are based
on population shares and comprise 1) district-free cities (kreisfreie Großstädte), 2) urban districts (städtische
Kreise), 3) rural districts (ländliche Kreise mit Verdichtungsansätzen), 4) sparsely populated rural districts (dünn
besiedelte ländliche Kreise).
Inference thereby relies on permutation tests and follows the procedures suggested by Cavallo
et al. (2013) and applied, for example, by Eliason and Lutz (2018) or Hu et al. (2018). For both
the single and multiple treatment applications we estimate placebo-treatment effects for each
district in which masks did not become compulsory early on. These placebo treatments should
be small, relative to the treated regions. We calculate significance levels for the test of the
hypothesis that the mask obligation did not significantly affect reported Covid-19 cases. This
provides us with p−values for each day, which capture the estimated treatment effect on
reported Covid-19 cases from placebo regions. The p-values are derived from a ranking of the
actual treatment effect within the distribution of placebo treatment effects. We follow the
suggestion in Galiani and Quistorff (2017) and compute adjusted p-values taking the pre-
treatment match quality of the placebo treatments into account. 6
6
We conduct all estimations in STATA using “Synth” and “Synth Runner” packages (Abadie at al., 2020, Galiani and
Quistorff, 2017). Data and estimation files can be obtained from the authors upon request.
7
The pre-treatment root mean square prediction error (RMSPE) of 3.145 is significantly below a benchmark RMSPE
of 6.669, which has been calculated as the average RMSPE for all 401 regions in the pre-treatment period until
April 6. This points to the relatively good fit of the synthetic control group for Jena in this period.
200
Cumulative number Covid-19 cases
150
150
100
100
50
50
0
Figure 2: Treatment effects of mandatory face masks in Jena on April 6 and start of campaign on March
30 (see Table A3 and appendix B.2 for details)
8
See https://www.tagesschau.de/inland/corona-maskenpflicht-103.html. Last accessed May 05, 2020.
9
See https://www.jenaer-nachrichten.de/stadtleben/13069-jena-zeigt-maske-kampagne-f%C3%BCr-mund-
schutz-startet. Last accessed May 05, 2020.
Panel A: Cross-validation for changes in predictors Panel B: Placebo-in-time test (20 days in advance)
250
200
Cumulative number Covid-19 cases
100
150
50
100
Jena Jena
synthetic control unit (baseline) synthetic control unit
synthetic (lag: +1 day)
synthetic (lag: +3 days)
synthetic (lag: +7 days)
Another important factor for the validity of the results is that we do not observe an anticipation
effect for Jena prior to the announcement day. We test for a pseudo-treatment in Jena over a
period of 20 days before the introduction of face masks. This period is equally split into a pre-
10
300
400
Diff. (treated - synthetic control)
100
0
0
-200 -100
-200
Jena Other NUTS3 regions Jena Only larger cities (krsf. Städte)
Panel C: Significance levels for full sample [Panel A] Panel D: Significance levels for sample of large cities [Panel B]
.5
.5
.4
.4
Adjusted P-values
Adjusted P-values
.3
.3
.2
.2
.1
.1
0
0 5 10 15 20 5 10 15 20
Number of days after introduction of face masks Number of days after introduction of face masks
Figure 4: Comprehensive placebo-in-space tests for the effect of face masks on Covid-19 cases
Notes: Graphs exclude the following regions with a very large number of registered Covid-19 cases: Hamburg
(2000), Berlin (11000), Munich (9162), Cologne (5315) and Heinsberg (5370). In line with Abadie et al. (2010), we
only include placebo effects in the pool for inference if the match quality (pre-treatment RMSPE) of the specific
control regions is smaller than 20 times the match quality of the treated unit. P-values are adjusted for the quality
of the pre-treatment matches (see Galiani and Quistorff, 2017).
Treatment in other districts. Jena may be a unique case. We therefore also study treatment effects
for other regions that have antedated the general introduction of face masks in Germany. Further
single unit treatment analyses are shown in appendix C. Multiple unit treatments are studied in two
ways. The first sample covers all 401 regions and 32 treated units. The second focused on the
subsample of 105 larger cities (kreisfreie Städte), of which 8 are treated units. Treated regions
introduced face masks by April 22. The multiple treatment approach, visible in Figure 5, points to a
significant face mask-effect in the reduction of Covid-19 infections. The adjusted p-values indicate
that the estimated treatment effects are not random.
Face masks may have made a particular difference in the spread of Covid-19, particularly in
larger cities with higher population density and accordingly higher intensity of social
interaction. 10 Over a period of 10 days, we observe an average reduction of 12.3 cases between
treated and control regions. Relative to the average number of cumulative Covid-19 cases on
May 1 in control regions (295.6), this amounts to a reduction of 4.2% of cases. The daily growth
10
This is perfectly in line with Prather et al. (2020) given the reduction in aerosols and droplets via using masks.
11
260
300
Cumulative number Covid-19 cases
240
280
treated
220
260
synthetic
200
240
control
unit
180
220
-10 -5 0 5 10 -10 -5 0 5 10
5
0
0
-2
-5
-10
-4
-15
-6
-10 -5 0 5 10 -10 -5 0 5 10
Number of days before/after introduction of face masks Number of days before/after introduction of face masks
.1
.5
Adjusted P-values
Adjusted P-values
1 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8 9 10
Number of days after introduction of face masks Number of days after introduction of face masks
Figure 5: Average treatment effects for introduction of face masks (multiple treated units)
Notes Statistical inference for adjusted p-values has been conducted on the basis of a random sample of 1,000,000
placebo averages.
4 Conclusion
We set out by analyzing the city of Jena. The introduction of face masks on 6 April reduced the
number of new infections over the next 20 days by almost 25% relative to the synthetic control
group. This corresponds to a reduction in the average daily growth rate of the total number of
reported infections by 1.32 percentage points. Comparing the daily growth rate in the synthetic
control group with the observed daily growth rate in Jena, the latter shrinks by around 60% due
to the introduction of face masks. This is a sizeable effect. Wearing face masks apparently
helped considerably in reducing the spread of Covid-19. Looking at single treatment effects for
all other regions puts this result in some perspective. The reduction in the growth rate of
infections amounts to 20% only. By contrast, when we take the multiple treatment effect for
larger cities into account, we find a reduction in the growth rate of infections by around 40%.
12
References
Abadie A. (2019), Using Synthetic Controls: Feasibility, Data Requirements, and Methodological
Aspects. Article prepared for the Journal of Economic Literature.
https://economics.mit.edu/files/17847
Abadie A., & Gardeazabal J. (2003), The Economic Costs of Conflict: A Case Study of the Basque
Country. American Economic Review, 93(1): 113–132.
https://www.aeaweb.org/articles?id=10.1257/000282803321455188
Abadie A., A. Diamond, & J. Hainmueller (2010), Synthetic Control Methods for Comparative Case
Studies: Estimating the Effect of California’s Tobacco Control Program. Journal of the American
Statistical Association, 105(490): 493–505. https://doi.org/10.1198/jasa.2009.ap08746
Abadie, A., Diamond, A., & Hainmueller, J. (2020), Synth: Stata module to implement synthetic control
methods for comparative case studies. Revised version 2020-05-09.
https://econpapers.repec.org/software/bocbocode/s457334.htm
Becker S., Heblich S., & Sturm D. (2018), The Impact of Public Employment: Evidence from Bonn,
CESifo Working Paper Series 6841, CESifo Group Munich. http://ftp.iza.org/dp11255.pdf
Campos N., Coricelli F., & Moretti L. (2019), Institutional integration and economic growth in Europe.
Journal of Monetary Economics, 103: 88–104. https://doi.org/10.1016/j.jmoneco.2018.08.001
13
14
(a) University of Southern Denmark, RWI and RCEA, (b) University of Kassel, (c) TU Darmstadt
(d) Johannes Gutenberg University Mainz, CESifo and Visiting Research Fellow IZA
15
Table A1: Overview of dates when masks became compulsory in federal states and districts
Introduction Difference
Public Individual of face in days to
Federal State transport Sales shops NUTS3 region masks state
Baden-Wurttemberg 27.04.2020 27.04.2020 LK Rottweil 17.04.2020 10
Bavaria 27.04.2020 27.04.2020
Berlin 27.04.2020 29.04.2020
Brandenburg 27.04.2020 27.04.2020
Bremen 27.04.2020 27.04.2020
Hamburg 27.04.2020 27.04.2020
Hesse 27.04.2020 27.04.2020 Main-Kinzig-Kreis 20.04.2020 7
Mecklenburg-West Pomer. 27.04.2020 27.04.2020
Lower Saxony 27.04.2020 27.04.2020 Wolfsburg 20.04.2020 7
Braunschweig 25.04.2020 2
North Rhine-Westphalia 27.04.2020 27.04.2020
Rheinland-Pfalz 27.04.2020 27.04.2020
Saarland 27.04.2020 27.04.2020
Saxony 20.04.2020 20.04.2020
Saxony-Anhalt 22.04.2020 22.04.2020
Schleswig-Holstein 29.04.2020 29.04.2020
Thuringia 24.04.2020 24.04.2020 Jena 06.04.2020 18
Nordhausen 14.04.2020 10
Notes: A comprehensive overview of all public health measures introduced in German federal states and individual
regions is given in Kleyer et al. (2020).
16
This appendix presents supporting findings for the comparative case study of Jena.
B.1. Covid-19 cases and cumulative incidence rate in Jena and Germany on April 5
Panel A: Cumulative number Covid-19 cases (April 5) Panel B: Cumulative Incidence Rate (April 5)
600
300
400
200
Jena
200
100
Jena
0
Figure A1: Box plots for distribution of Covid-19 cases across German NUTS3 regions (April 5)
17
This appendix shows the balancing properties of the SCM approach together with the root
mean square percentage error (RMSPE) as a measure for the quality of the pre-treatment
prediction.
Table A2: Pre-treatment predictor balance and RMSPE for SCM in Figure 2
Introduction of Announcement/
Treatment:
face masks start of campaign
Jena Synthetic Jena Synthetic
control group control group
Cumulative number of registered Covid-19
cases (one and seven days before start of 129.5 129.2 93 92.7
treatment, average)
Number of newly registered Covid-19
cases (last seven days before the start of 3.7 3.8 5 5.2
the treatment, average)
Population density (Population/km2) 38.4 22.8 968.1 947.9
Share of highly educated population (in %) 968.1 1074.3 38.4 26.3
Share of female in population (in %) 50.1 50.1 50.1 50.1
Average age of female population (in
43.5 43.7 43.5 43.9
years)
Average age of male population (in years) 40.5 40.6 40.5 40.8
Old-age dependency ratio (in %) 32.1 29.3 32.1 29.8
Young-age dependency ratio (in %) 20.3 19.6 20.3 19.5
Physicians per 10,000 of population 20.5 19.8 20.5 20.8
Pharmacies per 100,000 of population 28.8 28.7 28.8 28.6
Settlement type (categorial variable) 1 1.3 1 1.9
RMSPE (pre-treatment) 3.145 4.796
Notes: Donor pool includes all other German NUTS3 regions except the two immediate neighboring regions of
Jena (Weimarer Land, Saale-Holzland-Kreis) as well as the regions Nordhausen and Rottweil since the latter regions
introduced face masks in short succession to Jen on April 14 and April 17.
18
Table A3: Distribution of sample weights in donor pool for synthetic Jena
Introduction of face masks (Panel A in Figure 2)
ID NUTS 3 region Weight
13003 Rostock 0.326
6411 Darmstadt 0.311
3453 Cloppenburg 0.118
7211 Trier 0.117
6611 Kassel 0.082
5370 Heinsberg 0.046
Note: Donor pools corresponds to SCM estimation in Panel A
of Figure 2. Sample weights are chosen to minimize the
RMSPE ten days prior to the start of the treatment.
These numbers are computed in an Excel-file available on the web pages of the authors.
19
100
80
80
70
60
60
40
20
50
80
60
60
50
40
40
20
30
0
March 27 April 6 April 16 April 26 March 17 March 30 April 6 April 16 April 26
30
50
40
20
30
10
20
10
Figure A2: Treatment effects for introduction and announcement of face masks in Jena
Notes: Predictor variables are chosen as for overall specification shown in Figure 2.
Table A5: Sample weights in donor pool for synthetic Jena (cumulative Covid-19 cases; by age groups)
Age Group 15-34 years Age Group 35-59 years Age Group 60 years and above
ID NUTS 3 region Weight ID NUTS 3 region Weight ID NUTS 3 region Weight
1001 Flensburg 0.323 6411 Darmstadt 0.528 6411 Darmstadt 0.522
7211 Trier 0.207 16055 Weimar 0.16 16055 Weimar 0.244
Neustadt a.d.
13003 Rostock 0.184 14511 Chemnitz 0.15 7316 0.068
Weinstraße
5370 Heinsberg 0.142 8221 Baden-Baden 0.07 9562 Erlangen 0.06
Hochtaunus-
3453 Cloppenburg 0.107 6434 0.062 3356 Osterholz 0.056
kreis
Offenbach am
6413 0.038 8435 Bodenseekreis 0.029 5515 Münster 0.027
Main
5370 Heinsberg 0.001 9188 Starnberg 0.022
Note: Donor pools corresponds to SCM estimations in Figure A2. Sample weights are chosen to minimize the
RMSPE ten days prior to the start of the treatment.
20
80
Cumulative Incidence Rate
70
140
120
60
100
50
80
Panel C: Persons aged 35-59 years Panel D: Persons aged 60 years and above
60
40
Cumultative Incidence Rate
Cumulative Incidence Rate
50
30
40
20
30
10
20
Figure A3: Treatment effects for introduction of face masks on cumulative incidence rate
Notes: See Table 1 for a definition of the incidence rate. Predictor variables are chosen as for overall specification
shown in Figure 2.
Table A6: Sample weights in donor pool for synthetic Jena (cumulative incidence rate)
ID NUTS 3 region Weight
6411 Darmstadt 0.46
15003 Magdeburg 0.171
5370 Heinsberg 0.133
13003 Rostock 0.093
5515 Münster 0.066
11000 Berlin 0.035
12052 Cottbus 0.032
6611 Kassel 0.011
Note: Donor pools corresponds to SCM estimation in Figure A3. Sample
weights are chosen to minimize the RMSPE ten days prior to the start of
the treatment.
21
March 31 April 22
80
60
40
20
0
Search: "Face Mask" Search: "Buy Face Mask" Search: "Buy Mask"
Figure A4: Online search for face masks and purchase options according to Google Trends
Note: Online search for keywords (in German) as shown in the legend as Face Mask (“Mund.-Nasen-Schutz”),
Buy Face Mask (“Mundschutz kaufen”) and Buy mask (“Maske kaufen”); alternative keywords show similar peaks
but with a lower number of hits; based on data from Google Trends (2020).
22
300
250
200
150
100
50
Figure A5: Treatment effects for changes in donor pool used to construct synthetic Jena
Notes: See main text for a detailed definition of the respective donor pools. Predictor variables are chosen as for
overall specification shown in Figure 2.
Table A8: Sample weights for alternative donor pools used to construct synthetic Jena
Only Thuringia Excluding Thuringia Only larger cities
ID NUTS 3 region Weight ID NUTS 3 region Weight ID NUTS 3 region Weight
16076 Greiz 0.533 13003 Rostock 0.318 6411 Darmstadt 0.504
16051 Erfurt 0.467 6411 Darmstadt 0.302 13003 Rostock 0.304
7211 Trier 0.129 5113 Essen 0.192
3453 Cloppenburg 0.122
6611 Kassel 0.083
5370 Heinsberg 0.046
Only East Germany Only West Germany
ID NUTS 3 region Weight ID NUTS 3 region Weight
16051 Erfurt 0.865 6411 Darmstadt 0.242
14612 Dresden 0.124 3402 Emden 0.198
11000 Berlin 0.011 6611 Kassel 0.169
7211 Trier 0.168
4012 Bremerhaven 0.167
5370 Heinsberg 0.057
Note: Donor pools corresponds to SCM estimations in Figure A5. Sample weights are chosen to minimize the
RMSPE ten days prior to the start of the treatment.
23
150
Cumulative number Covid-19 cases
120
100
100
50
80
60
0
March 27 April 6 April 16 April 26 March 27 April 6 April 16 April 26
200
150
80
100
40 60
50
20
Figure A6: Placebo tests for the effect of face masks in other cities in Thuringia on April 6.
Notes: For the placebo tests in the other cities in Thuringia the same set of predictors as for Jena (Figure 2) has
been applied. The reported regions cover all kreisfreie Städte plus Gotha (Landkreis). The cities Weimar, Suhl and
Eisenach have been aggregated since the number of reported Covid-19 is low in these cities.
24
25
600
50
500
40
400
30
300
20
300
500
280
400
260
300
240
200
220
Figure A7: Treatment effects for introduction of face masks in other cities
Notes: Nordhausen (Thuringia, April 14, top left), Rottweil (Baden Württemberg, April 17, top right), Wolfsburg
(Lower Saxony, April 20, middle left), Main-Kinzig-Kreis (Hessia, April 20, middle right). Predictor variables are
chosen as for overall specification shown in Figure 2.
As the figure shows, the result is 2:1:1. Rottweil and Wolfsburg display a positive effect of
mandatory mask wearing, just as Jena. The results in Nordhausen are very small or unclear. In
the region of Main-Kinzig, it even seems to be the case that masks increased the number of
cases relative to the synthetic control group. As all of these regions introduced masks after
Jena, the time period available to identify effects is smaller than for Jena. The effects of
mandatory face masks could also be underestimated as announcement effects and learning
from Jena might have induced individuals to wear masks already before they became
mandatory. Finally, the average pre-treatment RMSPE for these four regions (7.150) is larger
than for the case of Jena (3.145). For instance, in the case of the region of Main-Kinzig it is more
than three times as high (9.719), which indicates a lower pre-treatment fit. The obtained
treatment effects should then be interpreted with some care as the pre-sample error could also
translate into the treatment period. In order to minimize the influence of a poor pre-treatment
fit for some individual regions, the main text therefore compares the results in Jena mainly with
a multiple unit treatment approach.
26
27
11
In a short note, Hartl and Weber (2020) apply panel methods based on time dummies to understand the relative
importance of various public health measures. They employ data at the federal state level and not at the regional
level. As a detailed model description is not available, an appreciation of results is difficult at this point.
28
29