
Machine Learning for Econometrics

Introduction

Christophe Gaillac

Autumn 2024
Motivation

Econometrics and machine learning (ML) share many statistical
tools, but (usually) have different 1) purposes and 2) approaches to
building models.
Econometrics, first and foremost, aims to quantify a precisely
defined effect, with a focus on statistical inference:
What is the impact of a minimum wage increase on employment?
Are there peer effects among groups of students?
What is the average wage gap between women and men?
Empirical economics has focused on measuring causal effects,
stressing the identification strategy.
The goal of ML is to build a model that allows one to obtain the
best possible predictive performance for a given problem:
computational constraints when calling the model (runtime or
inference-time performance)
focus on algorithms rather than models

Motivation

Econometric models are preferably simpler, with an emphasis on
linearity, and their structure is usually motivated by a theory of the
underlying causal relationship or of individual behavior.
In ML, prediction performance is usually the only criterion for
selecting a model.

Some nuances:
Several scandals involving the use of algorithms have highlighted the
need to be able to explain their predictions (so-called explainability),
and to study and correct their biases (see the fairness and bias
mitigation literature).
“p-hacking” (i.e., the practice of embarking on a specification search
to obtain significant results), or more generally the replicability crisis
in empirical economics, can be seen as a problem of overfitting.

Outline

1 What is this course about?

2 Additional references

High dimension and variable selection

Empirical economics involves crucial choices about the model, which
may cast doubt on the credibility of the results.

First focus on high-dimensional methods, which can handle a
large number of covariates and/or instrumental variables, and on
certain machine learning techniques, with the purpose of performing
causal inference in mind.

High dimension and variable selection

Example: using the Neyman-Rubin potential outcome model.
Yi(0) is the potential outcome for individual i if not treated and
Yi(1) is the potential outcome if treated. We only observe the state
of treatment Di ∈ {0, 1} and the realized outcome Yi defined by:

Yi = Yi(Di) = Yi(0) if Di = 0, and Yi(1) if Di = 1.

One interesting quantity is the average treatment effect

τ0 := E[Yi(1) − Yi(0)],

representing the average impact of the intervention.
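To fix ideas, here is a minimal simulation sketch (not taken from the course; it assumes numpy and an invented data-generating process) in which both potential outcomes are drawn, only Yi(Di) is revealed, and τ0 is estimated by a difference in means under random assignment.

import numpy as np

rng = np.random.default_rng(0)
n = 10_000

# Simulated potential outcomes: the true ATE is 2.0 by construction.
y0 = rng.normal(size=n)               # Y_i(0)
y1 = y0 + 2.0 + rng.normal(size=n)    # Y_i(1)

# Random treatment assignment; only Y_i(D_i) is observed.
d = rng.integers(0, 2, size=n)        # D_i in {0, 1}
y = np.where(d == 1, y1, y0)          # realized outcome Y_i

# Difference-in-means estimator of tau_0 = E[Y_i(1) - Y_i(0)].
tau_hat = y[d == 1].mean() - y[d == 0].mean()
print(tau_hat)                        # close to 2.0 in large samples

Under random assignment the difference in means targets τ0 directly; the next slide considers assignment that is only random conditional on observables.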

High dimension and variable selection

1 If treatment assignment is random when conditioning on observables
(i.e., assuming that E[εi|Di, Xi] = 0 in the model below) and there are
only a limited number of significant covariates (sparsity), high-dimensional
methods provide the tools to estimate τ0 in the model:

Yi = Di τ0 + Xi β0 + εi, with E[εi] = 0 and E[εi|Di, Xi] = 0,

where Xi is a vector of p exogenous control variables, p being
potentially larger than the number of observations (see the estimation
sketch after this list).
2 Generalization to other models, ML predictors.
3 Generalization to E[εi|Di, Xi] ≠ 0 and many instruments.
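To illustrate point 1, the sketch below (not from the slides) implements a double-selection style estimator of τ0 under the sparsity assumption: one Lasso selects controls predictive of Yi, a penalized logistic regression selects controls predictive of Di, and τ0 is then estimated by OLS on the union of selected controls. Package choices (scikit-learn, statsmodels), the simulated data, and the tuning are all illustrative assumptions.

import numpy as np
import statsmodels.api as sm
from sklearn.linear_model import LassoCV, LogisticRegressionCV

rng = np.random.default_rng(1)
n, p = 500, 200                        # more controls than plain OLS handles comfortably
X = rng.normal(size=(n, p))
d = (X[:, 0] + rng.normal(size=n) > 0).astype(float)        # treatment depends on X
y = 1.5 * d + X[:, 0] + 0.5 * X[:, 1] + rng.normal(size=n)  # true tau_0 = 1.5

# Step 1: Lasso of Y on X -> controls that predict the outcome.
sel_y = np.flatnonzero(LassoCV(cv=5).fit(X, y).coef_)

# Step 2: penalized logistic regression of D on X -> controls that predict treatment.
sel_d = np.flatnonzero(
    LogisticRegressionCV(penalty="l1", solver="liblinear", cv=5).fit(X, d).coef_.ravel()
)

# Step 3: OLS of Y on D and the union of selected controls.
keep = np.union1d(sel_y, sel_d)
Z = sm.add_constant(np.column_stack([d, X[:, keep]]))
fit = sm.OLS(y, Z).fit(cov_type="HC1")
print(fit.params[1], fit.bse[1])       # estimate and standard error for tau_0

A real application would of course pay closer attention to penalty choice and to the inference theory behind this two-step selection.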

Estimation of heterogeneous effects
The average treatment effect (τ0 above) does not describe the
heterogeneity of responses to an intervention – some people may
benefit greatly from it, while others may not be affected or may
even be worse off.
Focus on a more complex parameter of interest, which is the
average treatment effect conditional on certain (observed) variables
τ : x ↦ E[Yi(1) − Yi(0) | Xi = x] (see the illustrative sketch below).

Objective: inference on the function τ(·), i.e., to test the
significance of the effect conditional on covariates taking the value x.
Then lower our requirements to make inference only on certain
characteristics of the conditional average treatment effect, while
ensuring better theoretical guarantees.
Then focus on optimal policy learning, which presents the tools
for estimating optimal policies in the context of randomized
experiments.
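As a purely illustrative sketch of approximating τ(x) (not the method developed in the course), the code below uses a simple "T-learner": fit one flexible regression of the outcome on covariates for the treated, another for the controls, and take the difference of the predictions. The scikit-learn random forests and the simulated data are assumptions; estimators with proper inference guarantees are the subject of this part of the course.

import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(2)
n = 4_000
X = rng.uniform(-1, 1, size=(n, 3))
d = rng.integers(0, 2, size=n)                      # randomized treatment
tau_x = 2.0 * (X[:, 0] > 0)                         # effect only when X1 > 0
y = X[:, 1] + tau_x * d + rng.normal(size=n)

# T-learner: separate outcome models for treated and control units.
m1 = RandomForestRegressor(n_estimators=200, random_state=0).fit(X[d == 1], y[d == 1])
m0 = RandomForestRegressor(n_estimators=200, random_state=0).fit(X[d == 0], y[d == 0])

cate_hat = m1.predict(X) - m0.predict(X)            # estimate of tau(X_i)
print(cate_hat[X[:, 0] > 0].mean(), cate_hat[X[:, 0] <= 0].mean())  # ~2.0 vs ~0.0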
Aggregate data and macroeconomic forecasting

Tools to handle data that has a temporal structure, often taking a
more aggregated form in economics.

High-dimensional estimation methods in a context where the
data is not identically distributed and potentially has heavy tails,
as is the case with macroeconomic and financial data.
Adapt the tools developed in the previous chapters to the problem of
predicting macroeconomic variables, in a context where one wishes
to select, without priors, from a large number of explanatory variables.
The limitations of sparse methods in this context are also
underlined, and links to factor models, which are “dense” models, are
made (see the sketch below).
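A toy contrast between the sparse and dense routes (an assumption-laden sketch, not from the slides): a Lasso forecast that selects a few individual predictors versus a principal-components regression in the spirit of factor models. It assumes scikit-learn, simulated data with a single common factor, and ignores the dependence and heavy-tail issues mentioned above.

import numpy as np
from sklearn.decomposition import PCA
from sklearn.linear_model import LassoCV, LinearRegression

rng = np.random.default_rng(3)
T, p = 240, 100                        # e.g. 20 years of monthly data, 100 predictors
f = rng.normal(size=T)                 # a single common factor
X = np.outer(f, rng.normal(size=p)) + rng.normal(size=(T, p))
y = f + rng.normal(scale=0.5, size=T)  # target driven by the factor ("dense" DGP)

X_tr, X_te, y_tr, y_te = X[:200], X[200:], y[:200], y[200:]

# Sparse route: Lasso picks a handful of individual predictors.
lasso = LassoCV(cv=5).fit(X_tr, y_tr)

# Dense route: regress on the first principal components (factor-model spirit).
pca = PCA(n_components=3).fit(X_tr)
ols_f = LinearRegression().fit(pca.transform(X_tr), y_tr)

mse = lambda m, Z: np.mean((y_te - m.predict(Z)) ** 2)
print("lasso MSE:", mse(lasso, X_te), "factor MSE:", mse(ols_f, pca.transform(X_te)))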

Text data

Short introduction to the basic tools for the analysis of text data.

First part on tools to extract a numerical representation of texts, as
well as to model the language (see the toy example below).

Second part on the fundamental concepts of modern natural
language processing (NLP): word embeddings. Then, we will see
generalizations of the concept of embeddings beyond textual data,
still experimental in empirical economics, but identified as promising
for the future.
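As a minimal example of extracting a numerical representation of texts (a sketch under assumed tooling, scikit-learn, rather than anything prescribed by the course), the snippet below converts a few sentences into a TF-IDF matrix that standard econometric or ML tools can consume; word embeddings, covered in the second part, replace such sparse counts with dense learned vectors.

from sklearn.feature_extraction.text import TfidfVectorizer

docs = [
    "The minimum wage increase reduced teen employment.",
    "Employment was unaffected by the minimum wage.",
    "Peer effects among students are hard to measure.",
]

# Each document becomes a row; each column is a weighted word frequency.
vec = TfidfVectorizer()
X = vec.fit_transform(docs)              # sparse matrix, shape (3, vocabulary size)

print(X.shape)
print(vec.get_feature_names_out()[:8])   # first few vocabulary entries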

Outline

1 What is this course about?

2 Additional references

Resources

General econometrics
1 Wooldridge, J. M. (2002). Econometric Analysis of Cross Section and
Panel Data. Cambridge and London: MIT Press.
2 Hansen, B. E. (2022). Econometrics. Princeton University Press.

Causal inference
1 Angrist, J. and J.-S. Pischke (2009). Mostly Harmless Econometrics:
An Empiricist’s Companion (1st ed.). Princeton University Press.
2 Chernozhukov, V., C. Hansen, N. Kallus, M. Spindler, and V.
Syrgkanis (2024). Applied Causal Inference Powered by ML and AI.

Machine learning methods, neural networks
1 Hastie, T., R. Tibshirani, and J. Friedman (2009). The Elements of
Statistical Learning: Data Mining, Inference, and Prediction. Springer.
2 Goodfellow, I. J., Y. Bengio, and A. Courville (2016). Deep Learning.
Cambridge, MA, USA: MIT Press. http://www.deeplearningbook.org.

Resources

Our book with Jérémy L’Hour (CFM):
1 Gaillac, C. and J. L’Hour (2023). Machine Learning pour
l’économétrie. Economica.
2 Gaillac, C. and J. L’Hour (June 2025). Machine Learning for
Econometrics. Oxford University Press.

A GitHub repository is available at the address
https://github.com/jeremylhour/ml_econometrie.
It contains scripts in R and Python, which reproduce some of the
applications.

Appendix

