
Causality

Lecture Notes

Niklas Pfister
May 28, 2024

1 Causal Models
A causal model, in its most basic form, is an extension of a statistical model that describes
a system not only in its observed state but also under a set of well-defined hypothetical interventions.
Conceptually, a statistical model specifies a set of distributions,

P ⊆ {P | P probability distribution on X },

such that the observed data is assumed to be a random sample X ∼ P0 for a fixed P0 ∈ P. In
contrast, for a fixed index set I, a causal model specifies a set of distribution-valued functions
with domain I,
PI ⊆ {P | P : I → P}.
We call I the set of interventions and assume that for all interventions i ∈ I the data is modeled as
a random sample X ∼ P0(i) for a fixed P0 ∈ PI. We further assume that the set of interventions
can be divided into a set of observed interventions Iobs ⊆ I – often this only contains a single
element corresponding to the observed setting – and a set of hypothetical interventions Ihyp ⊆ I.
Formally, we assume Iobs ∩ Ihyp = ∅ and Iobs ∪ Ihyp = I. Similar to how in a statistical model
we define statistical estimands as real-valued functionals on P, we define causal estimands as
functionals Ψ : PI → R.
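To make this concrete, here is a minimal sketch in Python of the abstract definition; the two Bernoulli distributions and all names are hypothetical illustrations, not part of the definition.

```python
# A minimal sketch of the abstract definition above, under assumed toy
# distributions. An element P0 of the causal model is a function from
# interventions to distributions; here each distribution is represented
# by a sampling function.
import numpy as np

rng = np.random.default_rng(0)

P0 = {
    "obs": lambda n: rng.binomial(1, 0.5, size=n),  # observed setting (I_obs)
    "hyp": lambda n: rng.binomial(1, 0.8, size=n),  # hypothetical intervention (I_hyp)
}

# A causal estimand Psi: a real-valued functional of the whole function P.
# Here: the difference in means between the two interventional distributions,
# approximated by Monte Carlo.
def Psi(P, n=10**6):
    return P["hyp"](n).mean() - P["obs"](n).mean()

print(Psi(P0))  # approximately 0.8 - 0.5 = 0.3
```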

Example 1 (Average causal effects). Assume a setting in which we want to determine whether
taking an aspirin helps relieve a headache. Consider now a study population where for each
participant we measure some covariates X capturing characteristics of the participants (e.g.,
age and gender), a response Y ∈ {0, 1} encoding whether they had a headache at the end of the
experiment and a treatment indicator T ∈ {0, 1} encoding whether they received an aspirin or
not. We can use a statistical model describing (T, X, Y) for each participant, for example, by
assuming they are i.i.d. draws from a distribution P0. The distribution P0, however, is insufficient
to formally express the original goal of determining the effect of aspirin, as it does not allow us to
describe changes in the model.
Instead, we need to first specify what an effect of aspirin means. One way of doing this is to
imagine two hypothetical experiments. In the first we give all participants an aspirin (denoted by
'ALL') and in the second we give none of the participants an aspirin (denoted by 'NONE'). If
we could observe data from both of these experiments, we could define an average causal effect of
aspirin as the difference in the expectation of Y between the two experiments.
Formally, we define the set of interventions I := {∅, ALL, NONE}, where ∅ corresponds
to the observed setting described above (i.e., Iobs := {∅} and Ihyp := {ALL, NONE}). As in the
statistical model, we now assume that under each intervention i ∈ I the measurements for all
participants are modeled as i.i.d. draws from

(T, X, Y) ∼ P0(i),

where P0(i) is a fixed distribution. The average causal effect is then defined by

E_{P0(ALL)}[Y] − E_{P0(NONE)}[Y],

which captures the difference in the expectation of having a headache across the two hypothetical
interventions. This target quantity can be expressed as the causal estimand Ψ : PI → R defined for
all P ∈ PI by
Ψ(P) = E_{P(ALL)}[Y] − E_{P(NONE)}[Y].
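As a sanity check, Example 1 can be simulated in a few lines; the headache probabilities below are made-up assumptions used only to give the interventional distributions P0(i) a concrete form.

```python
# Toy simulation of Example 1 under an assumed data-generating process.
import numpy as np

rng = np.random.default_rng(1)
n = 100_000

def sample(intervention):
    X = rng.uniform(20, 80, size=n)            # covariate, e.g. age
    if intervention == "ALL":
        T = np.ones(n, dtype=int)              # everyone receives aspirin
    elif intervention == "NONE":
        T = np.zeros(n, dtype=int)             # nobody receives aspirin
    else:                                      # observed setting (∅)
        T = rng.binomial(1, 0.5, size=n)
    # assumed headache probabilities: 0.2 with aspirin, 0.6 without
    Y = rng.binomial(1, np.where(T == 1, 0.2, 0.6))
    return T, X, Y

# causal estimand: E_{P0(ALL)}[Y] - E_{P0(NONE)}[Y]
_, _, Y_all = sample("ALL")
_, _, Y_none = sample("NONE")
print(Y_all.mean() - Y_none.mean())  # approximately 0.2 - 0.6 = -0.4
```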

Just as in an (observational) statistical model, an estimand is not necessarily identifiable even
if infinite data are available. However, while for (most) statistical estimands identifiability follows
from assuming regularity conditions on the statistical model P (e.g., smoothness), the unidentifiability
of causal estimands can be more substantial, as it can depend on interventional distributions
for which no data is available.

Definition 1 (Causal identifiability). Let PI be a causal model based on the interventions I =
Iobs ∪ Ihyp. Let Ψ : PI → R be a causal estimand. We say Ψ is causally identifiable if there exists
Φ : {P|_{Iobs} | P ∈ PI} → R satisfying for all P ∈ PI that

Φ(P|_{Iobs}) = Ψ(P).

A causal model as defined here is not capable of modeling counterfactual statements. While
there are certainly valid reasons to consider such statements (e.g., applications in fairness), we
avoid them here as they involve delicate philosophical reasoning that, if not explicitly needed, is
best avoided. We use the abstract notion of a causal model to provide a unified perspective
on the more specialized causal models that exist in the literature and that have been developed (often
independently of each other) for different types of applications. Each model has its own advantages
and disadvantages and none is strictly superior to another. Most importantly, all of them can be
shown to induce distributions for observed and hypothetical interventions and are hence causal
models as defined above. Moreover, any valid causal analysis based solely on these distributions
results in the same conclusions.

Potential outcome models Potential outcome models (sometimes also called Rubin causal
models) were originally developed to formalize a notion of causality in (randomized) controlled trials
and are still the most prevalent models used in clinical and biomedical settings. They start from
a well-defined set of hypothetical interventions and introduce individual random variables (called
potential outcomes) for each possible intervention.
Advantages: The potential outcomes are easy to communicate to practitioners, as most humans
are comfortable thinking in terms of counterfactuals. Moreover, the framework avoids explicitly
specifying causal mechanisms (as long as they are not needed). Lastly, since these models start
from individual units, it can be easier to specify certain delicate causal assumptions between these
units (e.g., interference) than with other causal models.
Disadvantages: A common criticism is that these models obscure which quantities are counterfactual
(and potentially non-falsifiable) and which are interventional (and falsifiable by experimentation)
[e.g., Dawid, 2021]. A further disadvantage is that as soon as multiple sequential causal
relations are present (e.g., in mediation analysis or other settings with complex causal structure),
the models become complex and clearly communicating the underlying causal assumptions gets
challenging. Lastly, the underlying mathematical formalism is subtle given that it starts from
finite populations.
References: Rubin [2005], Imbens and Rubin [2015]

Graphical causal models Graphical causal models (sometimes also called Pearlian graphical
models) aim to provide a concise and intuitive description of causal relations between multiple
causal variables. They extend probabilistic graphical models, which parameterize distributions via
conditional independence constraints, to additionally model interventions that consist of changing
specific variables while keeping the others untouched. These models originated in the computer science
community, which relies heavily on probabilistic graphical models to efficiently parameterize
complex multivariate distributions.
Advantages: Due to the graphical representation, it is straightforward to communicate a causal
ordering and describe complex causal relations. Moreover, a causal graph provides a simple language
to communicate causal assumptions, which makes it easy to discuss assumptions with domain
experts who may not have a background in causality.
Disadvantages: A common concern with these models is that they model all causal mechanisms
at once, even if these are not needed for the analysis. In particular, they (in general) assume that
interventions on all variables are feasible, which in practice is often unreasonable. A further
drawback is that functional constraints cannot be directly included (if this is required, a structural
causal model is a better choice).
References: Pearl [2009]

Simultaneous and structural equation models Simultaneous and structural equation models
extend conventional statistical regression models by allowing the endogenous (dependent) variables
to depend not only on exogenous (independent) variables but also on the other endogenous
variables. By explicitly distinguishing exogenous from endogenous variables, these models can be used
to formalize causal effects on the endogenous variables given interventions on the exogenous
variables. These models originated in the econometrics literature as a way to add causal meaning to
specific parts of a regression model and have in particular been used in the context of instrumental
variables.
Advantages: The closeness to conventional (non-causal) regression models means that they
integrate particularly well with existing statistical regression techniques. Furthermore, similar to
potential outcome models, they only model a specific predefined set of interventions and hence
avoid unnecessary assumptions.
Disadvantages: The causal assumptions underlying these models are not made explicit, meaning
that they can easily be misinterpreted or wrongly applied (if the required assumptions are not
satisfied).
References: Newey et al. [1999], Duncan [2014]

Structural causal models Structural causal models (sometimes also called functional causal
models) are a refinement of graphical causal models that additionally specify the functional form
of all causal mechanisms. One can see them as a hybrid of a graphical causal model and a
simultaneous equation model. While in most cases they are defined by specifying a causal structure
among all variables, they can in fact also be used (similar to simultaneous equation models) to
model only a single causal mechanism and leave the remaining parts unspecified.
Advantages: They always induce a corresponding (partial) graphical causal model, hence making
the implied causal assumptions and structure easy to communicate and discuss. Moreover, by
explicitly modeling the causal mechanisms, it is easy to specify functional causal assumptions that
map directly to statistical assumptions and procedures.
Disadvantages: A common concern with structural causal models is that, by specifying the
causal mechanisms explicitly, they can easily be misused to reason about interventional or
counterfactual statements for which the model was not intended. Such ambiguities
about the causal implications of the model can be avoided by always explicitly stating which
interventions the model is intended to capture.
References: Pearl [2009], Bongers et al. [2021], Peters et al. [2017]

Further models In practice, one often uses a mix of, or slight variations on, the above models.
Moreover, additional causal models exist. Notable examples are single world intervention graphs
(SWIGs) [Richardson and Robins, 2013], which attempt to add a rigorous graphical language to
potential outcome models, and the decision-theoretic framework for causality [Dawid, 2012, 2021],
which advocates being more explicit about which causal implications of the structural and
graphical causal frameworks are actually used. In the end, which causal model is best suited for
a given application boils down to personal preference and the application at hand. However, no
matter what model you use, you should always be able to understand and communicate the causal
assumptions that it implies.

2 Potential Outcome Models


Assume we are given a set of observed variables and are interested in understanding how a subset
of these variables (called target variables) is causally affected by intervening on a second subset
of these variables (called intervention variables). The set of target variables should consist of
all variables whose distributions we intend to model; this can in particular mean that a
variable is both a target and an intervention variable (e.g., in the instrumental variable setting).
A potential outcome model provides a mathematical language to describe these causal relations.
To construct it, we introduce for each of the observed variables either (i) a single new random
variable, if it is not a target variable, or (ii) a (potentially infinite) set of random variables – called
potential outcomes – indexed by the values the intervention variables (except itself) can attain, if
it is a target variable. All of the new variables are assumed to live on the same probability space.
To make this more concrete, let (T1, X1, Y1), . . . , (Tn, Xn, Yn) ∈ T × X × Y denote random
variables corresponding to observations from n different units (e.g., participants in a study). We
call Y = (Y1, . . . , Yn) the responses, X = (X1, . . . , Xn) the covariates and T = (T1, . . . , Tn) the treatments.
We are now interested in understanding how the responses are affected by the treatments; hence
the responses are target variables and the treatments are intervention variables. To construct the
potential outcome model, we therefore introduce the following new random variables

(T̄i, X̄i, (Ȳi(t))_{t∈T^n})_{i∈[n]} with joint distribution Pfull. (1)

The variables (T̄i, X̄i) should be thought of as copies of the observed treatments and covariates,
while the potential outcomes (Ȳi(t))_{t∈T^n} are the random values of the responses under the
intervention T = t. We call the set of probability distributions induced by (1) a potential outcome
model.
model. This is indeed a causal model according to the abstract definition provided in Section 1,
since the distribution Pfull induces for all t ∈ T^n an interventional distribution Pfull^{Ȳ(t)} over the
responses. Similar to a statistical model, we can restrict a potential outcome model by adding
additional assumptions on the possible distributions induced by (1). In most applications of potential
outcome models one adds the following three assumptions, which we discuss in Section 2.1:
Assumption 1 (consistency), Assumption 2 (no interference) and Assumption 3 (single unit). When
working with a potential outcome model it is important to be aware that the potential outcomes
for a single unit can never all be observed at the same time. This counterfactual nature is relevant
both when constructing targets of inference and when making assumptions on the model
(see Remark 1).
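The following sketch illustrates the construction in (1) for binary treatments; the outcome model inside is an arbitrary assumption, and enumerating all |T|^n treatment vectors is only feasible for tiny n.

```python
# Sketch of the potential outcome construction in (1): each unit i carries
# covariate/treatment copies plus one potential outcome Ȳ_i(t) for every
# full treatment vector t in T^n (interference is not ruled out here).
import itertools
import numpy as np

rng = np.random.default_rng(2)
n = 3  # tiny, so the 2^n treatment vectors can be enumerated

X_bar = rng.normal(size=n)            # covariate copies X̄_i
T_bar = rng.binomial(1, 0.5, size=n)  # treatment copies T̄_i

# Shared noise, so the potential outcomes of a unit are coupled across t.
U = rng.uniform(size=n)

# Hypothetical outcome model: unit i's outcome probability may depend on
# the WHOLE treatment vector t, e.g. through the fraction treated.
def p(t):
    return 0.2 + 0.3 * np.mean(t) + 0.1 * (X_bar > 0)

Y_bar = {t: (U < p(t)).astype(int)
         for t in itertools.product([0, 1], repeat=n)}

# The factual outcomes are the potential outcomes at the observed treatments.
Y_obs = Y_bar[tuple(T_bar)]
print(T_bar, Y_obs)
```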
Remark 1 (Counterfactuals). Potential outcomes are sometimes referred to as counterfactuals.
We avoid this terminology, because potential outcomes are not necessarily counterfactual quantities.
In fact, one of the potential outcomes, the one for which the treatment is observed, is always
factual. More specifically, assume the treatment T = t is observed; then the potential outcomes Yi(t)
are factual, while for all t′ ∈ T^n with t′ ≠ t the potential outcomes Yi(t′) are counterfactual. When
working with potential outcomes one needs to be particularly careful at two stages of the analysis:
(1) when constructing a target of inference (i.e., a causal estimand) based on potential outcomes
and (2) when making assumptions based on potential outcomes. In both cases, it is easy to
(accidentally) end up with either a counterfactual quantity or a counterfactual assumption (sometimes
called a cross-world assumption). If not explicitly of interest, one should avoid counterfactuals, as
they are both philosophically delicate and potentially non-verifiable via experimentation.
The potential outcome model in (1) only models how the responses are affected by interventions
on the treatments. If we are interested in other causal questions, we need to specify different sets of
intervention and target variables. For example, if we are interested in treatment effects mediated
by the covariates X, we would additionally need to consider X as a target and intervention variable,
leading to the potential outcomes Ti(x), Xi(t) and Yi(t, x).

2.1 Consistency, no interference and single unit assumptions

We start from the potential outcome model given in (1) and discuss the core assumptions made
in most (but not all) applications of the potential outcome model. The first assumption, known as
consistency, connects (1) to the observed data.
Assumption 1 (Consistency). For all i ∈ [n] it holds that

Ti = T̄i, Xi = X̄i and Yi = Ȳi(T̄).

Without consistency the potential outcome model is meaningless as it has no connection to
the observed data. Whether or not it holds needs to be argued using knowledge about the data
generating process. Since consistency directly connects the random variables in (1) to the data,
we follow the standard convention and drop the bar from the notation. It is nevertheless useful
to think of the observed random variables as being separate from the potential outcome model
random variables, even if this is not visible in the notation.
Unlike structural causal models, potential outcome models do not explicitly specify the causal
ordering. Nevertheless, depending on the precise model, several assumptions on the causal order are
implicitly encoded. For example, in the model (1) the treatment is assumed to precede the response,
implying that there is no feedback (or cycle) between treatment and response. Moreover, if X
was also a target variable but we did not add potential outcomes for X, we would be implicitly
assuming that X is not causally affected by the treatment. Such mistakes can be avoided by
clearly specifying the target and intervention variables first and adding additional assumptions on the
causal ordering only later.
Since we defined potential outcomes for all values of the treatments of all units simultaneously,
we can model settings in which the potential outcomes of a single unit depend on
the treatments of the remaining units. This can, for example, occur in clinical trials on the efficacy
of vaccines, where the potential outcome (getting a disease or not) of one individual might be
affected by whether individuals close to that individual received the treatment. Modeling these
types of dependencies can be difficult; therefore, if possible, one often assumes that they do not occur.
Assumption 2 (No interference). For all i ∈ [n] and all t, t′ ∈ T^n with t_i = t′_i it holds that

Yi(t) = Yi(t′).

Assumption 1 and Assumption 2 together are called the Stable Unit-Treatment Value Assumption
(SUTVA). As long as no interference holds, one can avoid indexing the potential outcomes by
the full treatment vectors t ∈ T^n and can instead index each unit's potential outcomes by its own
treatment t ∈ T. Therefore, whenever we assume SUTVA we use the simplified potential outcome model

(Ti, Xi, (Yi(t))_{t∈T})_{i∈[n]} with joint distribution Psutva. (2)
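Under SUTVA, the per-unit table of potential outcomes from the previous sketch collapses to one potential outcome per own-treatment level; a minimal sketch of (2), again under an assumed outcome model:

```python
# Sketch of the simplified model (2): under no interference, unit i's
# potential outcomes are indexed by its own treatment only, so for a binary
# treatment a column per level t in {0, 1} suffices.
import numpy as np

rng = np.random.default_rng(3)
n = 5

X = rng.normal(size=n)
T = rng.binomial(1, 0.5, size=n)

U = rng.uniform(size=n)                      # shared noise couples Y_i(0), Y_i(1)
p = lambda t: 0.6 - 0.3 * t + 0.1 * (X > 0)  # hypothetical outcome model
Y_pot = np.stack([(U < p(0)).astype(int),    # column 0: Y_i(0)
                  (U < p(1)).astype(int)],   # column 1: Y_i(1)
                 axis=1)

Y_obs = Y_pot[np.arange(n), T]               # consistency: Y_i = Y_i(T_i)
print(Y_pot)
print(T, Y_obs)
```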

Finally, while in certain settings it is desirable to assume that different units are affected differently,
the unit-level effects are often not identifiable without strong assumptions. It is therefore common
to assume that the unit-level data are i.i.d.
Assumption 3 (Single unit). Define for all i ∈ [n] the variables

Wi := (Ti, Xi, (Yi(t))_{t∈T^n})

and assume that W1, . . . , Wn are independent and identically distributed.


The single unit assumption makes most sense in combination with the no-interference assumption,
as it otherwise strongly restricts the type of allowed interference. It can be justified by
arguing that if the sequence W1, . . . , Wn is exchangeable, then, in the limit as n tends to infinity,¹
de Finetti's representation theorem allows us to construct an i.i.d. sequence with the same distribution.
Under the single unit assumption, the potential outcome model can be fully specified by the
distribution of a single unit. Therefore, whenever we assume SUTVA and single unit we use the
simplified potential outcome model

(T, X, (Y(t))_{t∈T}) with joint distribution Psutva-iid. (3)

¹ The case of an infinite population is sometimes referred to as a superpopulation in the potential outcome literature.

2.2 Average causal effects


We now assume we are given a potential outcome model

(T, X, (Y(t))_{t∈T}) with joint distribution P (4)

that satisfies Assumption 1, Assumption 2 and Assumption 3. The most common causal estimand
in the causal literature is the average causal effect. It is defined differently depending on whether
the treatment variable T is binary or continuous. For binary treatments (T = {0, 1}) it is given
by

ACE := E[Y(1)] − E[Y(0)],

while for continuous treatments it is defined by

ACE := E[ (d/dt) E[Y(t)] |_{t=T} ],

where we assume sufficient smoothness for the derivative to exist. If a single number summarizing
the causal effect is insufficient, one can also consider the causal dose-response curve, given by

t ↦ E[Y(t)].
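In a simulation, where both potential outcomes are generated, the binary ACE can be read off directly; a minimal sketch under an assumed data-generating process:

```python
# Sketch: the binary ACE computed directly from simulated potential outcomes.
# This is only possible here because we simulate BOTH Y(0) and Y(1); with real
# data, at most one of them is observed per unit.
import numpy as np

rng = np.random.default_rng(4)
n = 200_000

X = rng.normal(size=n)
sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))
Y0 = rng.binomial(1, 0.6 * sigmoid(X))  # hypothetical potential outcome Y(0)
Y1 = rng.binomial(1, 0.3 * sigmoid(X))  # hypothetical potential outcome Y(1)

ACE = Y1.mean() - Y0.mean()             # E[Y(1)] - E[Y(0)]
print(ACE)                              # about (0.3 - 0.6) * E[sigmoid(X)] = -0.15
```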

In the following section, we discuss the most common assumptions used in the potential outcome
model to ensure that these types of causal effects are identifiable (see Definition 1).

2.3 Identifiability conditions


To achieve identifiability of average causal effects in the potential outcome model (4), we can
consider hypothetical experiments in which the treatment is randomly assigned. In such a case, it
makes sense to assume that the treatment assignment is independent of the potential outcomes.
Assumption 4 (Ignorability). It holds for all t ∈ T that

Y (t) ⊥⊥ T.

Using ignorability we can directly express the average causal effect (in the binary case) as

ACE = E[Y(1)] − E[Y(0)]
    = E[Y(T) | T = 1] − E[Y(T) | T = 0]
    = E[Y 1(T = 1)] / P(T = 1) − E[Y 1(T = 0)] / P(T = 0),

where in the second equality we used ignorability and in the third equality we used consistency
and the definition of the conditional expectation. Here we additionally assumed that P(T = 0) > 0
and P(T = 1) > 0, which will be formalized below in Assumption 6.
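Both identities above can be checked in a toy randomized experiment; the treatment probability and response model below are assumptions chosen for illustration.

```python
# Sketch: under ignorability (randomized T) the ACE is identified; the
# difference in means and the inverse-probability form coincide.
import numpy as np

rng = np.random.default_rng(5)
n = 200_000

T = rng.binomial(1, 0.3, size=n)                 # randomized treatment
Y = rng.binomial(1, np.where(T == 1, 0.3, 0.6))  # assumed response model

# E[Y | T = 1] - E[Y | T = 0]
diff_means = Y[T == 1].mean() - Y[T == 0].mean()

# E[Y 1(T = 1)] / P(T = 1) - E[Y 1(T = 0)] / P(T = 0)
p1 = (T == 1).mean()
ipw = (Y * (T == 1)).mean() / p1 - (Y * (T == 0)).mean() / (1 - p1)

print(diff_means, ipw)  # both approximately 0.3 - 0.6 = -0.3
```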
In settings in which we cannot assume that the treatment is randomized, it is no longer realistic
to assume ignorability. Instead, one often considers a weaker notion, motivated by a randomized
controlled trial in which the treatment is assigned randomly based on a set of covariates X.
Assumption 5 (Conditional Ignorability). It holds for all t ∈ T that

Y (t) ⊥⊥ T | X.

Conditional ignorability alone is not sufficient to ensure identifiability of the causal effect, since
it does not ensure that all possible treatments t ∈ T are observed across all subgroups X = x. We
therefore make the following additional assumption.
Assumption 6 (Positivity). Assume (4) induces a density p. Then, it holds for all t ∈ T that

P(p(t | X) > 0) = 1.

Given that both conditional ignorability and positivity are true, we can express the average
causal effect in terms of the distribution of the observed variables (T, X, Y ). In the case of a binary

treatment, we get
ACE = E[Y(1)] − E[Y(0)]
    = E[E[Y(1) | X] − E[Y(0) | X]]
    = E[E[Y(T) | X, T = 1] − E[Y(T) | X, T = 0]]
    = E[E[Y | X, T = 1] − E[Y | X, T = 0]]
    = E[ E[Y 1(T = 1) | X] / P(T = 1 | X) − E[Y 1(T = 0) | X] / P(T = 0 | X) ]
    = E[ E[ Y 1(T = 1) / P(T = 1 | X) | X ] − E[ Y 1(T = 0) / P(T = 0 | X) | X ] ]
    = E[ Y 1(T = 1) / P(T = 1 | X) − Y 1(T = 0) / P(T = 0 | X) ],
where in the third equality we used conditional ignorability, in the fourth equality we used
consistency, and in the fifth and subsequent equalities we used the properties of the conditional
expectation. More generally, for potentially non-binary T, we get for t ∈ T that
E[Y(t)] = E[E[Y | X, T = t]]
        = ∫∫ y p(y | x, t) µ(dy) p(x) µ(dx)
        = ∫∫ y [ p(y, x | t) p(x) / p(x | t) ] µ(dy) µ(dx)
        = ∫∫ y [ p(t) / p(t | x) ] p(y, x | t) µ(dy) µ(dx)
        = E[ Y p(t) / p(t | X) | T = t ].
This short computation provides two useful representations for identifying the interventional
mean E[Y(t)]. Firstly, the adjustment representation E[E[Y | X, T = t]], which if used directly
requires an estimate of the conditional expectation function (x, t) ↦ E[Y | X = x, T = t]. Secondly,
the propensity weighted representation E[Y p(t)/p(t | X) | T = t], which if the propensity score
p(t | x) is known (or easy to estimate) only requires an estimate of the simpler conditional
expectation function t ↦ E[Ỹ | T = t] for Ỹ = Y p(t)/p(t | X).
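Both representations can be tried out in a small confounded simulation; the data-generating process below (binary X, propensities 0.8/0.2, effect −0.3) is an assumption made up for illustration, and the true propensity score is treated as known.

```python
# Sketch: adjustment vs. propensity weighting under confounding by X.
# Conditional ignorability holds by construction and the true ACE is -0.3.
import numpy as np

rng = np.random.default_rng(6)
n = 500_000

X = rng.binomial(1, 0.5, size=n)              # binary covariate
pi = np.where(X == 1, 0.8, 0.2)               # true propensity P(T = 1 | X)
T = rng.binomial(1, pi)
Y = rng.binomial(1, 0.5 + 0.2 * X - 0.3 * T)  # E[Y | X, T] = 0.5 + 0.2 X - 0.3 T

# naive difference in means is biased, since X confounds T and Y
naive = Y[T == 1].mean() - Y[T == 0].mean()

# adjustment representation E[E[Y | X, T = 1] - E[Y | X, T = 0]],
# estimated by stratifying on the binary X
adj = sum((X == x).mean()
          * (Y[(X == x) & (T == 1)].mean() - Y[(X == x) & (T == 0)].mean())
          for x in (0, 1))

# propensity-weighted representation with known propensity score
ipw = (Y * T / pi).mean() - (Y * (1 - T) / (1 - pi)).mean()

print(naive, adj, ipw)  # naive is biased (about -0.18); adj and ipw are near -0.3
```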
Remark 2 (Identifiability for continuous treatments). We skipped over a technical detail regarding
identifiability in the case of a continuous treatment variable T (i.e., one dominated by the Lebesgue
measure). Even if there exists a joint density p for the observational distribution over (T, X, Y),
the positivity assumption is insufficient to uniquely identify the density, as it can be modified on
sets of measure zero. This further implies that the conditional expectation function

(t, x) ↦ E[Y | T = t, X = x]

is also not uniquely identified. To avoid this unidentifiability, additional regularity conditions on
the density (e.g., assuming it is continuous) are required. We do not consider these issues further
and simply assume that sufficient regularity for identifiability of the density is given.

2.3.1 Adjusting with propensity scores


The following result is due to Rosenbaum and Rubin [1983].
Theorem 1 (Adjusting with propensity scores). Assume T ∈ {0, 1} and the potential outcome
model satisfies Assumptions 1, 2, 3, 5 and 6. Then, for all t ∈ {0, 1} it holds that
Y (t) ⊥⊥ T | π(X),
where π(X) := E[T | X] is the propensity score.

Proof. Fix t = 1 and an arbitrary bounded measurable function g : R → R. Then, it holds that

E[g(Y(1)) | π(X)]
  = E[E[g(Y(1)) | X, π(X)] | π(X)]
  = E[E[g(Y(1)) | X] | π(X)]
  = E[E[g(Y(1)) | X, T] | π(X)]
  = E[ E[g(Y(1)) | X, T = 1] P(T = 1 | X) + E[g(Y(1)) | X, T = 0] P(T = 0 | X) | π(X) ]
  = E[ (E[g(Y(1)) 1(T = 1) | X] / P(T = 1 | X)) P(T = 1 | X)
        + (E[g(Y(1)) 1(T = 0) | X] / P(T = 0 | X)) P(T = 0 | X) | π(X) ]
  = E[ E[g(Y(1)) 1(T = 1) | X] + E[g(Y(1)) 1(T = 0) | X] | π(X) ]
  = E[g(Y(1)) 1(T = 1) | π(X)] + E[g(Y(1)) 1(T = 0) | π(X)]
  = (E[g(Y(1)) 1(T = 1) | π(X)] / P(T = 1 | π(X))) P(T = 1 | π(X))
        + (E[g(Y(1)) 1(T = 0) | π(X)] / P(T = 0 | π(X))) P(T = 0 | π(X))
  = E[g(Y(1)) | π(X), T = 1] P(T = 1 | π(X)) + E[g(Y(1)) | π(X), T = 0] P(T = 0 | π(X))
  = E[g(Y(1)) | π(X), T].

Here, for the first and second equality we used the tower property and that we can drop π(X)
from the conditioning, respectively. For the third equality we used conditional ignorability (i.e.,
Y(1) ⊥⊥ T | X). For the remaining steps we used that T is binary and that in that case
E[Y | X, T = 1] = E[Y 1(T = 1) | X] / P(T = 1 | X).
Since this holds for all bounded measurable g, it follows by the characterization of conditional
independence [see e.g. Dawid, 1979] that

Y(1) ⊥⊥ T | π(X).

The same argument can be performed for t = 0, which completes the proof of Theorem 1.
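Theorem 1 can also be checked empirically; in the assumed simulation below, two covariate levels share the same propensity value, and within each propensity stratum the distribution of Y(1) is the same for treated and untreated units.

```python
# Sketch: empirical check of the balancing property Y(1) ⊥⊥ T | pi(X).
# Marginally, Y(1) and T are dependent (both depend on X), but within each
# stratum of the (here known, discrete) propensity score the distribution of
# Y(1) is the same among treated and untreated units.
import numpy as np

rng = np.random.default_rng(7)
n = 500_000

X = rng.integers(0, 3, size=n)                      # covariate with three levels
pi = np.array([0.2, 0.5, 0.5])[X]                   # propensity; levels 1 and 2 share pi = 0.5
T = rng.binomial(1, pi)
Y1 = rng.binomial(1, np.array([0.3, 0.4, 0.6])[X])  # potential outcome Y(1)

# marginal dependence: the two means differ
print(Y1[T == 1].mean(), Y1[T == 0].mean())

# conditional independence given pi(X): within each stratum the means agree
for p in np.unique(pi):
    s = pi == p
    print(p, Y1[s & (T == 1)].mean(), Y1[s & (T == 0)].mean())
```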

References
S. Bongers, P. Forré, J. Peters, and J. M. Mooij. Foundations of structural causal models with
cycles and latent variables. The Annals of Statistics, 49(5):2885–2915, 2021.

A. P. Dawid. Conditional independence in statistical theory. Journal of the Royal Statistical
Society: Series B (Methodological), 41(1):1–15, 1979. doi: 10.1111/j.2517-6161.1979.tb01052.x.
P. Dawid. The decision-theoretic approach to causal inference. In Causality: Statistical Perspectives
and Applications, pages 25–42, 2012.

P. Dawid. Decision-theoretic foundations for statistical causality. Journal of Causal Inference, 9
(1):39–77, 2021.
O. D. Duncan. Introduction to structural equation models. Elsevier, 2014.
G. W. Imbens and D. B. Rubin. Causal inference in statistics, social, and biomedical sciences.
Cambridge University Press, 2015.
W. K. Newey, J. L. Powell, and F. Vella. Nonparametric estimation of triangular simultaneous
equations models. Econometrica, 67(3):565–603, 1999. doi: 10.1111/1468-0262.00037.
J. Pearl. Causality. Cambridge University Press, 2009.

J. Peters, D. Janzing, and B. Schölkopf. Elements of Causal Inference: Foundations and Learning
Algorithms. MIT Press, Cambridge, MA, 2017.

T. S. Richardson and J. M. Robins. Single world intervention graphs (SWIGs): A unification of the
counterfactual and graphical approaches to causality. Working Paper 128, Center for the Statistics
and the Social Sciences, University of Washington, 2013.
P. R. Rosenbaum and D. B. Rubin. The central role of the propensity score in observational
studies for causal effects. Biometrika, 70(1):41–55, 1983.

D. B. Rubin. Causal inference using potential outcomes: Design, modeling, decisions. Journal of
the American Statistical Association, 100(469):322–331, 2005.
