5  Observational Trials

Observational studies are empirical investigations. Unlike randomized controlled trials (RCTs), observational studies refrain from direct intervention or manipulation of variables. Instead, observational studies focus on real-world dynamics, often in fields such as epidemiology, social sciences and economics.

One of the main differences with randomized controlled trials is the challenge of managing potential confounding variables due to the inherent susceptibility of unmeasured factors to alter observed associations.

Despite this constraint, the usefulness of observational studies is undeniable. This approach reflects the diversity and variability of the real world, which may not be fully reproduced in the controlled environment of randomized controlled trials.

In the context of observational trials, it is totally unrealistic to assume the assumption of ignorability like in RCTs. This is why calculating the ATE is not as straightforward as in randomized clinical trials. In addition, it is necessary to formulate other hypotheses to decompose the ATE. Thus, in order to compute \(\mathbb{E}[Y^{(1)}]\) (resp \(\mathbb{E}[Y^{(0)}]\)) in the case of observational studies we need to have the potential outcome for all the individuals given that they took (resp or not) the treatment.

To solve this issue we need to make assumptions under which we can estimate the two expectancies. Let \(X_i\in \mathbb{R}^p\) be the vector of confounding variables of the individual \(i \in \mathcal{I}\). We can then define the random variable \(X\) as:

\[\begin{align*} X \colon \mathcal{I} & \longrightarrow \mathbb{R}^p \\ i & \longmapsto X_i \end{align*}\]

Assumption 5.1: Conditional Exchangeability / Unconfoundedness
\[ T \perp\mkern-9.5mu\perp(Y(0), Y(1)) | X \]

Unconfoundedness, also known as the “Conditional Exchangeability” assumption, is the assumption that conditioned on confounding variables there are no unobserved variables that simultaneously influence the treatment assignment and the outcome. Hence, the treatment assignment is independent of the potential outcomes conditioned on pre-treatment variables.

We will now define the propensity score which will be a very useful for the assumptions and estimators that we will be working with.

Definition 5.1: Propensity score
The propensity score is defined as the conditional probability of treatment given background variables: \[ e(x) = \mathbb{P}[{T=1|X=x}]\]

We can now construct our first theorem with all the previous definitions and assumptions.

Theorem 5.1: Propensity Score Theorem
Given positivity, unconfoundedness given \(X\) implies unconfoundedness given the propensity score \(e(X)\): \[ T \perp\mkern-9.5mu\perp(Y(0), Y(1)) | X \Longrightarrow T \perp\mkern-9.5mu\perp(Y(0), Y(1)) | e(X) \]

Proof
To prove this theorem, we will show that for \(t\in \{0,1\},\) \(\mathbb{P}[{T|Y^{(t)},e(X)}] = \mathbb{P}[{T|e(X})]\) which is equivalent to \(T \perp\mkern-9.5mu\perp(Y(0), Y(1)) | e(X)\). Now we fix \(t \in \{0,1\}\). Since \(T\) is binary, we transform can this probability into an expectation: \[\begin{align*} \mathbb{P}[{T|Y^{(t)},e(X)}] &= \mathbb{E}[{T|Y^{(t)},e(X)}]\\ &= \mathbb{E}[\mathbb{E}[{T|Y^{(t)},e(X), X}]|Y^{(t)},e(X)]\\ &= \mathbb{E}[\mathbb{E}[{T|Y^{(t)}, X}]|Y^{(t)},e(X)]\\ &= \mathbb{E}[\mathbb{E}[{T|X}]|Y^{(t)},e(X)] && \text{Unconfoundness}\\ &= \mathbb{E}[\mathbb{P}[{T|X}]|Y^{(t)},e(X)]\\ &= \mathbb{E}[e(X)|Y^{(t)},e(X)]\\ &= e(X) \end{align*}\] We also have: \[\begin{align*} \mathbb{E}[T|e(X)] &= \mathbb{E}[\mathbb{E}[T|X=x, e(X)]|e(X)]\\ &= \mathbb{E}[\mathbb{E}[T|X=x]|e(X)]\\ &= \mathbb{E}[e(X)]|e(X)]\\ &= e(X) \end{align*}\] Therefore, we have that \(T \perp\mkern-9.5mu\perp(Y(0), Y(1)) | e(X)\).

This theorem was introduced by Rosenbaum and Rubin. It basically states that it is not necessary to condition on the whole random variable \(X\) but only on \(e(X)\) since the propensity score completely describes \(\mathbb{P}[{T|X}]\).

Another assumption is required on this propensity score, to ensure that the processus in treatment assignment is not completely deterministic, that is for any observation the probability to receive or not the treatment is different from 0 or 1.

Assumption 5.2: Positivity / Overlap
\[ \exists \eta \in ]0;1[ ,\forall x \in X(\Omega), \quad \eta \le e(x) \le 1-\eta \]

For all levels of X, all individuals have a positive probability of receiving both the treatment and the control. Put differently, it means that there are no subpopulations with zero or near-zero probabilities of receiving a specific treatment, which is also referred to as overlapping. This contrary to Unconfoundedness is a testable assumption.