๐Ÿ”’

Literature Readings

Reading notes โ€” please enter access password

โœ— Incorrect password

Literature Readings

DiD ยท Paper Detail

โ†All Readings โ†DiD Methodology

๐Ÿ“„ This paper

โšกTL;DR ๐ŸงฉSetup ๐Ÿ“Main results ๐Ÿ› ๏ธFor practice ๐ŸงญIn the lit ๐Ÿ“ฅPDF

Literature Readings ยท DiD ยท Paper Detail

Efficient Difference-in-Differences and Event-Study Estimators

Xiaohong Chen (Yale) ยท Pedro H. C. Sant'Anna (Emory) ยท Haitian Xie (Peking U.)

Cowles WP 2470 (2025)MethodologyEfficiency Theory

๐Ÿ“ฅ Read it

Local PDF (981 KB)arXiv 2506.17729Cowles

โšก TL;DR

Derives the semiparametric efficient influence function (EIF) in closed form for DiD and event-study causal parameters under heterogeneous treatment effects and standard parallel-trends assumptions. The resulting estimator achieves the smallest variance among all asymptotically normal, regular estimators. Establishes the efficiency benchmark for the post-2020 DiD landscape.

๐Ÿงฉ Setup & motivation

The post-2020 DiD methods (CS, SA, BJS, dC-dH, Wooldridge ETWFE) all target the same causal parameters under heterogeneity, but they differ in efficiency. Which is the most efficient? Until this paper, there was no definitive answer.

Chen-Sant'Anna-Xie compute the semiparametric efficiency bound for short-panel DiD and ES estimands. The bound is a function of the unit-level heterogeneity, time-varying covariates, and assignment mechanism. Their EIF-based estimator achieves the bound; existing estimators do not (some come close).

๐Ÿ“ Main results

The efficient influence function

Under standard parallel-trends assumptions and a short-panel structure, the EIF for the ATT at horizon \(k\) takes the form \(\psi_k(O; \eta) = \frac{D_i}{p} [(Y_{i,t+k} - Y_{i,t-1}) - m_k(X_i, \eta)] - \frac{(1-D_i) e(X_i)}{p(1-e(X_i))} [(Y_{i,t+k} - Y_{i,t-1}) - m_k(X_i, \eta)]\) where \(m_k\) is the conditional expectation of the outcome difference and \(e(X_i)\) the propensity score.

Neyman orthogonality

The EIF is automatically Neyman orthogonal: the first-order bias of \(\psi_k\) with respect to nuisance-parameter estimation is zero. This means ML estimators of the nuisances (\(m_k\) and \(e\)) can be plugged in without contaminating root-\(n\) inference โ€” the same logic as in Chernozhukov-Chetverikov-Demirer-Duflo-Hansen-Newey-Robins (2018) double machine learning.

The proposed estimator

Simple plug-in: estimate \(m_k\) and \(e\) with any ML method via cross-fitting, then compute the sample average of \(\hat\psi_k\). Achieves the semiparametric efficiency bound asymptotically.

Efficiency comparison

The paper compares their EIF estimator to BJS, CS with covariates, SA with covariates, and Wooldridge ETWFE. The EIF achieves the bound; BJS is efficient under linearity but loses ground when nonlinear nuisances matter; CS with parametric covariate models is inefficient when the propensity score is far from \(1/2\).

๐Ÿ› ๏ธ Implications for practice

  • For maximum statistical efficiency in a short-panel DiD with covariates, use the EIF estimator with cross-fitted ML for nuisances.
  • For larger panels, BJS is close to efficient and computationally simpler.
  • Software implementation forthcoming.

๐Ÿงญ Where this sits in the broader DiD literature

Companion to Borusyak-Hull-Roth (2025) "Harvesting" โ€” both chase efficiency from different angles. Builds on the double/debiased ML literature (Chernozhukov et al. 2018), Sant'Anna-Zhao (2020) doubly-robust DiD, and the semiparametric-efficiency tradition (Newey 1990, Van der Vaart 2000). Likely cited in BCCGS 2026 JEL as the modern efficiency benchmark.

๐Ÿ“ฅ Read the paper

  • Local PDF (981 KB) โ€” instant, no external request
  • arXiv 2506.17729
  • Cowles