πŸ”’

Literature Readings

Reading notes β€” please enter access password

βœ— Incorrect password

Literature Readings

DiD Β· Paper Detail

←All Readings ←DiD Methodology

πŸ“„ This paper

⚑TL;DR 🧩Setup πŸ“Main results πŸ› οΈFor practice 🧭In the lit πŸ“₯PDF

Literature Readings Β· DiD Β· Paper Detail

Harvesting Differences-in-Differences and Event-Study Estimators

Kirill Borusyak Β· Peter Hull Β· Jonathan Roth

NBER WP 34550MethodologyEstimator Combination

πŸ“₯ Read it

Local PDF (939 KB)NBER

⚑ TL;DR

The post-2020 DiD revolution produced several heterogeneity-robust estimators β€” Sun-Abraham, Callaway-Sant'Anna, Borusyak-Jaravel-Spiess, de Chaisemartin-D'HaultfΕ“uille, Wooldridge ETWFE β€” each with different efficiency and identification properties. This paper shows how to combine the strengths of multiple estimators into a single "harvested" estimator with improved efficiency relative to any individual estimator.

🧩 Setup & motivation

Each of the modern DiD estimators uses a slightly different identification strategy: BJS imputes counterfactuals from pre-period unit fits; CS uses group-time ATT building blocks with flexible control-group choice; SA reweights TWFE coefficients to remove cohort contamination. They all converge in large samples under parallel trends, but each is more efficient in particular settings.

Borusyak-Hull-Roth ask: can we combine them? The answer is yes β€” under linear-combination conditions, the optimal weighted combination of multiple estimators achieves lower asymptotic variance than any single estimator. The paper provides theory and practical implementation.

πŸ“ Main results

The harvesting framework

Let \(\hat\tau_1, \hat\tau_2, \dots, \hat\tau_K\) be \(K\) alternative DiD estimators of the same target parameter \(\tau\), with covariance matrix \(\Omega\) estimated jointly. The optimal linear combination is \(\hat\tau_{\text{combine}} = \sum_k w_k \hat\tau_k\) where the weights solve the minimum-variance problem. Closed-form: \(w^* = (\mathbf{1}'\Omega^{-1}\mathbf{1})^{-1} \Omega^{-1} \mathbf{1}\).

When does harvesting help?

Harvesting helps when individual estimators have imperfectly correlated sampling errors β€” i.e., they use different pieces of information from the same data. For well-known DiD estimators, the correlations are typically 0.7–0.9 but not 1.0, so combinations achieve 5–20% variance reduction. Bigger gains when the estimators use very different identifying variation (e.g., never-treated vs not-yet-treated controls).

Inference

Joint covariance is estimated via bootstrap or asymptotic theory. The paper provides a practitioner-friendly procedure that practitioners can run on top of existing estimator implementations.

πŸ› οΈ Implications for practice

  • For headline efficiency, report the harvested estimate alongside the individual estimates.
  • If the harvested estimate diverges substantially from individual estimates, that's a diagnostic for non-shared identifying variation.
  • R implementation forthcoming; the paper provides pseudo-code.

🧭 Where this sits in the broader DiD literature

Builds directly on Borusyak-Jaravel-Spiess (2024, REStud), Callaway-Sant'Anna (2021, J Econometrics), and Sun-Abraham (2021, J Econometrics). Extends the BJS efficiency-bound result to combinations of estimators. Companion to Chen-Sant'Anna-Xie (2025) on the semiparametric efficient influence function β€” both papers chase the same efficiency frontier from different directions.

πŸ“₯ Read the paper

  • Local PDF (939 KB) β€” instant, no external request
  • NBER