Observational Causal Inference

Estimate cause and effect when you can't run an experiment • 40 papers

7 subtopics

Matching & Propensity Scores

Compare similar treated and untreated groups fairly

1983 29661 cited

The Central Role of the Propensity Score in Observational Studies

Paul Rosenbaum, Donald Rubin

The foundational paper introducing propensity scores for causal inference in observational studies.

2019 4192 cited

Matching as Nonparametric Preprocessing for Reducing Model Dependence

Gary King, Richard Nielsen

Best practices for matching methods and the MatchIt software implementation.

2018 1894 cited

Double/Debiased Machine Learning for Treatment and Structural Parameters

Victor Chernozhukov, Denis Chetverikov, Mert Demirer, et al.

The double ML framework using cross-fitting to obtain valid inference with ML first-stage estimation.

2012 4786 cited

Entropy Balancing for Causal Effects: A Multivariate Reweighting Method to Produce Balanced Samples in Observational Studies

Jens Hainmueller

Introduced entropy balancing, which achieves exact covariate balance through maximum entropy reweighting—eliminating iterative propensity score model searching.

2005 1823 cited

Doubly Robust Estimation in Missing Data and Causal Inference Models

Heejung Bang, James M. Robins

Accessible exposition of the augmented IPW estimator, consistent if either propensity score or outcome model is correct—the foundational 'doubly robust' property.

1995 856 cited

Semiparametric Efficiency in Multivariate Regression Models with Missing Data

James M. Robins, Andrea Rotnitzky

Foundational theoretical paper deriving semiparametric efficiency bounds and introducing the AIPW estimator class underlying all modern doubly robust methods.

Difference-in-Differences

Measure impact of changes that roll out over time

2023

What's Trending in Difference-in-Differences? A Synthesis of the Recent Econometrics Literature

Jonathan Roth, Pedro Sant'Anna, Alyssa Bilinski, John Poe

Comprehensive review of modern DiD methods including staggered adoption and heterogeneous effects.

2021 5879 cited

Difference-in-Differences with Variation in Treatment Timing

Andrew Goodman-Bacon

Decomposes two-way fixed effects estimators and reveals issues with staggered DiD designs.

2021 1413 cited

Difference-in-Differences with Multiple Time Periods

Brantly Callaway, Pedro Sant'Anna

Group-time average treatment effects and aggregation methods for staggered DiD.

2020 3759 cited

Two-Way Fixed Effects Estimators with Heterogeneous Treatment Effects

Clément de Chaisemartin, Xavier D'Haultfœuille

Demonstrates TWFE regressions estimate weighted sums of ATEs with potentially negative weights—proposing the DIDM estimator as solution.

2021 3857 cited

Estimating Dynamic Treatment Effects in Event Studies with Heterogeneous Treatment Effects

Liyang Sun, Sarah Abraham

Shows TWFE event-study coefficients are contaminated by effects from other periods, proposing an interaction-weighted estimator.

2023 841 cited

A More Credible Approach to Parallel Trends

Ashesh Rambachan, Jonathan Roth

Formal sensitivity analysis for parallel trends violations—implemented in the widely-used HonestDiD package.

Synthetic Control

Create a comparison group when you only have one treated unit

2010 5029 cited

Synthetic Control Methods for Comparative Case Studies

Alberto Abadie, Alexis Diamond, Jens Hainmueller

The foundational synthetic control paper with the California tobacco application.

2021 1186 cited

Using Synthetic Controls: Feasibility, Data Requirements, and Methodological Aspects

Alberto Abadie

Practical guidance on when and how to apply synthetic control methods.

2021 12 cited

Synthetic Difference in Differences

Dmitry Arkhangelsky, Susan Athey, David Hirshberg, et al.

Combines synthetic control and DiD for improved inference in panel data settings.

2021 115 cited

Matrix Completion Methods for Causal Panel Data Models

Susan Athey, Mohsen Bayati, Nikolay Doudchenko, Guido Imbens, Khashayar Khosravi

Bridges matrix completion/ML with synthetic control using nuclear norm regularization—handles staggered adoption and outperforms traditional SC.

2021 66 cited

The Augmented Synthetic Control Method

Eli Ben-Michael, Avi Feller, Jesse Rothstein

Extends synthetic control to settings where perfect pre-treatment fit is infeasible using ridge regression to de-bias estimates.

2018 182 cited

Synthetic Control Method: Inference, Sensitivity Analysis and Confidence Sets

Sergio Firpo, Vitor Possebom

Essential theoretical foundation for statistical inference in SC applications, extending permutation tests and constructing proper confidence sets.

Instrumental Variables & LATE

Find causal effects using natural experiments

1994 3981 cited

Identification and Estimation of Local Average Treatment Effects

Guido Imbens, Joshua Angrist

The LATE framework for interpreting IV estimates as effects on compliers.

1996 4037 cited

Identification of Causal Effects Using Instrumental Variables

Joshua Angrist, Guido Imbens, Donald Rubin

Defines the assumptions needed for IV and connects to potential outcomes framework.

2005 132 cited

Testing for Weak Instruments in Linear IV Regression

James H. Stock, Motohiro Yogo

Foundational framework for detecting weak instruments—the origin of the 'first-stage F > 10' rule now required in all IV applications.

2022 135 cited

Quasi-Experimental Shift-Share Research Designs

Kirill Borusyak, Peter Hull, Xavier Jaravel

Modern econometric framework for Bartik instruments—identification follows from quasi-random shock assignment rather than exogenous shares.

2023 85 cited

Judging Judge Fixed Effects

Brigham R. Frandsen, Lars Lefgren, Emily Leslie

Develops nonparametric tests for exclusion and monotonicity in examiner IV designs—essential for criminal justice, disability, and immigration research.

Regression Discontinuity

Exploit eligibility cutoffs to measure program effects

2010 65 cited

Regression Discontinuity Designs in Economics

David Lee, Thomas Lemieux

Comprehensive guide to RDD identification, estimation, and practical implementation.

2019 474 cited

A Practical Introduction to Regression Discontinuity Designs

Matias Cattaneo, Nicolas Idrobo, Rocio Titiunik

Modern implementation guide with rdrobust software for sharp and fuzzy RDD.

2008 995 cited

Manipulation of the Running Variable in the Regression Discontinuity Design: A Density Test

Justin McCrary

Introduces the canonical density discontinuity test for detecting manipulation at the cutoff—now a required falsification check in all RDD work.

2012 2124 cited

Optimal Bandwidth Choice for the Regression Discontinuity Estimator

Guido Imbens, Karthik Kalyanaraman

Derives the MSE-optimal bandwidth for local linear RD estimation—the first principled approach to bandwidth selection.

2014 2799 cited

Robust Nonparametric Confidence Intervals for Regression-Discontinuity Designs

Sebastian Calonico, Matias D. Cattaneo, Rocío Titiunik

Shows MSE-optimal bandwidths yield invalid conventional CIs and develops bias-corrected robust inference—foundation for the rdrobust package.

Double ML & Heterogeneous Effects

Find which customers benefit most from an intervention

2018 2467 cited

Estimation and Inference of Heterogeneous Treatment Effects using Random Forests

Stefan Wager, Susan Athey

Causal forests for estimating conditional average treatment effects with valid confidence intervals.

2020

Generic Machine Learning Inference on Heterogeneous Treatment Effects in Randomized Experiments

Victor Chernozhukov, Mert Demirer, Esther Duflo, Ivan Fernández-Val

Framework for using any ML method to find heterogeneous effects with valid inference.

2019 858 cited

Metalearners for Estimating Heterogeneous Treatment Effects using Machine Learning

Sören Künzel, Jasjeet Sekhon, Peter Bickel, Bin Yu

Taxonomy of metalearners (S-learner, T-learner, X-learner) for CATE estimation.

2021 93 cited

Quasi-Oracle Estimation of Heterogeneous Treatment Effects

Xinkun Nie, Stefan Wager

Introduces the R-learner framework achieving quasi-oracle efficiency—matching error bounds of an oracle knowing nuisance components.

2023 110 cited

Towards Optimal Doubly Robust Estimation of Heterogeneous Causal Effects

Edward H. Kennedy

Establishes model-free oracle inequalities for the DR-learner—doubly robust CATE estimation achieves faster convergence rates.

2021 3 cited

Policy Learning With Observational Data

Susan Athey, Stefan Wager

Theoretical foundations for learning optimal treatment policies from observational data using doubly robust scores.

Sensitivity & Bounds

Stress-test your causal conclusions

2017 4886 cited

Sensitivity Analysis in Observational Research: Introducing the E-Value

Tyler VanderWeele, Peng Ding

The E-value for quantifying sensitivity to unmeasured confounding.

2020 819 cited

Making Sense of Sensitivity: Extending Omitted Variable Bias

Carlos Cinelli, Chad Hazlett

Modern sensitivity analysis framework with intuitive benchmarking against observed covariates.

1990 567 cited

Nonparametric Bounds on Treatment Effects

Charles F. Manski

Foundational paper establishing the partial identification paradigm—showing what can be learned under minimal assumptions when point identification fails.

2009 1321 cited

Training, Wages, and Sample Selection: Estimating Sharp Bounds on Treatment Effects

David S. Lee

Developed the 'Lee bounds' trimming procedure for handling attrition under monotonicity—now a standard robustness check for differential selection.

2019 4287 cited

Unobservable Selection and Coefficient Stability: Theory and Evidence

Emily Oster

Shows how to jointly use coefficient movements and R-squared changes to bound omitted variable bias—the workhorse sensitivity analysis.

2005 3560 cited

Selection on Observed and Unobserved Variables: Assessing the Effectiveness of Catholic Schools

Joseph G. Altonji, Todd E. Elder, Christopher R. Taber

Pioneered the insight that selection on observables guides selection on unobservables—framework for assessing confounding needed to explain away effects.

Must-read papers for tech economists and applied researchers