Search and Filter

Comparing Machine Learning Methods for Estimating Heterogeneous Treatment Effects in Randomized Trials: A Comprehensive Simulation Study

This study compares 18 machine learning methods for estimating heterogeneous treatment effects in randomized controlled trials, using simulations calibrated to two large-scale educational experiments. We evaluate performance across continuous and binary outcomes with diverse and realistic treatment effect heterogeneity patterns, varying sample sizes, covariate complexities, and effect magnitudes. Bayesian Additive Regression Trees with S-learner (BART S) outperforms alternatives on average. While no method predicts individual effects with high accuracy, some show promise in identifying who benefits most or least. An empirical application illustrates how ML methods can reveal heterogeneity patterns beyond conventional subgroup analysis. These findings highlight both the potential and the limitations of ML, offering evidence-based practical guidance for analyzing treatment effect variation in experimental evaluations.

Keywords
Machine Learning; Heterogeneous Treatment Effect; Randomized Controlled Trials; Simulation Study; Educational Research
Education level
Topics
Document Object Identifier (DOI)
10.26300/qdkn-z470
EdWorkingPaper suggested citation:
Miratrix, Luke, Polina Polskaia, Richard Dorsett, Pei Zhu, Nicholas Commins, and J. David Selby. (). Comparing Machine Learning Methods for Estimating Heterogeneous Treatment Effects in Randomized Trials: A Comprehensive Simulation Study. (EdWorkingPaper: -1276). Retrieved from Annenberg Institute at Brown University: https://doi.org/10.26300/qdkn-z470

Machine-readable bibliographic record: RIS, BibTeX