Machine learning in the estimation of causal effects: targeted minimum loss-based estimation and double/debiased machine learning

Summary In recent decades, the fields of statistical and machine learning have seen a revolution in the development of data-adaptive regression methods that have optimal performance under flexible, sometimes minimal, assumptions on the true regression functions. These developments have impacted all...

Full description

Bibliographic Details
Published in:Biostatistics
Main Author: Díaz, Iván
Format: Article in Journal/Newspaper
Language:English
Published: Oxford University Press (OUP) 2019
Subjects:
DML
Online Access:http://dx.doi.org/10.1093/biostatistics/kxz042
http://academic.oup.com/biostatistics/advance-article-pdf/doi/10.1093/biostatistics/kxz042/30988844/kxz042.pdf
Description
Summary:Summary In recent decades, the fields of statistical and machine learning have seen a revolution in the development of data-adaptive regression methods that have optimal performance under flexible, sometimes minimal, assumptions on the true regression functions. These developments have impacted all areas of applied and theoretical statistics and have allowed data analysts to avoid the biases incurred under the pervasive practice of parametric model misspecification. In this commentary, I discuss issues around the use of data-adaptive regression in estimation of causal inference parameters. To ground ideas, I focus on two estimation approaches with roots in semi-parametric estimation theory: targeted minimum loss-based estimation (TMLE; van der Laan and Rubin, 2006) and double/debiased machine learning (DML; Chernozhukov and others, 2018). This commentary is not comprehensive, the literature on these topics is rich, and there are many subtleties and developments which I do not address. These two frameworks represent only a small fraction of an increasingly large number of methods for causal inference using machine learning. To my knowledge, they are the only methods grounded in statistical semi-parametric theory that also allow unrestricted use of data-adaptive regression techniques.