Double/debiased machine learning for treatment and structural parameters

We revisit the classic semiparametric problem of inference on a low di-mensional parameter Ø0 in the presence of high-dimensional nuisance parameters Û0. We depart from the classical setting by allowing for Û0 to be so high-dimensional that the traditional assumptions, such as Donsker properties, th...

Full description

Bibliographic Details
Main Authors:	Chernozhukov, Victor, Chetverikov, Denis, Demirer, Mert, Duflo, Esther, Hansen, Christian B., Newey, Whitney K., Robins, James
Format:	Report
Language:	English
Published:	London: Centre for Microdata Methods and Practice (cemmap) 2017
Subjects:	ddc:330 Kausalanalyse Ökonometrie DML
Online Access:	http://hdl.handle.net/10419/189736 https://doi.org/10.1920/wp.cem.2017.2817

id	ftzbwkiel:oai:econstor.eu:10419/189736
record_format	openpolar
spelling	ftzbwkiel:oai:econstor.eu:10419/189736 2023-12-17T10:29:26+01:00 Double/debiased machine learning for treatment and structural parameters Chernozhukov, Victor Chetverikov, Denis Demirer, Mert Duflo, Esther Hansen, Christian B. Newey, Whitney K. Robins, James 2017 http://hdl.handle.net/10419/189736 https://doi.org/10.1920/wp.cem.2017.2817 eng eng London: Centre for Microdata Methods and Practice (cemmap) Series: cemmap working paper No. CWP28/17 gbv-ppn:889233446 doi:10.1920/wp.cem.2017.2817 http://hdl.handle.net/10419/189736 RePEc:ifs:cemmap:28/17 http://www.econstor.eu/dspace/Nutzungsbedingungen ddc:330 Kausalanalyse Ökonometrie doc-type:workingPaper 2017 ftzbwkiel https://doi.org/10.1920/wp.cem.2017.2817 2023-11-20T00:42:46Z We revisit the classic semiparametric problem of inference on a low di-mensional parameter Ø0 in the presence of high-dimensional nuisance parameters Û0. We depart from the classical setting by allowing for Û0 to be so high-dimensional that the traditional assumptions, such as Donsker properties, that limit complexity of the parameter space for this object break down. To estimate Û0, we consider the use of statistical or machine learning (ML) methods which are particularly well-suited to estimation in modern, very high-dimensional cases. ML methods perform well by employing regularization to reduce variance and trading off regularization bias with overfitting in practice. However, both regularization bias and overfitting in estimating Û0 cause a heavy bias in estimators of Ø0 that are obtained by naively plugging ML estimators of Û0 into estimating equations for Ø0. This bias results in the naive estimator failing to be N -1/2 consistent, where N is the sample size. We show that the impact of regularization bias and overfitting on estimation of the parameter of interest Ø0 can be removed by using two simple, yet critical, ingredients: (1) using Neyman-orthogonal moments/scores that have reduced sensitivity with respect to nuisance parameters to estimate Ø0, and (2) making use of cross-fitting which provides an efficient form of data-splitting. We call the resulting set of methods double or debiased ML (DML). We verify that DML delivers point estimators that concentrate in a N -1/2-neighborhood of the true parameter values and are approximately unbiased and normally distributed, which allows construction of valid confidence statements. The generic statistical theory of DML is elementary and simultaneously relies on only weak theoretical requirements which will admit the use of a broad array of modern ML methods for estimating the nuisance parameters such as random forests, lasso, ridge, deep neural nets, boosted trees, and various hybrids and ensembles of these methods. We illustrate the general theory by ... Report DML EconStor (German National Library of Economics, ZBW)
institution	Open Polar
collection	EconStor (German National Library of Economics, ZBW)
op_collection_id	ftzbwkiel
language	English
topic	ddc:330 Kausalanalyse Ökonometrie
spellingShingle	ddc:330 Kausalanalyse Ökonometrie Chernozhukov, Victor Chetverikov, Denis Demirer, Mert Duflo, Esther Hansen, Christian B. Newey, Whitney K. Robins, James Double/debiased machine learning for treatment and structural parameters
topic_facet	ddc:330 Kausalanalyse Ökonometrie
description	We revisit the classic semiparametric problem of inference on a low di-mensional parameter Ø0 in the presence of high-dimensional nuisance parameters Û0. We depart from the classical setting by allowing for Û0 to be so high-dimensional that the traditional assumptions, such as Donsker properties, that limit complexity of the parameter space for this object break down. To estimate Û0, we consider the use of statistical or machine learning (ML) methods which are particularly well-suited to estimation in modern, very high-dimensional cases. ML methods perform well by employing regularization to reduce variance and trading off regularization bias with overfitting in practice. However, both regularization bias and overfitting in estimating Û0 cause a heavy bias in estimators of Ø0 that are obtained by naively plugging ML estimators of Û0 into estimating equations for Ø0. This bias results in the naive estimator failing to be N -1/2 consistent, where N is the sample size. We show that the impact of regularization bias and overfitting on estimation of the parameter of interest Ø0 can be removed by using two simple, yet critical, ingredients: (1) using Neyman-orthogonal moments/scores that have reduced sensitivity with respect to nuisance parameters to estimate Ø0, and (2) making use of cross-fitting which provides an efficient form of data-splitting. We call the resulting set of methods double or debiased ML (DML). We verify that DML delivers point estimators that concentrate in a N -1/2-neighborhood of the true parameter values and are approximately unbiased and normally distributed, which allows construction of valid confidence statements. The generic statistical theory of DML is elementary and simultaneously relies on only weak theoretical requirements which will admit the use of a broad array of modern ML methods for estimating the nuisance parameters such as random forests, lasso, ridge, deep neural nets, boosted trees, and various hybrids and ensembles of these methods. We illustrate the general theory by ...
format	Report
author	Chernozhukov, Victor Chetverikov, Denis Demirer, Mert Duflo, Esther Hansen, Christian B. Newey, Whitney K. Robins, James
author_facet	Chernozhukov, Victor Chetverikov, Denis Demirer, Mert Duflo, Esther Hansen, Christian B. Newey, Whitney K. Robins, James
author_sort	Chernozhukov, Victor
title	Double/debiased machine learning for treatment and structural parameters
title_short	Double/debiased machine learning for treatment and structural parameters
title_full	Double/debiased machine learning for treatment and structural parameters
title_fullStr	Double/debiased machine learning for treatment and structural parameters
title_full_unstemmed	Double/debiased machine learning for treatment and structural parameters
title_sort	double/debiased machine learning for treatment and structural parameters
publisher	London: Centre for Microdata Methods and Practice (cemmap)
publishDate	2017
url	http://hdl.handle.net/10419/189736 https://doi.org/10.1920/wp.cem.2017.2817
genre	DML
genre_facet	DML
op_relation	Series: cemmap working paper No. CWP28/17 gbv-ppn:889233446 doi:10.1920/wp.cem.2017.2817 http://hdl.handle.net/10419/189736 RePEc:ifs:cemmap:28/17
op_rights	http://www.econstor.eu/dspace/Nutzungsbedingungen
op_doi	https://doi.org/10.1920/wp.cem.2017.2817
_version_	1785581817560039424

Double/debiased machine learning for treatment and structural parameters

Similar Items