Applying double machine learning and BART methods to the American Causal Inference Conference 2022 Data Challenge

During the master thesis, Bayesian Additive Regression Tree (BART), Bayesian Causal Forest(BCF), and Double Machine Learning(DML) are applied to solve American Causal Inference Conference 2022 Data Challenge. Bayesian Causal Forest(BCF) is a variant of the Bayesian Additive regression tree (BART) mo...

Full description

Bibliographic Details
Main Author: Ruixuan Zhu
Format: Master Thesis
Language:English
Published: 2023
Subjects:
DML
Online Access:https://mediatum.ub.tum.de/1740110
https://mediatum.ub.tum.de/doc/1740110/document.pdf
id fttumuenchen:oai:mediatum.ub.tum.de:node/1740110
record_format openpolar
spelling fttumuenchen:oai:mediatum.ub.tum.de:node/1740110 2024-05-12T08:02:55+00:00 Applying double machine learning and BART methods to the American Causal Inference Conference 2022 Data Challenge Ruixuan Zhu 2023 application/pdf https://mediatum.ub.tum.de/1740110 https://mediatum.ub.tum.de/doc/1740110/document.pdf eng eng https://mediatum.ub.tum.de/1740110 https://mediatum.ub.tum.de/doc/1740110/document.pdf info:eu-repo/semantics/openAccess 510 Mathematik masterThesis 2023 fttumuenchen 2024-04-17T14:06:26Z During the master thesis, Bayesian Additive Regression Tree (BART), Bayesian Causal Forest(BCF), and Double Machine Learning(DML) are applied to solve American Causal Inference Conference 2022 Data Challenge. Bayesian Causal Forest(BCF) is a variant of the Bayesian Additive regression tree (BART) model. The R language is used for all implementations. For evaluation of the performances of these three models, Root Mean Squared Error(RMSE), uncertainty interval coverage, uncertainty interval width, and absolute bias are employed as metrics. Root Mean Squared Error(RMSE) and uncertainty interval coverage are emphasized among the four metrics since they are highlighted by the Data Challenge host. The evaluations show that the three models all have a good performance regarding Root Mean Square Error(RMSE) and the two BART-based models have much better performances than Double Machine Learning(DML) in terms of uncertainty interval coverage. Within BART-based models, Bayesian Causal Forest(BCF) outperformed Bayesian Additive Regression Tree(BART). Moreover, the two BART-based models outperformed Double Machine Learning(DML) significantly concerning the subgroup estimands, which is crucial for dealing with treatment effect heterogeneity. Master Thesis DML Munich University of Technology (TUM): mediaTUM
institution Open Polar
collection Munich University of Technology (TUM): mediaTUM
op_collection_id fttumuenchen
language English
topic 510 Mathematik
spellingShingle 510 Mathematik
Ruixuan Zhu
Applying double machine learning and BART methods to the American Causal Inference Conference 2022 Data Challenge
topic_facet 510 Mathematik
description During the master thesis, Bayesian Additive Regression Tree (BART), Bayesian Causal Forest(BCF), and Double Machine Learning(DML) are applied to solve American Causal Inference Conference 2022 Data Challenge. Bayesian Causal Forest(BCF) is a variant of the Bayesian Additive regression tree (BART) model. The R language is used for all implementations. For evaluation of the performances of these three models, Root Mean Squared Error(RMSE), uncertainty interval coverage, uncertainty interval width, and absolute bias are employed as metrics. Root Mean Squared Error(RMSE) and uncertainty interval coverage are emphasized among the four metrics since they are highlighted by the Data Challenge host. The evaluations show that the three models all have a good performance regarding Root Mean Square Error(RMSE) and the two BART-based models have much better performances than Double Machine Learning(DML) in terms of uncertainty interval coverage. Within BART-based models, Bayesian Causal Forest(BCF) outperformed Bayesian Additive Regression Tree(BART). Moreover, the two BART-based models outperformed Double Machine Learning(DML) significantly concerning the subgroup estimands, which is crucial for dealing with treatment effect heterogeneity.
format Master Thesis
author Ruixuan Zhu
author_facet Ruixuan Zhu
author_sort Ruixuan Zhu
title Applying double machine learning and BART methods to the American Causal Inference Conference 2022 Data Challenge
title_short Applying double machine learning and BART methods to the American Causal Inference Conference 2022 Data Challenge
title_full Applying double machine learning and BART methods to the American Causal Inference Conference 2022 Data Challenge
title_fullStr Applying double machine learning and BART methods to the American Causal Inference Conference 2022 Data Challenge
title_full_unstemmed Applying double machine learning and BART methods to the American Causal Inference Conference 2022 Data Challenge
title_sort applying double machine learning and bart methods to the american causal inference conference 2022 data challenge
publishDate 2023
url https://mediatum.ub.tum.de/1740110
https://mediatum.ub.tum.de/doc/1740110/document.pdf
genre DML
genre_facet DML
op_relation https://mediatum.ub.tum.de/1740110
https://mediatum.ub.tum.de/doc/1740110/document.pdf
op_rights info:eu-repo/semantics/openAccess
_version_ 1798845048032329728