Applying double machine learning and BART methods to the American Causal Inference Conference 2022 Data Challenge
During the master thesis, Bayesian Additive Regression Tree (BART), Bayesian Causal Forest(BCF), and Double Machine Learning(DML) are applied to solve American Causal Inference Conference 2022 Data Challenge. Bayesian Causal Forest(BCF) is a variant of the Bayesian Additive regression tree (BART) mo...
Main Author: | |
---|---|
Format: | Master Thesis |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://mediatum.ub.tum.de/1740110 https://mediatum.ub.tum.de/doc/1740110/document.pdf |
id |
fttumuenchen:oai:mediatum.ub.tum.de:node/1740110 |
---|---|
record_format |
openpolar |
spelling |
fttumuenchen:oai:mediatum.ub.tum.de:node/1740110 2024-05-12T08:02:55+00:00 Applying double machine learning and BART methods to the American Causal Inference Conference 2022 Data Challenge Ruixuan Zhu 2023 application/pdf https://mediatum.ub.tum.de/1740110 https://mediatum.ub.tum.de/doc/1740110/document.pdf eng eng https://mediatum.ub.tum.de/1740110 https://mediatum.ub.tum.de/doc/1740110/document.pdf info:eu-repo/semantics/openAccess 510 Mathematik masterThesis 2023 fttumuenchen 2024-04-17T14:06:26Z During the master thesis, Bayesian Additive Regression Tree (BART), Bayesian Causal Forest(BCF), and Double Machine Learning(DML) are applied to solve American Causal Inference Conference 2022 Data Challenge. Bayesian Causal Forest(BCF) is a variant of the Bayesian Additive regression tree (BART) model. The R language is used for all implementations. For evaluation of the performances of these three models, Root Mean Squared Error(RMSE), uncertainty interval coverage, uncertainty interval width, and absolute bias are employed as metrics. Root Mean Squared Error(RMSE) and uncertainty interval coverage are emphasized among the four metrics since they are highlighted by the Data Challenge host. The evaluations show that the three models all have a good performance regarding Root Mean Square Error(RMSE) and the two BART-based models have much better performances than Double Machine Learning(DML) in terms of uncertainty interval coverage. Within BART-based models, Bayesian Causal Forest(BCF) outperformed Bayesian Additive Regression Tree(BART). Moreover, the two BART-based models outperformed Double Machine Learning(DML) significantly concerning the subgroup estimands, which is crucial for dealing with treatment effect heterogeneity. Master Thesis DML Munich University of Technology (TUM): mediaTUM |
institution |
Open Polar |
collection |
Munich University of Technology (TUM): mediaTUM |
op_collection_id |
fttumuenchen |
language |
English |
topic |
510 Mathematik |
spellingShingle |
510 Mathematik Ruixuan Zhu Applying double machine learning and BART methods to the American Causal Inference Conference 2022 Data Challenge |
topic_facet |
510 Mathematik |
description |
During the master thesis, Bayesian Additive Regression Tree (BART), Bayesian Causal Forest(BCF), and Double Machine Learning(DML) are applied to solve American Causal Inference Conference 2022 Data Challenge. Bayesian Causal Forest(BCF) is a variant of the Bayesian Additive regression tree (BART) model. The R language is used for all implementations. For evaluation of the performances of these three models, Root Mean Squared Error(RMSE), uncertainty interval coverage, uncertainty interval width, and absolute bias are employed as metrics. Root Mean Squared Error(RMSE) and uncertainty interval coverage are emphasized among the four metrics since they are highlighted by the Data Challenge host. The evaluations show that the three models all have a good performance regarding Root Mean Square Error(RMSE) and the two BART-based models have much better performances than Double Machine Learning(DML) in terms of uncertainty interval coverage. Within BART-based models, Bayesian Causal Forest(BCF) outperformed Bayesian Additive Regression Tree(BART). Moreover, the two BART-based models outperformed Double Machine Learning(DML) significantly concerning the subgroup estimands, which is crucial for dealing with treatment effect heterogeneity. |
format |
Master Thesis |
author |
Ruixuan Zhu |
author_facet |
Ruixuan Zhu |
author_sort |
Ruixuan Zhu |
title |
Applying double machine learning and BART methods to the American Causal Inference Conference 2022 Data Challenge |
title_short |
Applying double machine learning and BART methods to the American Causal Inference Conference 2022 Data Challenge |
title_full |
Applying double machine learning and BART methods to the American Causal Inference Conference 2022 Data Challenge |
title_fullStr |
Applying double machine learning and BART methods to the American Causal Inference Conference 2022 Data Challenge |
title_full_unstemmed |
Applying double machine learning and BART methods to the American Causal Inference Conference 2022 Data Challenge |
title_sort |
applying double machine learning and bart methods to the american causal inference conference 2022 data challenge |
publishDate |
2023 |
url |
https://mediatum.ub.tum.de/1740110 https://mediatum.ub.tum.de/doc/1740110/document.pdf |
genre |
DML |
genre_facet |
DML |
op_relation |
https://mediatum.ub.tum.de/1740110 https://mediatum.ub.tum.de/doc/1740110/document.pdf |
op_rights |
info:eu-repo/semantics/openAccess |
_version_ |
1798845048032329728 |