Causal hybrid modeling with double machine learning—applications in carbon flux modeling

Hybrid modeling integrates machine learning with scientific knowledge to enhance interpretability, generalization, and adherence to natural laws. Nevertheless, equifinality and regularization biases pose challenges in hybrid modeling to achieve these purposes. This paper introduces a novel approach...

Full description

Bibliographic Details
Published in:Machine Learning: Science and Technology
Main Authors: Kai-Hendrik Cohrs, Gherardo Varando, Nuno Carvalhais, Markus Reichstein, Gustau Camps-Valls
Format: Article in Journal/Newspaper
Language:English
Published: IOP Publishing 2024
Subjects:
DML
Online Access:https://doi.org/10.1088/2632-2153/ad5a60
https://doaj.org/article/5f8cd148b4a14b0fb860bcef7d6ba5e2
id ftdoajarticles:oai:doaj.org/article:5f8cd148b4a14b0fb860bcef7d6ba5e2
record_format openpolar
spelling ftdoajarticles:oai:doaj.org/article:5f8cd148b4a14b0fb860bcef7d6ba5e2 2024-09-15T18:03:49+00:00 Causal hybrid modeling with double machine learning—applications in carbon flux modeling Kai-Hendrik Cohrs Gherardo Varando Nuno Carvalhais Markus Reichstein Gustau Camps-Valls 2024-01-01T00:00:00Z https://doi.org/10.1088/2632-2153/ad5a60 https://doaj.org/article/5f8cd148b4a14b0fb860bcef7d6ba5e2 EN eng IOP Publishing https://doi.org/10.1088/2632-2153/ad5a60 https://doaj.org/toc/2632-2153 doi:10.1088/2632-2153/ad5a60 2632-2153 https://doaj.org/article/5f8cd148b4a14b0fb860bcef7d6ba5e2 Machine Learning: Science and Technology, Vol 5, Iss 3, p 035021 (2024) knowledge-guided machine learning hybrid modeling causal effect estimation double machine learning temperature sensitivity carbon flux partitioning Computer engineering. Computer hardware TK7885-7895 Electronic computers. Computer science QA75.5-76.95 article 2024 ftdoajarticles https://doi.org/10.1088/2632-2153/ad5a60 2024-08-05T17:48:54Z Hybrid modeling integrates machine learning with scientific knowledge to enhance interpretability, generalization, and adherence to natural laws. Nevertheless, equifinality and regularization biases pose challenges in hybrid modeling to achieve these purposes. This paper introduces a novel approach to estimating hybrid models via a causal inference framework, specifically employing double machine learning (DML) to estimate causal effects. We showcase its use for the Earth sciences on two problems related to carbon dioxide fluxes. In the Q _10 model, we demonstrate that DML-based hybrid modeling is superior in estimating causal parameters over end-to-end deep neural network approaches, proving efficiency, robustness to bias from regularization methods, and circumventing equifinality. Our approach, applied to carbon flux partitioning, exhibits flexibility in accommodating heterogeneous causal effects. The study emphasizes the necessity of explicitly defining causal graphs and relationships, advocating for this as a general best practice. We encourage the continued exploration of causality in hybrid models for more interpretable and trustworthy results in knowledge-guided machine learning. Article in Journal/Newspaper DML Directory of Open Access Journals: DOAJ Articles Machine Learning: Science and Technology 5 3 035021
institution Open Polar
collection Directory of Open Access Journals: DOAJ Articles
op_collection_id ftdoajarticles
language English
topic knowledge-guided machine learning
hybrid modeling
causal effect estimation
double machine learning
temperature sensitivity
carbon flux partitioning
Computer engineering. Computer hardware
TK7885-7895
Electronic computers. Computer science
QA75.5-76.95
spellingShingle knowledge-guided machine learning
hybrid modeling
causal effect estimation
double machine learning
temperature sensitivity
carbon flux partitioning
Computer engineering. Computer hardware
TK7885-7895
Electronic computers. Computer science
QA75.5-76.95
Kai-Hendrik Cohrs
Gherardo Varando
Nuno Carvalhais
Markus Reichstein
Gustau Camps-Valls
Causal hybrid modeling with double machine learning—applications in carbon flux modeling
topic_facet knowledge-guided machine learning
hybrid modeling
causal effect estimation
double machine learning
temperature sensitivity
carbon flux partitioning
Computer engineering. Computer hardware
TK7885-7895
Electronic computers. Computer science
QA75.5-76.95
description Hybrid modeling integrates machine learning with scientific knowledge to enhance interpretability, generalization, and adherence to natural laws. Nevertheless, equifinality and regularization biases pose challenges in hybrid modeling to achieve these purposes. This paper introduces a novel approach to estimating hybrid models via a causal inference framework, specifically employing double machine learning (DML) to estimate causal effects. We showcase its use for the Earth sciences on two problems related to carbon dioxide fluxes. In the Q _10 model, we demonstrate that DML-based hybrid modeling is superior in estimating causal parameters over end-to-end deep neural network approaches, proving efficiency, robustness to bias from regularization methods, and circumventing equifinality. Our approach, applied to carbon flux partitioning, exhibits flexibility in accommodating heterogeneous causal effects. The study emphasizes the necessity of explicitly defining causal graphs and relationships, advocating for this as a general best practice. We encourage the continued exploration of causality in hybrid models for more interpretable and trustworthy results in knowledge-guided machine learning.
format Article in Journal/Newspaper
author Kai-Hendrik Cohrs
Gherardo Varando
Nuno Carvalhais
Markus Reichstein
Gustau Camps-Valls
author_facet Kai-Hendrik Cohrs
Gherardo Varando
Nuno Carvalhais
Markus Reichstein
Gustau Camps-Valls
author_sort Kai-Hendrik Cohrs
title Causal hybrid modeling with double machine learning—applications in carbon flux modeling
title_short Causal hybrid modeling with double machine learning—applications in carbon flux modeling
title_full Causal hybrid modeling with double machine learning—applications in carbon flux modeling
title_fullStr Causal hybrid modeling with double machine learning—applications in carbon flux modeling
title_full_unstemmed Causal hybrid modeling with double machine learning—applications in carbon flux modeling
title_sort causal hybrid modeling with double machine learning—applications in carbon flux modeling
publisher IOP Publishing
publishDate 2024
url https://doi.org/10.1088/2632-2153/ad5a60
https://doaj.org/article/5f8cd148b4a14b0fb860bcef7d6ba5e2
genre DML
genre_facet DML
op_source Machine Learning: Science and Technology, Vol 5, Iss 3, p 035021 (2024)
op_relation https://doi.org/10.1088/2632-2153/ad5a60
https://doaj.org/toc/2632-2153
doi:10.1088/2632-2153/ad5a60
2632-2153
https://doaj.org/article/5f8cd148b4a14b0fb860bcef7d6ba5e2
op_doi https://doi.org/10.1088/2632-2153/ad5a60
container_title Machine Learning: Science and Technology
container_volume 5
container_issue 3
container_start_page 035021
_version_ 1810441279437275136