Causal hybrid modeling with double machine learning ...

Hybrid modeling integrates machine learning with scientific knowledge to enhance interpretability, generalization, and adherence to natural laws. Nevertheless, equifinality and regularization biases pose challenges in hybrid modeling to achieve these purposes. This paper introduces a novel approach...

Full description

Bibliographic Details
Main Authors: Cohrs, Kai-Hendrik, Varando, Gherardo, Carvalhais, Nuno, Reichstein, Markus, Camps-Valls, Gustau
Format: Article in Journal/Newspaper
Language:unknown
Published: arXiv 2024
Subjects:
DML
Online Access:https://dx.doi.org/10.48550/arxiv.2402.13332
https://arxiv.org/abs/2402.13332
Description
Summary:Hybrid modeling integrates machine learning with scientific knowledge to enhance interpretability, generalization, and adherence to natural laws. Nevertheless, equifinality and regularization biases pose challenges in hybrid modeling to achieve these purposes. This paper introduces a novel approach to estimating hybrid models via a causal inference framework, specifically employing Double Machine Learning (DML) to estimate causal effects. We showcase its use for the Earth sciences on two problems related to carbon dioxide fluxes. In the $Q_{10}$ model, we demonstrate that DML-based hybrid modeling is superior in estimating causal parameters over end-to-end deep neural network (DNN) approaches, proving efficiency, robustness to bias from regularization methods, and circumventing equifinality. Our approach, applied to carbon flux partitioning, exhibits flexibility in accommodating heterogeneous causal effects. The study emphasizes the necessity of explicitly defining causal graphs and relationships, advocating ...