A comparative assessment of the uncertainties of global surface ocean CO2 estimates using a machine-learning ensemble (CSIR-ML6 version 2019a) - have we hit the wall?

Over the last decade, advanced statistical inference and machine learning have been used to fill the gaps in sparse surface ocean CO2 measurements (Rödenbeck et al., 2015). The estimates from these methods have been used to constrain seasonal, interannual and decadal variability in sea–air CO2 fluxe...

Full description

Bibliographic Details
Main Authors: Gregor, Luke, Lebehot, Alice, Kok, Schalk, Scheel Monteiro, Pedro M.
Format: Article in Journal/Newspaper
Language:English
Published: Copernicus 2019
Subjects:
Online Access:https://hdl.handle.net/20.500.11850/389172
https://doi.org/10.3929/ethz-b-000387134
id ftethz:oai:www.research-collection.ethz.ch:20.500.11850/389172
record_format openpolar
spelling ftethz:oai:www.research-collection.ethz.ch:20.500.11850/389172 2023-05-15T18:25:54+02:00 A comparative assessment of the uncertainties of global surface ocean CO2 estimates using a machine-learning ensemble (CSIR-ML6 version 2019a) - have we hit the wall? Gregor, Luke Lebehot, Alice Kok, Schalk Scheel Monteiro, Pedro M. 2019-12-10 application/application/pdf https://hdl.handle.net/20.500.11850/389172 https://doi.org/10.3929/ethz-b-000387134 en eng Copernicus info:eu-repo/semantics/altIdentifier/doi/10.5194/gmd-12-5113-2019 info:eu-repo/semantics/altIdentifier/wos/000502280600001 http://hdl.handle.net/20.500.11850/389172 doi:10.3929/ethz-b-000387134 info:eu-repo/semantics/openAccess http://creativecommons.org/licenses/by/4.0/ Creative Commons Attribution 4.0 International CC-BY Geoscientific Model Development, 12 (12) info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion 2019 ftethz https://doi.org/20.500.11850/389172 https://doi.org/10.3929/ethz-b-000387134 https://doi.org/10.5194/gmd-12-5113-2019 2023-02-13T00:50:22Z Over the last decade, advanced statistical inference and machine learning have been used to fill the gaps in sparse surface ocean CO2 measurements (Rödenbeck et al., 2015). The estimates from these methods have been used to constrain seasonal, interannual and decadal variability in sea–air CO2 fluxes and the drivers of these changes (Landschützer et al., 2015, 2016; Gregor et al., 2018). However, it is also becoming clear that these methods are converging towards a common bias and root mean square error (RMSE) boundary: “the wall”, which suggests that pCO2 estimates are now limited by both data gaps and scale-sensitive observations. Here, we analyse this problem by introducing a new gap-filling method, an ensemble average of six machine-learning models (CSIR-ML6 version 2019a, Council for Scientific and Industrial Research – Machine Learning ensemble with Six members), where each model is constructed with a two-step clustering-regression approach. The ensemble average is then statistically compared to well-established methods. The ensemble average, CSIR-ML6, has an RMSE of 17.16 µatm and bias of 0.89 µatm when compared to a test dataset kept separate from training procedures. However, when validating our estimates with independent datasets, we find that our method improves only incrementally on other gap-filling methods. We investigate the differences between the methods to understand the extent of the limitations of gap-filling estimates of pCO2. We show that disagreement between methods in the South Atlantic, southeastern Pacific and parts of the Southern Ocean is too large to interpret the interannual variability with confidence. We conclude that improvements in surface ocean pCO2 estimates will likely be incremental with the optimisation of gap-filling methods by (1) the inclusion of additional clustering and regression variables (e.g. eddy kinetic energy), (2) increasing the sampling resolution and (3) successfully incorporating pCO2 estimates from alternate platforms (e.g. floats, gliders) into existing ... Article in Journal/Newspaper Southern Ocean ETH Zürich Research Collection Pacific Southern Ocean
institution Open Polar
collection ETH Zürich Research Collection
op_collection_id ftethz
language English
description Over the last decade, advanced statistical inference and machine learning have been used to fill the gaps in sparse surface ocean CO2 measurements (Rödenbeck et al., 2015). The estimates from these methods have been used to constrain seasonal, interannual and decadal variability in sea–air CO2 fluxes and the drivers of these changes (Landschützer et al., 2015, 2016; Gregor et al., 2018). However, it is also becoming clear that these methods are converging towards a common bias and root mean square error (RMSE) boundary: “the wall”, which suggests that pCO2 estimates are now limited by both data gaps and scale-sensitive observations. Here, we analyse this problem by introducing a new gap-filling method, an ensemble average of six machine-learning models (CSIR-ML6 version 2019a, Council for Scientific and Industrial Research – Machine Learning ensemble with Six members), where each model is constructed with a two-step clustering-regression approach. The ensemble average is then statistically compared to well-established methods. The ensemble average, CSIR-ML6, has an RMSE of 17.16 µatm and bias of 0.89 µatm when compared to a test dataset kept separate from training procedures. However, when validating our estimates with independent datasets, we find that our method improves only incrementally on other gap-filling methods. We investigate the differences between the methods to understand the extent of the limitations of gap-filling estimates of pCO2. We show that disagreement between methods in the South Atlantic, southeastern Pacific and parts of the Southern Ocean is too large to interpret the interannual variability with confidence. We conclude that improvements in surface ocean pCO2 estimates will likely be incremental with the optimisation of gap-filling methods by (1) the inclusion of additional clustering and regression variables (e.g. eddy kinetic energy), (2) increasing the sampling resolution and (3) successfully incorporating pCO2 estimates from alternate platforms (e.g. floats, gliders) into existing ...
format Article in Journal/Newspaper
author Gregor, Luke
Lebehot, Alice
Kok, Schalk
Scheel Monteiro, Pedro M.
spellingShingle Gregor, Luke
Lebehot, Alice
Kok, Schalk
Scheel Monteiro, Pedro M.
A comparative assessment of the uncertainties of global surface ocean CO2 estimates using a machine-learning ensemble (CSIR-ML6 version 2019a) - have we hit the wall?
author_facet Gregor, Luke
Lebehot, Alice
Kok, Schalk
Scheel Monteiro, Pedro M.
author_sort Gregor, Luke
title A comparative assessment of the uncertainties of global surface ocean CO2 estimates using a machine-learning ensemble (CSIR-ML6 version 2019a) - have we hit the wall?
title_short A comparative assessment of the uncertainties of global surface ocean CO2 estimates using a machine-learning ensemble (CSIR-ML6 version 2019a) - have we hit the wall?
title_full A comparative assessment of the uncertainties of global surface ocean CO2 estimates using a machine-learning ensemble (CSIR-ML6 version 2019a) - have we hit the wall?
title_fullStr A comparative assessment of the uncertainties of global surface ocean CO2 estimates using a machine-learning ensemble (CSIR-ML6 version 2019a) - have we hit the wall?
title_full_unstemmed A comparative assessment of the uncertainties of global surface ocean CO2 estimates using a machine-learning ensemble (CSIR-ML6 version 2019a) - have we hit the wall?
title_sort comparative assessment of the uncertainties of global surface ocean co2 estimates using a machine-learning ensemble (csir-ml6 version 2019a) - have we hit the wall?
publisher Copernicus
publishDate 2019
url https://hdl.handle.net/20.500.11850/389172
https://doi.org/10.3929/ethz-b-000387134
geographic Pacific
Southern Ocean
geographic_facet Pacific
Southern Ocean
genre Southern Ocean
genre_facet Southern Ocean
op_source Geoscientific Model Development, 12 (12)
op_relation info:eu-repo/semantics/altIdentifier/doi/10.5194/gmd-12-5113-2019
info:eu-repo/semantics/altIdentifier/wos/000502280600001
http://hdl.handle.net/20.500.11850/389172
doi:10.3929/ethz-b-000387134
op_rights info:eu-repo/semantics/openAccess
http://creativecommons.org/licenses/by/4.0/
Creative Commons Attribution 4.0 International
op_rightsnorm CC-BY
op_doi https://doi.org/20.500.11850/389172
https://doi.org/10.3929/ethz-b-000387134
https://doi.org/10.5194/gmd-12-5113-2019
_version_ 1766207608290541568