IPB-MSA&SO4: a daily 0.25° resolution dataset of In-situ Produced Biogenic Methanesulfonic Acid and Sulfate over the North Atlantic during 1998–2022 based on machine learning

Accurate long-term marine-derived biogenic sulfur aerosol concentrations at high spatial and temporal resolutions are critical for a wide range of studies including climatology, trend analysis, model evaluation, accurate investigation of their contribution to aerosol burden, or to elucidate their ra...

Full description

Bibliographic Details
Main Authors: Mansour, Karam, Decesari, Stefano, Ceburnis, Darius, Ovadnevaite, Jurgita, Russell, Lynn M., Paglione, Marco, Poulain, Laurent, Huang, Shan, O'Dowd, Colin, Rinaldi, Matteo
Format: Text
Language:English
Published: 2023
Subjects:
Online Access:https://doi.org/10.5194/essd-2023-352
https://essd.copernicus.org/preprints/essd-2023-352/
id ftcopernicus:oai:publications.copernicus.org:essdd114562
record_format openpolar
spelling ftcopernicus:oai:publications.copernicus.org:essdd114562 2024-01-07T09:45:03+01:00 IPB-MSA&SO4: a daily 0.25° resolution dataset of In-situ Produced Biogenic Methanesulfonic Acid and Sulfate over the North Atlantic during 1998–2022 based on machine learning Mansour, Karam Decesari, Stefano Ceburnis, Darius Ovadnevaite, Jurgita Russell, Lynn M. Paglione, Marco Poulain, Laurent Huang, Shan O'Dowd, Colin Rinaldi, Matteo 2023-12-06 application/pdf https://doi.org/10.5194/essd-2023-352 https://essd.copernicus.org/preprints/essd-2023-352/ eng eng doi:10.5194/essd-2023-352 https://essd.copernicus.org/preprints/essd-2023-352/ eISSN: 1866-3516 Text 2023 ftcopernicus https://doi.org/10.5194/essd-2023-352 2023-12-11T17:24:18Z Accurate long-term marine-derived biogenic sulfur aerosol concentrations at high spatial and temporal resolutions are critical for a wide range of studies including climatology, trend analysis, model evaluation, accurate investigation of their contribution to aerosol burden, or to elucidate their radiative impacts and to provide boundary conditions for regional models. By applying machine learning algorithms, we constructed the first, publicly available, daily gridded dataset of in-situ produced biogenic methanesulfonic acid (MSA) and sulfate (SO 4 ) concentrations covering the North Atlantic Ocean. The dataset is of high spatial resolution of 0.25° × 0.25°, spanning 25 years (1998–2022), far exceeding what observations alone could achieve both space- and time-wise. The machine learning models were generated by combining in-situ observations of sulfur aerosol data at Mace Head research station, west coast of Ireland, and from NAAMES cruises in the NW Atlantic, combined with the constructed sea-to-air dimethylsulfide flux (F DMS ) and ECMWF-ERA5 reanalysis datasets. To determine the optimal method for regression, we employed four machine learning model types: support vector machines, ensemble, Gaussian process, and artificial neural networks. A comparison of the mean absolute error (MAE), root mean square error (RMSE), and coefficient of determination (R 2 ) revealed that the Gaussian process regression (GPR) was the most effective algorithm, outperforming the other models in simulating the biogenic MSA and SO 4 concentrations. For predicting daily MSA (SO 4 ), GPR displayed the highest R 2 value of 0.86 (0.72) and the lowest MAE of 0.014 (0.10) µg m –3 . The GPR partial dependence analysis suggests that the relationships between predictors and MSA and SO 4 concentrations are complex rather than linear. Using the GPR algorithm, we produced a high-resolution daily dataset of In-situ Produced Biogenic MSA and SO 4 sea-level concentrations over the North Atlantic, which we named IPB-MSA&SO 4 . The obtained ... Text North Atlantic Copernicus Publications: E-Journals Mace ENVELOPE(155.883,155.883,-81.417,-81.417)
institution Open Polar
collection Copernicus Publications: E-Journals
op_collection_id ftcopernicus
language English
description Accurate long-term marine-derived biogenic sulfur aerosol concentrations at high spatial and temporal resolutions are critical for a wide range of studies including climatology, trend analysis, model evaluation, accurate investigation of their contribution to aerosol burden, or to elucidate their radiative impacts and to provide boundary conditions for regional models. By applying machine learning algorithms, we constructed the first, publicly available, daily gridded dataset of in-situ produced biogenic methanesulfonic acid (MSA) and sulfate (SO 4 ) concentrations covering the North Atlantic Ocean. The dataset is of high spatial resolution of 0.25° × 0.25°, spanning 25 years (1998–2022), far exceeding what observations alone could achieve both space- and time-wise. The machine learning models were generated by combining in-situ observations of sulfur aerosol data at Mace Head research station, west coast of Ireland, and from NAAMES cruises in the NW Atlantic, combined with the constructed sea-to-air dimethylsulfide flux (F DMS ) and ECMWF-ERA5 reanalysis datasets. To determine the optimal method for regression, we employed four machine learning model types: support vector machines, ensemble, Gaussian process, and artificial neural networks. A comparison of the mean absolute error (MAE), root mean square error (RMSE), and coefficient of determination (R 2 ) revealed that the Gaussian process regression (GPR) was the most effective algorithm, outperforming the other models in simulating the biogenic MSA and SO 4 concentrations. For predicting daily MSA (SO 4 ), GPR displayed the highest R 2 value of 0.86 (0.72) and the lowest MAE of 0.014 (0.10) µg m –3 . The GPR partial dependence analysis suggests that the relationships between predictors and MSA and SO 4 concentrations are complex rather than linear. Using the GPR algorithm, we produced a high-resolution daily dataset of In-situ Produced Biogenic MSA and SO 4 sea-level concentrations over the North Atlantic, which we named IPB-MSA&SO 4 . The obtained ...
format Text
author Mansour, Karam
Decesari, Stefano
Ceburnis, Darius
Ovadnevaite, Jurgita
Russell, Lynn M.
Paglione, Marco
Poulain, Laurent
Huang, Shan
O'Dowd, Colin
Rinaldi, Matteo
spellingShingle Mansour, Karam
Decesari, Stefano
Ceburnis, Darius
Ovadnevaite, Jurgita
Russell, Lynn M.
Paglione, Marco
Poulain, Laurent
Huang, Shan
O'Dowd, Colin
Rinaldi, Matteo
IPB-MSA&SO4: a daily 0.25° resolution dataset of In-situ Produced Biogenic Methanesulfonic Acid and Sulfate over the North Atlantic during 1998–2022 based on machine learning
author_facet Mansour, Karam
Decesari, Stefano
Ceburnis, Darius
Ovadnevaite, Jurgita
Russell, Lynn M.
Paglione, Marco
Poulain, Laurent
Huang, Shan
O'Dowd, Colin
Rinaldi, Matteo
author_sort Mansour, Karam
title IPB-MSA&SO4: a daily 0.25° resolution dataset of In-situ Produced Biogenic Methanesulfonic Acid and Sulfate over the North Atlantic during 1998–2022 based on machine learning
title_short IPB-MSA&SO4: a daily 0.25° resolution dataset of In-situ Produced Biogenic Methanesulfonic Acid and Sulfate over the North Atlantic during 1998–2022 based on machine learning
title_full IPB-MSA&SO4: a daily 0.25° resolution dataset of In-situ Produced Biogenic Methanesulfonic Acid and Sulfate over the North Atlantic during 1998–2022 based on machine learning
title_fullStr IPB-MSA&SO4: a daily 0.25° resolution dataset of In-situ Produced Biogenic Methanesulfonic Acid and Sulfate over the North Atlantic during 1998–2022 based on machine learning
title_full_unstemmed IPB-MSA&SO4: a daily 0.25° resolution dataset of In-situ Produced Biogenic Methanesulfonic Acid and Sulfate over the North Atlantic during 1998–2022 based on machine learning
title_sort ipb-msa&so4: a daily 0.25° resolution dataset of in-situ produced biogenic methanesulfonic acid and sulfate over the north atlantic during 1998–2022 based on machine learning
publishDate 2023
url https://doi.org/10.5194/essd-2023-352
https://essd.copernicus.org/preprints/essd-2023-352/
long_lat ENVELOPE(155.883,155.883,-81.417,-81.417)
geographic Mace
geographic_facet Mace
genre North Atlantic
genre_facet North Atlantic
op_source eISSN: 1866-3516
op_relation doi:10.5194/essd-2023-352
https://essd.copernicus.org/preprints/essd-2023-352/
op_doi https://doi.org/10.5194/essd-2023-352
_version_ 1787426492255305728