IPB-MSA&SO4: a daily 0.25° resolution dataset of in situ-produced biogenic methanesulfonic acid and sulfate over the North Atlantic during 1998–2022 based on machine learning

Accurate long-term marine-derived biogenic sulfur aerosol concentrations at high spatial and temporal resolutions are critical for a wide range of studies, including climatology, trend analysis, and model evaluation; this information is also imperative for the accurate investigation of the contribut...

Full description

Bibliographic Details
Published in:Earth System Science Data
Main Authors: Mansour, Karam, Decesari, Stefano, Ceburnis, Darius, Ovadnevaite, Jurgita, Russell, Lynn M., Paglione, Marco, Poulain, Laurent, Huang, Shan, O'Dowd, Colin, Rinaldi, Matteo
Format: Text
Language:English
Published: 2024
Subjects:
Online Access:https://doi.org/10.5194/essd-16-2717-2024
https://essd.copernicus.org/articles/16/2717/2024/
id ftcopernicus:oai:publications.copernicus.org:essd114562
record_format openpolar
spelling ftcopernicus:oai:publications.copernicus.org:essd114562 2024-09-15T18:22:17+00:00 IPB-MSA&amp;SO4: a daily 0.25° resolution dataset of in situ-produced biogenic methanesulfonic acid and sulfate over the North Atlantic during 1998–2022 based on machine learning Mansour, Karam Decesari, Stefano Ceburnis, Darius Ovadnevaite, Jurgita Russell, Lynn M. Paglione, Marco Poulain, Laurent Huang, Shan O'Dowd, Colin Rinaldi, Matteo 2024-06-12 application/pdf https://doi.org/10.5194/essd-16-2717-2024 https://essd.copernicus.org/articles/16/2717/2024/ eng eng doi:10.5194/essd-16-2717-2024 https://essd.copernicus.org/articles/16/2717/2024/ eISSN: 1866-3516 Text 2024 ftcopernicus https://doi.org/10.5194/essd-16-2717-2024 2024-08-28T05:24:22Z Accurate long-term marine-derived biogenic sulfur aerosol concentrations at high spatial and temporal resolutions are critical for a wide range of studies, including climatology, trend analysis, and model evaluation; this information is also imperative for the accurate investigation of the contribution of marine-derived biogenic sulfur aerosol concentrations to the aerosol burden, for the elucidation of their radiative impacts, and to provide boundary conditions for regional models. By applying machine learning algorithms, we constructed the first publicly available daily gridded dataset of in situ-produced biogenic methanesulfonic acid (MSA) and non-sea-salt sulfate (nss-SO 4 = <svg:svg xmlns:svg="http://www.w3.org/2000/svg" width="8pt" height="14pt" class="svg-formula" dspmath="mathimg" md5hash="681c12b7ebc0d0a218afd389f448be6c"><svg:image xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="essd-16-2717-2024-ie00001.svg" width="8pt" height="14pt" src="essd-16-2717-2024-ie00001.png"/> </svg:svg> ) concentrations covering the North Atlantic. The dataset is of high spatial resolution (0.25° × 0.25°) and spans 25 years (1998–2022), far exceeding what observations alone could achieve both spatially and temporally. The machine learning models were generated by combining in situ observations of sulfur aerosol data from Mace Head Atmospheric Research Station, located on the west coast of Ireland, and from the North Atlantic Aerosols and Marine Ecosystems Study (NAAMES) cruises in the northwestern Atlantic with the constructed sea-to-air dimethylsulfide flux ( F DMS ) and ECMWF ERA5 reanalysis datasets. To determine the optimal method for regression, we employed five machine learning model types: support vector machines, decision tree, regression ensemble, Gaussian process regression, and artificial neural networks. A comparison of the mean absolute error (MAE), root-mean-square error (RMSE), and coefficient of determination ( R 2 ) revealed that Gaussian process regression (GPR) was the most ... Text North Atlantic Copernicus Publications: E-Journals Earth System Science Data 16 6 2717 2740
institution Open Polar
collection Copernicus Publications: E-Journals
op_collection_id ftcopernicus
language English
description Accurate long-term marine-derived biogenic sulfur aerosol concentrations at high spatial and temporal resolutions are critical for a wide range of studies, including climatology, trend analysis, and model evaluation; this information is also imperative for the accurate investigation of the contribution of marine-derived biogenic sulfur aerosol concentrations to the aerosol burden, for the elucidation of their radiative impacts, and to provide boundary conditions for regional models. By applying machine learning algorithms, we constructed the first publicly available daily gridded dataset of in situ-produced biogenic methanesulfonic acid (MSA) and non-sea-salt sulfate (nss-SO 4 = <svg:svg xmlns:svg="http://www.w3.org/2000/svg" width="8pt" height="14pt" class="svg-formula" dspmath="mathimg" md5hash="681c12b7ebc0d0a218afd389f448be6c"><svg:image xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="essd-16-2717-2024-ie00001.svg" width="8pt" height="14pt" src="essd-16-2717-2024-ie00001.png"/> </svg:svg> ) concentrations covering the North Atlantic. The dataset is of high spatial resolution (0.25° × 0.25°) and spans 25 years (1998–2022), far exceeding what observations alone could achieve both spatially and temporally. The machine learning models were generated by combining in situ observations of sulfur aerosol data from Mace Head Atmospheric Research Station, located on the west coast of Ireland, and from the North Atlantic Aerosols and Marine Ecosystems Study (NAAMES) cruises in the northwestern Atlantic with the constructed sea-to-air dimethylsulfide flux ( F DMS ) and ECMWF ERA5 reanalysis datasets. To determine the optimal method for regression, we employed five machine learning model types: support vector machines, decision tree, regression ensemble, Gaussian process regression, and artificial neural networks. A comparison of the mean absolute error (MAE), root-mean-square error (RMSE), and coefficient of determination ( R 2 ) revealed that Gaussian process regression (GPR) was the most ...
format Text
author Mansour, Karam
Decesari, Stefano
Ceburnis, Darius
Ovadnevaite, Jurgita
Russell, Lynn M.
Paglione, Marco
Poulain, Laurent
Huang, Shan
O'Dowd, Colin
Rinaldi, Matteo
spellingShingle Mansour, Karam
Decesari, Stefano
Ceburnis, Darius
Ovadnevaite, Jurgita
Russell, Lynn M.
Paglione, Marco
Poulain, Laurent
Huang, Shan
O'Dowd, Colin
Rinaldi, Matteo
IPB-MSA&amp;SO4: a daily 0.25° resolution dataset of in situ-produced biogenic methanesulfonic acid and sulfate over the North Atlantic during 1998–2022 based on machine learning
author_facet Mansour, Karam
Decesari, Stefano
Ceburnis, Darius
Ovadnevaite, Jurgita
Russell, Lynn M.
Paglione, Marco
Poulain, Laurent
Huang, Shan
O'Dowd, Colin
Rinaldi, Matteo
author_sort Mansour, Karam
title IPB-MSA&amp;SO4: a daily 0.25° resolution dataset of in situ-produced biogenic methanesulfonic acid and sulfate over the North Atlantic during 1998–2022 based on machine learning
title_short IPB-MSA&amp;SO4: a daily 0.25° resolution dataset of in situ-produced biogenic methanesulfonic acid and sulfate over the North Atlantic during 1998–2022 based on machine learning
title_full IPB-MSA&amp;SO4: a daily 0.25° resolution dataset of in situ-produced biogenic methanesulfonic acid and sulfate over the North Atlantic during 1998–2022 based on machine learning
title_fullStr IPB-MSA&amp;SO4: a daily 0.25° resolution dataset of in situ-produced biogenic methanesulfonic acid and sulfate over the North Atlantic during 1998–2022 based on machine learning
title_full_unstemmed IPB-MSA&amp;SO4: a daily 0.25° resolution dataset of in situ-produced biogenic methanesulfonic acid and sulfate over the North Atlantic during 1998–2022 based on machine learning
title_sort ipb-msa&amp;so4: a daily 0.25° resolution dataset of in situ-produced biogenic methanesulfonic acid and sulfate over the north atlantic during 1998–2022 based on machine learning
publishDate 2024
url https://doi.org/10.5194/essd-16-2717-2024
https://essd.copernicus.org/articles/16/2717/2024/
genre North Atlantic
genre_facet North Atlantic
op_source eISSN: 1866-3516
op_relation doi:10.5194/essd-16-2717-2024
https://essd.copernicus.org/articles/16/2717/2024/
op_doi https://doi.org/10.5194/essd-16-2717-2024
container_title Earth System Science Data
container_volume 16
container_issue 6
container_start_page 2717
op_container_end_page 2740
_version_ 1810461923044491264