How should we aggregate data? Methods accounting for the numerical distributions, with an assessment of aerosol optical depth

Many applications of geophysical data – whether from surface observations, satellite retrievals, or model simulations – rely on aggregates produced at coarser spatial (e.g. degrees) and/or temporal (e.g. daily and monthly) resolution than the highest available from the technique. Almost all of these...

Full description

Bibliographic Details
Published in:Atmospheric Chemistry and Physics
Main Authors: A. M. Sayer, K. D. Knobelspiesse
Format: Article in Journal/Newspaper
Language:English
Published: Copernicus Publications 2019
Subjects:
Online Access:https://doi.org/10.5194/acp-19-15023-2019
https://doaj.org/article/731825c70f9f46329ae88525fd8fc6ca
id ftdoajarticles:oai:doaj.org/article:731825c70f9f46329ae88525fd8fc6ca
record_format openpolar
spelling ftdoajarticles:oai:doaj.org/article:731825c70f9f46329ae88525fd8fc6ca 2023-05-15T13:06:45+02:00 How should we aggregate data? Methods accounting for the numerical distributions, with an assessment of aerosol optical depth A. M. Sayer K. D. Knobelspiesse 2019-12-01T00:00:00Z https://doi.org/10.5194/acp-19-15023-2019 https://doaj.org/article/731825c70f9f46329ae88525fd8fc6ca EN eng Copernicus Publications https://www.atmos-chem-phys.net/19/15023/2019/acp-19-15023-2019.pdf https://doaj.org/toc/1680-7316 https://doaj.org/toc/1680-7324 doi:10.5194/acp-19-15023-2019 1680-7316 1680-7324 https://doaj.org/article/731825c70f9f46329ae88525fd8fc6ca Atmospheric Chemistry and Physics, Vol 19, Pp 15023-15048 (2019) Physics QC1-999 Chemistry QD1-999 article 2019 ftdoajarticles https://doi.org/10.5194/acp-19-15023-2019 2022-12-31T03:49:43Z Many applications of geophysical data – whether from surface observations, satellite retrievals, or model simulations – rely on aggregates produced at coarser spatial (e.g. degrees) and/or temporal (e.g. daily and monthly) resolution than the highest available from the technique. Almost all of these aggregates report the arithmetic mean and standard deviation as summary statistics, which are what data users employ in their analyses. These statistics are most meaningful for normally distributed data; however, for some quantities, such as aerosol optical depth (AOD), it is well-known that distributions are on large scales closer to log-normal, for which a geometric mean and standard deviation would be more appropriate. This study presents a method of assessing whether a given sample of data is more consistent with an underlying normal or log-normal distribution, using the Shapiro–Wilk test, and tests AOD frequency distributions on spatial scales of 1 ∘ and daily, monthly, and seasonal temporal scales. A broadly consistent picture is observed using Aerosol Robotic Network (AERONET), Multiangle Imaging SpectroRadiometer (MISR), Moderate Resolution Imagining Spectroradiometer (MODIS), and Goddard Earth Observing System Version 5 Nature Run (G5NR) data. These data sets are complementary: AERONET has the highest AOD accuracy but is sparse, and MISR and MODIS represent different satellite retrieval techniques and sampling. As a model simulation, G5NR is spatiotemporally complete. As timescales increase from days to months to seasons, data become increasingly more consistent with log-normal than normal distributions, and the differences between arithmetic- and geometric-mean AOD become larger, with geometric mean becoming systematically smaller. Assuming normality systematically overstates both the typical level of AOD and its variability. There is considerable regional heterogeneity in the results: in low-AOD regions such as the open ocean and mountains, often the AOD difference is small enough ( <0.01 ) to be ... Article in Journal/Newspaper Aerosol Robotic Network Directory of Open Access Journals: DOAJ Articles Atmospheric Chemistry and Physics 19 23 15023 15048
institution Open Polar
collection Directory of Open Access Journals: DOAJ Articles
op_collection_id ftdoajarticles
language English
topic Physics
QC1-999
Chemistry
QD1-999
spellingShingle Physics
QC1-999
Chemistry
QD1-999
A. M. Sayer
K. D. Knobelspiesse
How should we aggregate data? Methods accounting for the numerical distributions, with an assessment of aerosol optical depth
topic_facet Physics
QC1-999
Chemistry
QD1-999
description Many applications of geophysical data – whether from surface observations, satellite retrievals, or model simulations – rely on aggregates produced at coarser spatial (e.g. degrees) and/or temporal (e.g. daily and monthly) resolution than the highest available from the technique. Almost all of these aggregates report the arithmetic mean and standard deviation as summary statistics, which are what data users employ in their analyses. These statistics are most meaningful for normally distributed data; however, for some quantities, such as aerosol optical depth (AOD), it is well-known that distributions are on large scales closer to log-normal, for which a geometric mean and standard deviation would be more appropriate. This study presents a method of assessing whether a given sample of data is more consistent with an underlying normal or log-normal distribution, using the Shapiro–Wilk test, and tests AOD frequency distributions on spatial scales of 1 ∘ and daily, monthly, and seasonal temporal scales. A broadly consistent picture is observed using Aerosol Robotic Network (AERONET), Multiangle Imaging SpectroRadiometer (MISR), Moderate Resolution Imagining Spectroradiometer (MODIS), and Goddard Earth Observing System Version 5 Nature Run (G5NR) data. These data sets are complementary: AERONET has the highest AOD accuracy but is sparse, and MISR and MODIS represent different satellite retrieval techniques and sampling. As a model simulation, G5NR is spatiotemporally complete. As timescales increase from days to months to seasons, data become increasingly more consistent with log-normal than normal distributions, and the differences between arithmetic- and geometric-mean AOD become larger, with geometric mean becoming systematically smaller. Assuming normality systematically overstates both the typical level of AOD and its variability. There is considerable regional heterogeneity in the results: in low-AOD regions such as the open ocean and mountains, often the AOD difference is small enough ( <0.01 ) to be ...
format Article in Journal/Newspaper
author A. M. Sayer
K. D. Knobelspiesse
author_facet A. M. Sayer
K. D. Knobelspiesse
author_sort A. M. Sayer
title How should we aggregate data? Methods accounting for the numerical distributions, with an assessment of aerosol optical depth
title_short How should we aggregate data? Methods accounting for the numerical distributions, with an assessment of aerosol optical depth
title_full How should we aggregate data? Methods accounting for the numerical distributions, with an assessment of aerosol optical depth
title_fullStr How should we aggregate data? Methods accounting for the numerical distributions, with an assessment of aerosol optical depth
title_full_unstemmed How should we aggregate data? Methods accounting for the numerical distributions, with an assessment of aerosol optical depth
title_sort how should we aggregate data? methods accounting for the numerical distributions, with an assessment of aerosol optical depth
publisher Copernicus Publications
publishDate 2019
url https://doi.org/10.5194/acp-19-15023-2019
https://doaj.org/article/731825c70f9f46329ae88525fd8fc6ca
genre Aerosol Robotic Network
genre_facet Aerosol Robotic Network
op_source Atmospheric Chemistry and Physics, Vol 19, Pp 15023-15048 (2019)
op_relation https://www.atmos-chem-phys.net/19/15023/2019/acp-19-15023-2019.pdf
https://doaj.org/toc/1680-7316
https://doaj.org/toc/1680-7324
doi:10.5194/acp-19-15023-2019
1680-7316
1680-7324
https://doaj.org/article/731825c70f9f46329ae88525fd8fc6ca
op_doi https://doi.org/10.5194/acp-19-15023-2019
container_title Atmospheric Chemistry and Physics
container_volume 19
container_issue 23
container_start_page 15023
op_container_end_page 15048
_version_ 1766019423172296704