How should we aggregate data? Methods accounting for the numerical distributions, with an assessment of aerosol optical depth
Many applications of geophysical data – whether from surface observations, satellite retrievals, or model simulations – rely on aggregates produced at coarser spatial (e.g. degrees) and/or temporal (e.g. daily and monthly) resolution than the highest available from the technique. Almost all of these...
Published in: | Atmospheric Chemistry and Physics |
---|---|
Main Authors: | , |
Format: | Article in Journal/Newspaper |
Language: | English |
Published: |
Copernicus Publications
2019
|
Subjects: | |
Online Access: | https://doi.org/10.5194/acp-19-15023-2019 https://doaj.org/article/731825c70f9f46329ae88525fd8fc6ca |
id |
ftdoajarticles:oai:doaj.org/article:731825c70f9f46329ae88525fd8fc6ca |
---|---|
record_format |
openpolar |
spelling |
ftdoajarticles:oai:doaj.org/article:731825c70f9f46329ae88525fd8fc6ca 2023-05-15T13:06:45+02:00 How should we aggregate data? Methods accounting for the numerical distributions, with an assessment of aerosol optical depth A. M. Sayer K. D. Knobelspiesse 2019-12-01T00:00:00Z https://doi.org/10.5194/acp-19-15023-2019 https://doaj.org/article/731825c70f9f46329ae88525fd8fc6ca EN eng Copernicus Publications https://www.atmos-chem-phys.net/19/15023/2019/acp-19-15023-2019.pdf https://doaj.org/toc/1680-7316 https://doaj.org/toc/1680-7324 doi:10.5194/acp-19-15023-2019 1680-7316 1680-7324 https://doaj.org/article/731825c70f9f46329ae88525fd8fc6ca Atmospheric Chemistry and Physics, Vol 19, Pp 15023-15048 (2019) Physics QC1-999 Chemistry QD1-999 article 2019 ftdoajarticles https://doi.org/10.5194/acp-19-15023-2019 2022-12-31T03:49:43Z Many applications of geophysical data – whether from surface observations, satellite retrievals, or model simulations – rely on aggregates produced at coarser spatial (e.g. degrees) and/or temporal (e.g. daily and monthly) resolution than the highest available from the technique. Almost all of these aggregates report the arithmetic mean and standard deviation as summary statistics, which are what data users employ in their analyses. These statistics are most meaningful for normally distributed data; however, for some quantities, such as aerosol optical depth (AOD), it is well-known that distributions are on large scales closer to log-normal, for which a geometric mean and standard deviation would be more appropriate. This study presents a method of assessing whether a given sample of data is more consistent with an underlying normal or log-normal distribution, using the Shapiro–Wilk test, and tests AOD frequency distributions on spatial scales of 1 ∘ and daily, monthly, and seasonal temporal scales. A broadly consistent picture is observed using Aerosol Robotic Network (AERONET), Multiangle Imaging SpectroRadiometer (MISR), Moderate Resolution Imagining Spectroradiometer (MODIS), and Goddard Earth Observing System Version 5 Nature Run (G5NR) data. These data sets are complementary: AERONET has the highest AOD accuracy but is sparse, and MISR and MODIS represent different satellite retrieval techniques and sampling. As a model simulation, G5NR is spatiotemporally complete. As timescales increase from days to months to seasons, data become increasingly more consistent with log-normal than normal distributions, and the differences between arithmetic- and geometric-mean AOD become larger, with geometric mean becoming systematically smaller. Assuming normality systematically overstates both the typical level of AOD and its variability. There is considerable regional heterogeneity in the results: in low-AOD regions such as the open ocean and mountains, often the AOD difference is small enough ( <0.01 ) to be ... Article in Journal/Newspaper Aerosol Robotic Network Directory of Open Access Journals: DOAJ Articles Atmospheric Chemistry and Physics 19 23 15023 15048 |
institution |
Open Polar |
collection |
Directory of Open Access Journals: DOAJ Articles |
op_collection_id |
ftdoajarticles |
language |
English |
topic |
Physics QC1-999 Chemistry QD1-999 |
spellingShingle |
Physics QC1-999 Chemistry QD1-999 A. M. Sayer K. D. Knobelspiesse How should we aggregate data? Methods accounting for the numerical distributions, with an assessment of aerosol optical depth |
topic_facet |
Physics QC1-999 Chemistry QD1-999 |
description |
Many applications of geophysical data – whether from surface observations, satellite retrievals, or model simulations – rely on aggregates produced at coarser spatial (e.g. degrees) and/or temporal (e.g. daily and monthly) resolution than the highest available from the technique. Almost all of these aggregates report the arithmetic mean and standard deviation as summary statistics, which are what data users employ in their analyses. These statistics are most meaningful for normally distributed data; however, for some quantities, such as aerosol optical depth (AOD), it is well-known that distributions are on large scales closer to log-normal, for which a geometric mean and standard deviation would be more appropriate. This study presents a method of assessing whether a given sample of data is more consistent with an underlying normal or log-normal distribution, using the Shapiro–Wilk test, and tests AOD frequency distributions on spatial scales of 1 ∘ and daily, monthly, and seasonal temporal scales. A broadly consistent picture is observed using Aerosol Robotic Network (AERONET), Multiangle Imaging SpectroRadiometer (MISR), Moderate Resolution Imagining Spectroradiometer (MODIS), and Goddard Earth Observing System Version 5 Nature Run (G5NR) data. These data sets are complementary: AERONET has the highest AOD accuracy but is sparse, and MISR and MODIS represent different satellite retrieval techniques and sampling. As a model simulation, G5NR is spatiotemporally complete. As timescales increase from days to months to seasons, data become increasingly more consistent with log-normal than normal distributions, and the differences between arithmetic- and geometric-mean AOD become larger, with geometric mean becoming systematically smaller. Assuming normality systematically overstates both the typical level of AOD and its variability. There is considerable regional heterogeneity in the results: in low-AOD regions such as the open ocean and mountains, often the AOD difference is small enough ( <0.01 ) to be ... |
format |
Article in Journal/Newspaper |
author |
A. M. Sayer K. D. Knobelspiesse |
author_facet |
A. M. Sayer K. D. Knobelspiesse |
author_sort |
A. M. Sayer |
title |
How should we aggregate data? Methods accounting for the numerical distributions, with an assessment of aerosol optical depth |
title_short |
How should we aggregate data? Methods accounting for the numerical distributions, with an assessment of aerosol optical depth |
title_full |
How should we aggregate data? Methods accounting for the numerical distributions, with an assessment of aerosol optical depth |
title_fullStr |
How should we aggregate data? Methods accounting for the numerical distributions, with an assessment of aerosol optical depth |
title_full_unstemmed |
How should we aggregate data? Methods accounting for the numerical distributions, with an assessment of aerosol optical depth |
title_sort |
how should we aggregate data? methods accounting for the numerical distributions, with an assessment of aerosol optical depth |
publisher |
Copernicus Publications |
publishDate |
2019 |
url |
https://doi.org/10.5194/acp-19-15023-2019 https://doaj.org/article/731825c70f9f46329ae88525fd8fc6ca |
genre |
Aerosol Robotic Network |
genre_facet |
Aerosol Robotic Network |
op_source |
Atmospheric Chemistry and Physics, Vol 19, Pp 15023-15048 (2019) |
op_relation |
https://www.atmos-chem-phys.net/19/15023/2019/acp-19-15023-2019.pdf https://doaj.org/toc/1680-7316 https://doaj.org/toc/1680-7324 doi:10.5194/acp-19-15023-2019 1680-7316 1680-7324 https://doaj.org/article/731825c70f9f46329ae88525fd8fc6ca |
op_doi |
https://doi.org/10.5194/acp-19-15023-2019 |
container_title |
Atmospheric Chemistry and Physics |
container_volume |
19 |
container_issue |
23 |
container_start_page |
15023 |
op_container_end_page |
15048 |
_version_ |
1766019423172296704 |