LGHAP: the Long-term Gap-free High-resolution Air Pollutant concentration dataset, derived via tensor-flow-based multimodal data fusion

Developing a big data analytics framework for generating the Long-term Gap-free High-resolution Air Pollutant concentration dataset (abbreviated as LGHAP) is of great significance for environmental management and Earth system science analysis. By synergistically integrating multimodal aerosol data a...

Full description

Bibliographic Details
Published in:Earth System Science Data
Main Authors: Bai, Kaixu, Li, Ke, Ma, Mingliang, Li, Kaitao, Li, Zhengqiang, Guo, Jianping, Chang, Ni-Bin, Tan, Zhuo, Han, Di
Format: Article in Journal/Newspaper
Language:English
Published: Copernicus Publications 2022
Subjects:
Online Access:https://doi.org/10.5194/essd-14-907-2022
https://noa.gwlb.de/receive/cop_mods_00060253
https://noa.gwlb.de/servlets/MCRFileNodeServlet/cop_derivate_00059900/essd-14-907-2022.pdf
https://essd.copernicus.org/articles/14/907/2022/essd-14-907-2022.pdf
id ftnonlinearchiv:oai:noa.gwlb.de:cop_mods_00060253
record_format openpolar
spelling ftnonlinearchiv:oai:noa.gwlb.de:cop_mods_00060253 2023-05-15T13:07:15+02:00 LGHAP: the Long-term Gap-free High-resolution Air Pollutant concentration dataset, derived via tensor-flow-based multimodal data fusion Bai, Kaixu Li, Ke Ma, Mingliang Li, Kaitao Li, Zhengqiang Guo, Jianping Chang, Ni-Bin Tan, Zhuo Han, Di 2022-02 electronic https://doi.org/10.5194/essd-14-907-2022 https://noa.gwlb.de/receive/cop_mods_00060253 https://noa.gwlb.de/servlets/MCRFileNodeServlet/cop_derivate_00059900/essd-14-907-2022.pdf https://essd.copernicus.org/articles/14/907/2022/essd-14-907-2022.pdf eng eng Copernicus Publications Earth System Science Data -- http://www.earth-syst-sci-data.net/volumes_and_issues.html -- http://www.bibliothek.uni-regensburg.de/ezeit/?2475469 -- 1866-3516 https://doi.org/10.5194/essd-14-907-2022 https://noa.gwlb.de/receive/cop_mods_00060253 https://noa.gwlb.de/servlets/MCRFileNodeServlet/cop_derivate_00059900/essd-14-907-2022.pdf https://essd.copernicus.org/articles/14/907/2022/essd-14-907-2022.pdf https://creativecommons.org/licenses/by/4.0/ uneingeschränkt info:eu-repo/semantics/openAccess CC-BY article Verlagsveröffentlichung article Text doc-type:article 2022 ftnonlinearchiv https://doi.org/10.5194/essd-14-907-2022 2022-02-28T00:09:07Z Developing a big data analytics framework for generating the Long-term Gap-free High-resolution Air Pollutant concentration dataset (abbreviated as LGHAP) is of great significance for environmental management and Earth system science analysis. By synergistically integrating multimodal aerosol data acquired from diverse sources via a tensor-flow-based data fusion method, a gap-free aerosol optical depth (AOD) dataset with a daily 1 km resolution covering the period of 2000–2020 in China was generated. Specifically, data gaps in daily AOD imageries from the Moderate Resolution Imaging Spectroradiometer (MODIS) aboard Terra were reconstructed based on a set of AOD data tensors acquired from diverse satellites, numerical analysis, and in situ air quality measurements via integrative efforts of spatial pattern recognition for high-dimensional gridded image analysis and knowledge transfer in statistical data mining. To our knowledge, this is the first long-term gap-free high-resolution AOD dataset in China, from which spatially contiguous PM2.5 and PM10 concentrations were then estimated using an ensemble learning approach. Ground validation results indicate that the LGHAP AOD data are in good agreement with in situ AOD observations from the Aerosol Robotic Network (AERONET), with an R of 0.91 and RMSE equaling 0.21. Meanwhile, PM2.5 and PM10 estimations also agreed well with ground measurements, with R values of 0.95 and 0.94 and RMSEs of 12.03 and 19.56 µg m−3, respectively. The LGHAP provides a suite of long-term gap-free gridded maps with a high resolution to better examine aerosol changes in China over the past 2 decades, from which three major variation periods of haze pollution in China were revealed. Additionally, the proportion of the population exposed to unhealthy PM2.5 increased from 50.60 % in 2000 to 63.81 % in 2014 across China, which was then reduced drastically to 34.03 % in 2020. Overall, the generated LGHAP dataset has great potential to trigger multidisciplinary applications in Earth observations, climate change, public health, ecosystem assessment, and environmental management. The daily resolution AOD, PM2.5, and PM10 datasets are publicly available at https://doi.org/10.5281/zenodo.5652257 (Bai et al., 2021a), https://doi.org/10.5281/zenodo.5652265 (Bai et al., 2021b), and https://doi.org/10.5281/zenodo.5652263 (Bai et al., 2021c), respectively. Monthly and annual datasets can be acquired from https://doi.org/10.5281/zenodo.5655797 (Bai et al., 2021d) and https://doi.org/10.5281/zenodo.5655807 (Bai et al., 2021e), respectively. Python, MATLAB, R, and IDL codes are also provided to help users read and visualize these data. Article in Journal/Newspaper Aerosol Robotic Network Niedersächsisches Online-Archiv NOA Earth System Science Data 14 2 907 927
institution Open Polar
collection Niedersächsisches Online-Archiv NOA
op_collection_id ftnonlinearchiv
language English
topic article
Verlagsveröffentlichung
spellingShingle article
Verlagsveröffentlichung
Bai, Kaixu
Li, Ke
Ma, Mingliang
Li, Kaitao
Li, Zhengqiang
Guo, Jianping
Chang, Ni-Bin
Tan, Zhuo
Han, Di
LGHAP: the Long-term Gap-free High-resolution Air Pollutant concentration dataset, derived via tensor-flow-based multimodal data fusion
topic_facet article
Verlagsveröffentlichung
description Developing a big data analytics framework for generating the Long-term Gap-free High-resolution Air Pollutant concentration dataset (abbreviated as LGHAP) is of great significance for environmental management and Earth system science analysis. By synergistically integrating multimodal aerosol data acquired from diverse sources via a tensor-flow-based data fusion method, a gap-free aerosol optical depth (AOD) dataset with a daily 1 km resolution covering the period of 2000–2020 in China was generated. Specifically, data gaps in daily AOD imageries from the Moderate Resolution Imaging Spectroradiometer (MODIS) aboard Terra were reconstructed based on a set of AOD data tensors acquired from diverse satellites, numerical analysis, and in situ air quality measurements via integrative efforts of spatial pattern recognition for high-dimensional gridded image analysis and knowledge transfer in statistical data mining. To our knowledge, this is the first long-term gap-free high-resolution AOD dataset in China, from which spatially contiguous PM2.5 and PM10 concentrations were then estimated using an ensemble learning approach. Ground validation results indicate that the LGHAP AOD data are in good agreement with in situ AOD observations from the Aerosol Robotic Network (AERONET), with an R of 0.91 and RMSE equaling 0.21. Meanwhile, PM2.5 and PM10 estimations also agreed well with ground measurements, with R values of 0.95 and 0.94 and RMSEs of 12.03 and 19.56 µg m−3, respectively. The LGHAP provides a suite of long-term gap-free gridded maps with a high resolution to better examine aerosol changes in China over the past 2 decades, from which three major variation periods of haze pollution in China were revealed. Additionally, the proportion of the population exposed to unhealthy PM2.5 increased from 50.60 % in 2000 to 63.81 % in 2014 across China, which was then reduced drastically to 34.03 % in 2020. Overall, the generated LGHAP dataset has great potential to trigger multidisciplinary applications in Earth observations, climate change, public health, ecosystem assessment, and environmental management. The daily resolution AOD, PM2.5, and PM10 datasets are publicly available at https://doi.org/10.5281/zenodo.5652257 (Bai et al., 2021a), https://doi.org/10.5281/zenodo.5652265 (Bai et al., 2021b), and https://doi.org/10.5281/zenodo.5652263 (Bai et al., 2021c), respectively. Monthly and annual datasets can be acquired from https://doi.org/10.5281/zenodo.5655797 (Bai et al., 2021d) and https://doi.org/10.5281/zenodo.5655807 (Bai et al., 2021e), respectively. Python, MATLAB, R, and IDL codes are also provided to help users read and visualize these data.
format Article in Journal/Newspaper
author Bai, Kaixu
Li, Ke
Ma, Mingliang
Li, Kaitao
Li, Zhengqiang
Guo, Jianping
Chang, Ni-Bin
Tan, Zhuo
Han, Di
author_facet Bai, Kaixu
Li, Ke
Ma, Mingliang
Li, Kaitao
Li, Zhengqiang
Guo, Jianping
Chang, Ni-Bin
Tan, Zhuo
Han, Di
author_sort Bai, Kaixu
title LGHAP: the Long-term Gap-free High-resolution Air Pollutant concentration dataset, derived via tensor-flow-based multimodal data fusion
title_short LGHAP: the Long-term Gap-free High-resolution Air Pollutant concentration dataset, derived via tensor-flow-based multimodal data fusion
title_full LGHAP: the Long-term Gap-free High-resolution Air Pollutant concentration dataset, derived via tensor-flow-based multimodal data fusion
title_fullStr LGHAP: the Long-term Gap-free High-resolution Air Pollutant concentration dataset, derived via tensor-flow-based multimodal data fusion
title_full_unstemmed LGHAP: the Long-term Gap-free High-resolution Air Pollutant concentration dataset, derived via tensor-flow-based multimodal data fusion
title_sort lghap: the long-term gap-free high-resolution air pollutant concentration dataset, derived via tensor-flow-based multimodal data fusion
publisher Copernicus Publications
publishDate 2022
url https://doi.org/10.5194/essd-14-907-2022
https://noa.gwlb.de/receive/cop_mods_00060253
https://noa.gwlb.de/servlets/MCRFileNodeServlet/cop_derivate_00059900/essd-14-907-2022.pdf
https://essd.copernicus.org/articles/14/907/2022/essd-14-907-2022.pdf
genre Aerosol Robotic Network
genre_facet Aerosol Robotic Network
op_relation Earth System Science Data -- http://www.earth-syst-sci-data.net/volumes_and_issues.html -- http://www.bibliothek.uni-regensburg.de/ezeit/?2475469 -- 1866-3516
https://doi.org/10.5194/essd-14-907-2022
https://noa.gwlb.de/receive/cop_mods_00060253
https://noa.gwlb.de/servlets/MCRFileNodeServlet/cop_derivate_00059900/essd-14-907-2022.pdf
https://essd.copernicus.org/articles/14/907/2022/essd-14-907-2022.pdf
op_rights https://creativecommons.org/licenses/by/4.0/
uneingeschränkt
info:eu-repo/semantics/openAccess
op_rightsnorm CC-BY
op_doi https://doi.org/10.5194/essd-14-907-2022
container_title Earth System Science Data
container_volume 14
container_issue 2
container_start_page 907
op_container_end_page 927
_version_ 1766042893482459136