Automated identification of characteristic droplet size distributions in stratocumulus clouds utilizing a data clustering algorithm

Droplet-level interactions in clouds are often parameterized by a modified gamma fitted to a “global” droplet size distribution. Do “local” droplet size distributions of relevance to microphysical processes look like these average distributions? This paper describes an algorithm to search and classi...

Full description

Bibliographic Details
Published in:Artificial Intelligence for the Earth Systems
Main Authors: Allwayin, Nithin, Larsen, Michael L., Shaw, Alexander G., Shaw, Raymond A.
Language:unknown
Published: 2022
Subjects:
Online Access:http://www.osti.gov/servlets/purl/1884127
https://www.osti.gov/biblio/1884127
https://doi.org/10.1175/aies-d-22-0003.1
id ftosti:oai:osti.gov:1884127
record_format openpolar
spelling ftosti:oai:osti.gov:1884127 2023-07-30T04:05:33+02:00 Automated identification of characteristic droplet size distributions in stratocumulus clouds utilizing a data clustering algorithm Allwayin, Nithin Larsen, Michael L. Shaw, Alexander G. Shaw, Raymond A. 2022-09-22 application/pdf http://www.osti.gov/servlets/purl/1884127 https://www.osti.gov/biblio/1884127 https://doi.org/10.1175/aies-d-22-0003.1 unknown http://www.osti.gov/servlets/purl/1884127 https://www.osti.gov/biblio/1884127 https://doi.org/10.1175/aies-d-22-0003.1 doi:10.1175/aies-d-22-0003.1 54 ENVIRONMENTAL SCIENCES 2022 ftosti https://doi.org/10.1175/aies-d-22-0003.1 2023-07-11T10:14:26Z Droplet-level interactions in clouds are often parameterized by a modified gamma fitted to a “global” droplet size distribution. Do “local” droplet size distributions of relevance to microphysical processes look like these average distributions? This paper describes an algorithm to search and classify characteristic size distributions within a cloud. The approach combines hypothesis testing, specifically the Kolmogorov-Smirnov (KS) test, and a widely-used class of machine-learning algorithms for identifying clusters of samples with similar properties: Density-Based Spatial Clustering of Applications with Noise (DBSCAN) is used as the specific example for illustration. The two-sample KS test does not presume any specific distribution, is parameter free, and avoids biases from binning. Importantly, the number of clusters is not an input parameter of the DBSCAN-type algorithms, but is independently determined in an unsupervised fashion. As implemented, it works on an abstract space from the KS test results, and hence spatial correlation is not required for a cluster. The method is explored using data obtained from Holographic Detector for Clouds (HOLODEC) deployed during the Aerosol and Cloud Experiments in the Eastern North Atlantic (ACE-ENA) field campaign. The algorithm identifies evidence of the existence of clusters of nearly-identical local size distributions. It is found that cloud segments have as few as one and as many as seven characteristic size distributions. To validate the algorithm’s robustness, it is tested on a synthetic dataset and successfully identifies the predefined distributions at plausible noise levels. The algorithm is general and is expected to be useful in other applications, such as remote sensing of cloud and rain properties. Other/Unknown Material North Atlantic SciTec Connect (Office of Scientific and Technical Information - OSTI, U.S. Department of Energy) Artificial Intelligence for the Earth Systems 1 3
institution Open Polar
collection SciTec Connect (Office of Scientific and Technical Information - OSTI, U.S. Department of Energy)
op_collection_id ftosti
language unknown
topic 54 ENVIRONMENTAL SCIENCES
spellingShingle 54 ENVIRONMENTAL SCIENCES
Allwayin, Nithin
Larsen, Michael L.
Shaw, Alexander G.
Shaw, Raymond A.
Automated identification of characteristic droplet size distributions in stratocumulus clouds utilizing a data clustering algorithm
topic_facet 54 ENVIRONMENTAL SCIENCES
description Droplet-level interactions in clouds are often parameterized by a modified gamma fitted to a “global” droplet size distribution. Do “local” droplet size distributions of relevance to microphysical processes look like these average distributions? This paper describes an algorithm to search and classify characteristic size distributions within a cloud. The approach combines hypothesis testing, specifically the Kolmogorov-Smirnov (KS) test, and a widely-used class of machine-learning algorithms for identifying clusters of samples with similar properties: Density-Based Spatial Clustering of Applications with Noise (DBSCAN) is used as the specific example for illustration. The two-sample KS test does not presume any specific distribution, is parameter free, and avoids biases from binning. Importantly, the number of clusters is not an input parameter of the DBSCAN-type algorithms, but is independently determined in an unsupervised fashion. As implemented, it works on an abstract space from the KS test results, and hence spatial correlation is not required for a cluster. The method is explored using data obtained from Holographic Detector for Clouds (HOLODEC) deployed during the Aerosol and Cloud Experiments in the Eastern North Atlantic (ACE-ENA) field campaign. The algorithm identifies evidence of the existence of clusters of nearly-identical local size distributions. It is found that cloud segments have as few as one and as many as seven characteristic size distributions. To validate the algorithm’s robustness, it is tested on a synthetic dataset and successfully identifies the predefined distributions at plausible noise levels. The algorithm is general and is expected to be useful in other applications, such as remote sensing of cloud and rain properties.
author Allwayin, Nithin
Larsen, Michael L.
Shaw, Alexander G.
Shaw, Raymond A.
author_facet Allwayin, Nithin
Larsen, Michael L.
Shaw, Alexander G.
Shaw, Raymond A.
author_sort Allwayin, Nithin
title Automated identification of characteristic droplet size distributions in stratocumulus clouds utilizing a data clustering algorithm
title_short Automated identification of characteristic droplet size distributions in stratocumulus clouds utilizing a data clustering algorithm
title_full Automated identification of characteristic droplet size distributions in stratocumulus clouds utilizing a data clustering algorithm
title_fullStr Automated identification of characteristic droplet size distributions in stratocumulus clouds utilizing a data clustering algorithm
title_full_unstemmed Automated identification of characteristic droplet size distributions in stratocumulus clouds utilizing a data clustering algorithm
title_sort automated identification of characteristic droplet size distributions in stratocumulus clouds utilizing a data clustering algorithm
publishDate 2022
url http://www.osti.gov/servlets/purl/1884127
https://www.osti.gov/biblio/1884127
https://doi.org/10.1175/aies-d-22-0003.1
genre North Atlantic
genre_facet North Atlantic
op_relation http://www.osti.gov/servlets/purl/1884127
https://www.osti.gov/biblio/1884127
https://doi.org/10.1175/aies-d-22-0003.1
doi:10.1175/aies-d-22-0003.1
op_doi https://doi.org/10.1175/aies-d-22-0003.1
container_title Artificial Intelligence for the Earth Systems
container_volume 1
container_issue 3
_version_ 1772817530387169280