Understanding Internal Cluster Variability Through Subcluster Metric Analysis in a Geophysical Context

Abstract Clustering algorithms are commonly used for inspecting the behavior of clouds in both model and satellite data sets. Often overlooked in cluster analysis is the variability that occurs within any clusters generated. This is particularly important in the geophysics where clusters are often g...

Full description

Bibliographic Details
Published in:Earth and Space Science
Main Authors: A. J. Schuddeboom, A. J. McDonald
Format: Article in Journal/Newspaper
Language:English
Published: American Geophysical Union (AGU) 2023
Subjects:
Online Access:https://doi.org/10.1029/2022EA002373
https://doaj.org/article/6fafa3b5a66c484fa1de9e850c90e361
id ftdoajarticles:oai:doaj.org/article:6fafa3b5a66c484fa1de9e850c90e361
record_format openpolar
spelling ftdoajarticles:oai:doaj.org/article:6fafa3b5a66c484fa1de9e850c90e361 2023-05-15T15:11:39+02:00 Understanding Internal Cluster Variability Through Subcluster Metric Analysis in a Geophysical Context A. J. Schuddeboom A. J. McDonald 2023-01-01T00:00:00Z https://doi.org/10.1029/2022EA002373 https://doaj.org/article/6fafa3b5a66c484fa1de9e850c90e361 EN eng American Geophysical Union (AGU) https://doi.org/10.1029/2022EA002373 https://doaj.org/toc/2333-5084 2333-5084 doi:10.1029/2022EA002373 https://doaj.org/article/6fafa3b5a66c484fa1de9e850c90e361 Earth and Space Science, Vol 10, Iss 1, Pp n/a-n/a (2023) clustering cloud modeling Moderate Resolution Imaging Spectroradiometer machine learning Clouds and the Earth's Radiant Energy System entropy Astronomy QB1-991 Geology QE1-996.5 article 2023 ftdoajarticles https://doi.org/10.1029/2022EA002373 2023-01-29T01:26:13Z Abstract Clustering algorithms are commonly used for inspecting the behavior of clouds in both model and satellite data sets. Often overlooked in cluster analysis is the variability that occurs within any clusters generated. This is particularly important in the geophysics where clusters are often generated with a focus on interpretability over mathematical optimization. Two metrics, the Davies‐Bouldin index and the subsom entropy, are used to identify clusters with large internal variability. These metrics are applied to an example set of clusters from prior research that were generated using cloud top pressure‐cloud optical thickness joint histograms from the Moderate Resolution Imaging Spectroradiometer data set. Applying these metrics to the clusters identifies one cluster in particular as a major outlier. Examining the calculations behind these metrics in more detail provides further information about the internal variability of the clusters. The clusters are also examined over several geographic regions showing mostly consistent behavior. There are, however, some large anomalies such as the behavior of the clear sky cluster or the behavior of several different clusters over the Arctic Ocean. To aide our interpretation of these results, two clusters are chosen for a detailed analysis of their subclusters. The geographic distributions and radiative properties of these subclusters are examined and clearly identify that subclusters have physically distinct behavior. This result illustrates that these metrics are capable of determining when a cluster contains physically distinct subclusters. This demonstrates the potential utility of these metrics if they were applied to other geophysical data sets. Article in Journal/Newspaper Arctic Arctic Ocean Directory of Open Access Journals: DOAJ Articles Arctic Arctic Ocean Earth and Space Science 10 1
institution Open Polar
collection Directory of Open Access Journals: DOAJ Articles
op_collection_id ftdoajarticles
language English
topic clustering
cloud modeling
Moderate Resolution Imaging Spectroradiometer
machine learning
Clouds and the Earth's Radiant Energy System
entropy
Astronomy
QB1-991
Geology
QE1-996.5
spellingShingle clustering
cloud modeling
Moderate Resolution Imaging Spectroradiometer
machine learning
Clouds and the Earth's Radiant Energy System
entropy
Astronomy
QB1-991
Geology
QE1-996.5
A. J. Schuddeboom
A. J. McDonald
Understanding Internal Cluster Variability Through Subcluster Metric Analysis in a Geophysical Context
topic_facet clustering
cloud modeling
Moderate Resolution Imaging Spectroradiometer
machine learning
Clouds and the Earth's Radiant Energy System
entropy
Astronomy
QB1-991
Geology
QE1-996.5
description Abstract Clustering algorithms are commonly used for inspecting the behavior of clouds in both model and satellite data sets. Often overlooked in cluster analysis is the variability that occurs within any clusters generated. This is particularly important in the geophysics where clusters are often generated with a focus on interpretability over mathematical optimization. Two metrics, the Davies‐Bouldin index and the subsom entropy, are used to identify clusters with large internal variability. These metrics are applied to an example set of clusters from prior research that were generated using cloud top pressure‐cloud optical thickness joint histograms from the Moderate Resolution Imaging Spectroradiometer data set. Applying these metrics to the clusters identifies one cluster in particular as a major outlier. Examining the calculations behind these metrics in more detail provides further information about the internal variability of the clusters. The clusters are also examined over several geographic regions showing mostly consistent behavior. There are, however, some large anomalies such as the behavior of the clear sky cluster or the behavior of several different clusters over the Arctic Ocean. To aide our interpretation of these results, two clusters are chosen for a detailed analysis of their subclusters. The geographic distributions and radiative properties of these subclusters are examined and clearly identify that subclusters have physically distinct behavior. This result illustrates that these metrics are capable of determining when a cluster contains physically distinct subclusters. This demonstrates the potential utility of these metrics if they were applied to other geophysical data sets.
format Article in Journal/Newspaper
author A. J. Schuddeboom
A. J. McDonald
author_facet A. J. Schuddeboom
A. J. McDonald
author_sort A. J. Schuddeboom
title Understanding Internal Cluster Variability Through Subcluster Metric Analysis in a Geophysical Context
title_short Understanding Internal Cluster Variability Through Subcluster Metric Analysis in a Geophysical Context
title_full Understanding Internal Cluster Variability Through Subcluster Metric Analysis in a Geophysical Context
title_fullStr Understanding Internal Cluster Variability Through Subcluster Metric Analysis in a Geophysical Context
title_full_unstemmed Understanding Internal Cluster Variability Through Subcluster Metric Analysis in a Geophysical Context
title_sort understanding internal cluster variability through subcluster metric analysis in a geophysical context
publisher American Geophysical Union (AGU)
publishDate 2023
url https://doi.org/10.1029/2022EA002373
https://doaj.org/article/6fafa3b5a66c484fa1de9e850c90e361
geographic Arctic
Arctic Ocean
geographic_facet Arctic
Arctic Ocean
genre Arctic
Arctic Ocean
genre_facet Arctic
Arctic Ocean
op_source Earth and Space Science, Vol 10, Iss 1, Pp n/a-n/a (2023)
op_relation https://doi.org/10.1029/2022EA002373
https://doaj.org/toc/2333-5084
2333-5084
doi:10.1029/2022EA002373
https://doaj.org/article/6fafa3b5a66c484fa1de9e850c90e361
op_doi https://doi.org/10.1029/2022EA002373
container_title Earth and Space Science
container_volume 10
container_issue 1
_version_ 1766342476593889280