Retrieval of Volcanic Ash Cloud Base Height Using Machine Learning Algorithms
There are distinct differences between radiation characteristics of volcanic ash and meteorological clouds, and conventional retrieval methods for cloud base height (CBH) of the latter are difficult to apply to volcanic ash without substantial parameterisation and model correction. Furthermore, exis...
Published in: | Atmosphere |
---|---|
Main Authors: | , , , , |
Format: | Article in Journal/Newspaper |
Language: | English |
Published: |
MDPI AG
2023
|
Subjects: | |
Online Access: | https://doi.org/10.3390/atmos14020228 https://doaj.org/article/2c4c77867fc54b15b1eb58f6f103f207 |
Summary: | There are distinct differences between radiation characteristics of volcanic ash and meteorological clouds, and conventional retrieval methods for cloud base height (CBH) of the latter are difficult to apply to volcanic ash without substantial parameterisation and model correction. Furthermore, existing CBH inversion methods have limitations, including the involvement of many empirical formulae and a dependence on the accuracy of upstream cloud products. A machine learning (ML) method was developed for the retrieval of volcanic ash cloud base height (VBH) to reduce uncertainties in physical CBH retrieval methods. This new methodology takes advantage of polar-orbit active remote-sensing data from the Cloud-Aerosol Lidar with Orthogonal Polarization (CALIOP), from vertical profile information and from geostationary passive remote-sensing measurements from the Spinning Enhanced Visible and Infrared Imager (SEVIRI) and the Advanced Geostationary Radiation Imager (AGRI) aboard the Meteosat Second Generation (MSG) and FengYun-4B (FY-4B) satellites, respectively. The methodology involves a statistics-based algorithm with hybrid use of principal component analysis (PCA) and one of four ML algorithms including the k-nearest neighbour (KNN), extreme gradient boosting (XGBoost), random forest (RF), and gradient boosting decision tree (GBDT) methods. Eruptions of the Eyjafjallajökull volcano (Iceland) during April-May 2010, the Puyehue-Cordón Caulle volcanic complex (Chilean Andes) in June 2011, and the Hunga Tonga-Hunga Ha’apai volcano (Tonga) in January 2022 were selected as typical cases for the construction of the training and validation sample sets. We demonstrate that a combination of PCA and GBDT performs more accurately than other combinations, with a mean absolute error (MAE) of 1.152 km, a root mean square error (RMSE) of 1.529 km, and a Pearson’s correlation coefficient (r) of 0.724. Use of PCA as an additional process before training reduces feature relevance between input predictors and improves algorithm ... |
---|