Listening to Ice and Ocean: Machine Learning for Seismic and Acoustic Environmental Characterization

Seismology and ocean acoustics are important remote sensing tools, enabling observation of environments that are difficult to access and directly measure. Seismic and ocean acoustic remote sensing are data-intensive tasks, and the proliferation of remote sensing systems has led to the generation of...

Full description

Bibliographic Details
Main Author: Jenkins II, William Frost
Other Authors: Gerstoft, Peter
Format: Thesis
Language:English
Published: eScholarship, University of California 2023
Subjects:
Online Access:https://escholarship.org/uc/item/26p4w53f
Description
Summary:Seismology and ocean acoustics are important remote sensing tools, enabling observation of environments that are difficult to access and directly measure. Seismic and ocean acoustic remote sensing are data-intensive tasks, and the proliferation of remote sensing systems has led to the generation of vast amounts of data. Meanwhile, advances in machine learning (ML) techniques and computational capacity have yielded state-of-the-art methodologies for processing and analyzing large seismic and acoustic data sets. This dissertation presents two ML-based paradigms for the characterization of environments using seismic and acoustic data.First, unsupervised ML is demonstrated for automatically identifying dominant types of seismicity present data recorded from a 34-station broadband seismic array deployed on the Ross Ice Shelf (RIS), Antarctica from 2014 to 2017. The data set contains signals generated by glaciological processes that have been used to monitor the integrity and dynamics of ice shelves. Deep clustering automatically groups these signals into classes without the need for manual labeling, enabling comparison of potential source mechanisms with not only the spatial and temporal distributions of the signals but also their characteristics. The method learns the salient features of spectrograms and encodes them into a lower-dimensional latent representation using an autoencoder, a type of deep neural network. Two clustering methods are applied to the latent data and compared: a Gaussian mixture model (GMM) and deep-embedded clustering (DEC). Dominant types of seismic signals are identified and compared with environmental data such as temperature, wind speed, tides, and sea ice concentration. The highest seismicity occurred at the RIS front during the 2016 El NiƱo summer, and diurnally near grounding zones throughout the deployment.The second paradigm presents Bayesian optimization (BO) as a method for efficiently estimating geoacoustic parameters within a fixed computational budget. An objective function is ...