An unsupervised Hidden Markov Model-based system for the detection and classification of blue whale vocalizations off Chile

In this paper, we present an automatic method, without human supervision, for the detection and classification of blue whale vocalizations from passive acoustic monitoring (PAM) data using Hidden Markov Model technology implemented with a state-of-the-art machine learning platform, the Kaldi speech...

Full description

Bibliographic Details
Published in:Annals of Operations Research
Main Authors: Buchan, Susannah J., Mahú Sinclair, Rodrigo, Wuth, Jorge, Balcazar Cabrera, Naysa, Gutiérrez, Laura, Neira, Sergio, Becerra Yoma, Néstor
Format: Article in Journal/Newspaper
Language:English
Published: Taylor & Francis 2020
Subjects:
HMM
Online Access:https://doi.org/10.1080/09524622.2018.1563758
https://repositorio.uchile.cl/handle/2250/174698
Description
Summary:In this paper, we present an automatic method, without human supervision, for the detection and classification of blue whale vocalizations from passive acoustic monitoring (PAM) data using Hidden Markov Model technology implemented with a state-of-the-art machine learning platform, the Kaldi speech processing toolkit. 157.5 hours of PAM data were annotated for model training and testing, selected from a dataset collected from the Corcovado Gulf, Chilean Patagonia in 2016. The system obtained produced 85.3% accuracy for detection and classification of a range of different blue whale vocalizations. This system was then validated by comparing its unsupervised detection and classification results with the published results of southeast Pacific blue whale song phrase (‘SEP2’) via spectrogram cross-correlation, involving a dataset collected with a different hydrophone instrument. The proposed system led to a reduction in the root mean square error relative to published results as high as 80% when compared with comparable methods employed elsewhere. This is a significant step in advancing the monitoring of endangered whale populations in this region, which remains poorly covered in terms of PAM and general ocean observation. With further training, testing and validation, this system can be applied to other target signals and regions of the world ocean.