Using self-organizing maps to classify humpback whale song units and quantify their similarity

Classification of vocal signals can be undertaken using a wide variety of qualitative and quantitative techniques. Using east Australian humpback whale song from 2002-2014, a subset of vocal signals were acoustically measured and then classified using a self-organizing map (SOM). The SOM created 1)...

Full description

Bibliographic Details
Published in:The Journal of the Acoustical Society of America
Main Authors: Allen, Jenny A., Murray, Anita, Noad, Michael J., Dunlop, Rebecca A., Garland, Ellen Clare
Format: Article in Journal/Newspaper
Language:English
Published: 2017
Subjects:
Online Access:https://research-portal.st-andrews.ac.uk/en/researchoutput/using-selforganizing-maps-to-classify-humpback-whale-song-units-and-quantify-their-similarity(23c48721-dfaf-4525-88e9-4ac2d22e0631).html
https://doi.org/10.1121/1.4982040
https://research-repository.st-andrews.ac.uk/bitstream/10023/13109/1/Allen_2017_Self_organizing_JASA_AAM.pdf
Description
Summary:Classification of vocal signals can be undertaken using a wide variety of qualitative and quantitative techniques. Using east Australian humpback whale song from 2002-2014, a subset of vocal signals were acoustically measured and then classified using a self-organizing map (SOM). The SOM created 1) an acoustic dictionary of units representing the song’s repertoire, and 2) Cartesian distance measurements among all unit types (SOM nodes). Utilizing the SOM dictionary as a guide, additional song recordings from east Australia were rapidly (manually) transcribed. To assess the similarity in song sequences, the Cartesian distance output from the SOM was applied in Levenshtein distance similarity analyses as a weighting factor to better incorporate unit similarity in the calculation (previously a qualitative process). SOMs provide a more robust and repeatable means of categorizing acoustic signals along with a clear quantitative measurement of sound type similarity based on acoustic features. This method can be utilized for a wide variety of acoustic databases especially those containing very large datasets, and be applied across the vocalization research community to help address concerns surrounding inconsistency in manual classification.