Learning Deep Models from Synthetic Data for Extracting Dolphin Whistle Contours
We present a learning-based method for extracting whistles of toothed whales (Odontoceti) in hydrophone recordings. Our method represents audio signals as time-frequency spectrograms and decomposes each spectrogram into a set of time-frequency patches. A deep neural network learns archetypical patte...
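Main Authors: | Li, Pu; Liu, Xiaobai; Palmer, K. J.; Fleishman, Erica; Gillespie, Douglas; Nosal, Eva-Marie; Shiu, Yu; Klinck, Holger; Cholewiak, Danielle; Helble, Tyler; Roch, Marie A. |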
---|---|
Format: | Text |
Language: | unknown |
Published: | 2020 |
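Subjects: | Quantitative Biology - Quantitative Methods; Computer Science - Sound; Electrical Engineering and Systems Science - Audio and Speech Processing |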
Online Access: | http://arxiv.org/abs/2005.08894 |
id |
ftarxivpreprints:oai:arXiv.org:2005.08894 |
---|---|
record_format |
openpolar |
spelling |
ftarxivpreprints:oai:arXiv.org:2005.08894 2023-09-05T13:23:44+02:00 Learning Deep Models from Synthetic Data for Extracting Dolphin Whistle Contours Li, Pu Liu, Xiaobai Palmer, K. J. Fleishman, Erica Gillespie, Douglas Nosal, Eva-Marie Shiu, Yu Klinck, Holger Cholewiak, Danielle Helble, Tyler Roch, Marie A. 2020-05-18 http://arxiv.org/abs/2005.08894 unknown http://arxiv.org/abs/2005.08894 Quantitative Biology - Quantitative Methods Computer Science - Sound Electrical Engineering and Systems Science - Audio and Speech Processing text 2020 ftarxivpreprints 2023-08-16T15:53:20Z We present a learning-based method for extracting whistles of toothed whales (Odontoceti) in hydrophone recordings. Our method represents audio signals as time-frequency spectrograms and decomposes each spectrogram into a set of time-frequency patches. A deep neural network learns archetypical patterns (e.g., crossings, frequency modulated sweeps) from the spectrogram patches and predicts time-frequency peaks that are associated with whistles. We also developed a comprehensive method to synthesize training samples from background environments and train the network with minimal human annotation effort. We applied the proposed learn-from-synthesis method to a subset of the public Detection, Classification, Localization, and Density Estimation (DCLDE) 2011 workshop data to extract whistle confidence maps, which we then processed with an existing contour extractor to produce whistle annotations. The F1-score of our best synthesis method was 0.158 greater than our baseline whistle extraction algorithm (~25% improvement) when applied to common dolphin (Delphinus spp.) and bottlenose dolphin (Tursiops truncatus) whistles. Comment: Invited paper for International Joint Conference on Neural Networks Text toothed whales ArXiv.org (Cornell University Library) |
institution |
Open Polar |
collection |
ArXiv.org (Cornell University Library) |
op_collection_id |
ftarxivpreprints |
language |
unknown |
topic |
Quantitative Biology - Quantitative Methods; Computer Science - Sound; Electrical Engineering and Systems Science - Audio and Speech Processing |
spellingShingle |
Quantitative Biology - Quantitative Methods Computer Science - Sound Electrical Engineering and Systems Science - Audio and Speech Processing Li, Pu Liu, Xiaobai Palmer, K. J. Fleishman, Erica Gillespie, Douglas Nosal, Eva-Marie Shiu, Yu Klinck, Holger Cholewiak, Danielle Helble, Tyler Roch, Marie A. Learning Deep Models from Synthetic Data for Extracting Dolphin Whistle Contours |
topic_facet |
Quantitative Biology - Quantitative Methods; Computer Science - Sound; Electrical Engineering and Systems Science - Audio and Speech Processing |
description |
We present a learning-based method for extracting whistles of toothed whales (Odontoceti) in hydrophone recordings. Our method represents audio signals as time-frequency spectrograms and decomposes each spectrogram into a set of time-frequency patches. A deep neural network learns archetypical patterns (e.g., crossings, frequency modulated sweeps) from the spectrogram patches and predicts time-frequency peaks that are associated with whistles. We also developed a comprehensive method to synthesize training samples from background environments and train the network with minimal human annotation effort. We applied the proposed learn-from-synthesis method to a subset of the public Detection, Classification, Localization, and Density Estimation (DCLDE) 2011 workshop data to extract whistle confidence maps, which we then processed with an existing contour extractor to produce whistle annotations. The F1-score of our best synthesis method was 0.158 greater than our baseline whistle extraction algorithm (~25% improvement) when applied to common dolphin (Delphinus spp.) and bottlenose dolphin (Tursiops truncatus) whistles. Comment: Invited paper for International Joint Conference on Neural Networks |
format |
Text |
author |
Li, Pu; Liu, Xiaobai; Palmer, K. J.; Fleishman, Erica; Gillespie, Douglas; Nosal, Eva-Marie; Shiu, Yu; Klinck, Holger; Cholewiak, Danielle; Helble, Tyler; Roch, Marie A. |
author_facet |
Li, Pu; Liu, Xiaobai; Palmer, K. J.; Fleishman, Erica; Gillespie, Douglas; Nosal, Eva-Marie; Shiu, Yu; Klinck, Holger; Cholewiak, Danielle; Helble, Tyler; Roch, Marie A. |
author_sort |
Li, Pu |
title |
Learning Deep Models from Synthetic Data for Extracting Dolphin Whistle Contours |
title_short |
Learning Deep Models from Synthetic Data for Extracting Dolphin Whistle Contours |
title_full |
Learning Deep Models from Synthetic Data for Extracting Dolphin Whistle Contours |
title_fullStr |
Learning Deep Models from Synthetic Data for Extracting Dolphin Whistle Contours |
title_full_unstemmed |
Learning Deep Models from Synthetic Data for Extracting Dolphin Whistle Contours |
title_sort |
learning deep models from synthetic data for extracting dolphin whistle contours |
publishDate |
2020 |
url |
http://arxiv.org/abs/2005.08894 |
genre |
toothed whales |
genre_facet |
toothed whales |
op_relation |
http://arxiv.org/abs/2005.08894 |
_version_ |
1776204328554463232 |
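
The description in this record says that audio is represented as a time-frequency spectrogram and that each spectrogram is decomposed into time-frequency patches. The sketch below illustrates that preprocessing idea only. It is not the authors' implementation: the frame length, hop size, patch size, non-overlapping tiling, and the function names `spectrogram` and `split_into_patches` are all assumptions made for illustration, and the neural network and contour extractor are not covered.

```python
import cmath
import math


def spectrogram(signal, frame_len, hop):
    """Magnitude spectrogram via a naive DFT; rows are frames, columns are frequency bins."""
    n_bins = frame_len // 2 + 1
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        # A Hann window reduces spectral leakage between neighbouring bins.
        windowed = [x * 0.5 * (1 - math.cos(2 * math.pi * n / (frame_len - 1)))
                    for n, x in enumerate(frame)]
        row = []
        for k in range(n_bins):
            acc = sum(x * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n, x in enumerate(windowed))
            row.append(abs(acc))
        frames.append(row)
    return frames


def split_into_patches(spec, patch_frames, patch_bins):
    """Tile the spectrogram into non-overlapping patches, dropping ragged edges.

    Non-overlapping tiling is an assumption of this sketch, not a detail
    taken from the record.
    """
    patches = []
    n_frames, n_bins = len(spec), len(spec[0])
    for t in range(0, n_frames - patch_frames + 1, patch_frames):
        for f in range(0, n_bins - patch_bins + 1, patch_bins):
            patch = [row[f:f + patch_bins] for row in spec[t:t + patch_frames]]
            # Keep the (frame, bin) origin so predictions can be mapped back.
            patches.append(((t, f), patch))
    return patches


# Hypothetical test signal: a pure tone that falls on bin 8 of a 64-sample frame.
sample_rate = 64
tone = [math.sin(2 * math.pi * 8 * n / sample_rate) for n in range(256)]
spec = spectrogram(tone, frame_len=64, hop=32)
patches = split_into_patches(spec, patch_frames=2, patch_bins=8)
```

Each patch carries its origin in the spectrogram, so per-patch outputs such as confidence values could in principle be reassembled into a full-size map. A real pipeline would use an FFT library rather than the naive DFT shown here, which is kept only so the example has no dependencies.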