Learning to detect odontocete whistles from generative samples:
We aim to detect Odontoceti (toothed whale) whistles from synthetic samples by learning the underlying distribution of the delphinid sounds and noise environment present in the real spectrograms. We present an unsupervised / self-supervised learning method that generates synthetic data to augment ex...
Other Authors: | , , , , |
---|---|
Format: | Thesis |
Language: | English |
Published: |
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/20.500.11929/sdsu:89727 |
id |
ftsandiegostateu:oai:drupal-site.org:sdsu_89727 |
---|---|
record_format |
openpolar |
spelling |
ftsandiegostateu:oai:drupal-site.org:sdsu_89727 2023-05-15T18:33:26+02:00 Learning to detect odontocete whistles from generative samples: Shah, Saumil Mehulbhai (author) Roch, Marie A. (Advisor) Liu, Xiaobai (Committee Member) Bailey, Barbara A. (Committee Member) Computer Science 2020-06-30 80 pages https://hdl.handle.net/20.500.11929/sdsu:89727 doi: en_US eng sdsu:89727 local: uri: doi: http://hdl.handle.net/20.500.11929/sdsu:89727 Thesis 2020 ftsandiegostateu https://doi.org/20.500.11929/sdsu:89727 2022-03-24T18:50:14Z We aim to detect Odontoceti (toothed whale) whistles from synthetic samples by learning the underlying distribution of the delphinid sounds and noise environment present in the real spectrograms. We present an unsupervised / self-supervised learning method that generates synthetic data to augment existing data for extracting whistle contours in time-frequency spectrograms. Our approach is novel compared to existing synthesis based techniques because it relies on deep neural networks to synthesize data that resemble original data. The proposed architecture employs a combination of generative adversarial networks (WGANs+CycleGAN) to generate synthetic whistles and noise spectrogram patches with their corresponding ground truth (GT) labels. These GANs produce synthetic spectrogram patches that look like patches of the real spectrograms created from the underwater hydrophone recordings, and their synthetic GT tonals mimic the actual human analyst annotations. We propose this as an alternative data-generation method for creating an augmented dataset used for training current CNN-based models (e.g., WGT). This CNN-based model produces confidence maps of whistle presence in the spectrogram that serves as the input to the existing whistle extraction system (e.g., Silbido). Our best synthesis method (100% original data + 100% synthetic data) showed a ⇠10-28% improvement in the F1 score of peak whistle energy performance compared to the existing synthesis-based algorithms such as EdgeGT, EdgeCanny which were trained on the entire dataset. Our method trained only a fraction of the data (6.25% original data + 1000% synthetic data) showed a 10% decrease in F1 score compared to existing synthesis-method trained on similar amounts of data like μWGT, and μWGT-RG. Also, The F1-score of the proposed method was 0.172 greater than our baseline whistle extraction algorithm (⇠27% improvement), and the precision was 0.332 considerably higher than the baseline method (⇠52% improvement) when applied to the whistles of long-beaked common ... Thesis toothed whale SDSUnbound (San Diego State University) Silbido ENVELOPE(-67.593,-67.593,-67.497,-67.497) |
institution |
Open Polar |
collection |
SDSUnbound (San Diego State University) |
op_collection_id |
ftsandiegostateu |
language |
English |
description |
We aim to detect Odontoceti (toothed whale) whistles from synthetic samples by learning the underlying distribution of the delphinid sounds and noise environment present in the real spectrograms. We present an unsupervised / self-supervised learning method that generates synthetic data to augment existing data for extracting whistle contours in time-frequency spectrograms. Our approach is novel compared to existing synthesis based techniques because it relies on deep neural networks to synthesize data that resemble original data. The proposed architecture employs a combination of generative adversarial networks (WGANs+CycleGAN) to generate synthetic whistles and noise spectrogram patches with their corresponding ground truth (GT) labels. These GANs produce synthetic spectrogram patches that look like patches of the real spectrograms created from the underwater hydrophone recordings, and their synthetic GT tonals mimic the actual human analyst annotations. We propose this as an alternative data-generation method for creating an augmented dataset used for training current CNN-based models (e.g., WGT). This CNN-based model produces confidence maps of whistle presence in the spectrogram that serves as the input to the existing whistle extraction system (e.g., Silbido). Our best synthesis method (100% original data + 100% synthetic data) showed a ⇠10-28% improvement in the F1 score of peak whistle energy performance compared to the existing synthesis-based algorithms such as EdgeGT, EdgeCanny which were trained on the entire dataset. Our method trained only a fraction of the data (6.25% original data + 1000% synthetic data) showed a 10% decrease in F1 score compared to existing synthesis-method trained on similar amounts of data like μWGT, and μWGT-RG. Also, The F1-score of the proposed method was 0.172 greater than our baseline whistle extraction algorithm (⇠27% improvement), and the precision was 0.332 considerably higher than the baseline method (⇠52% improvement) when applied to the whistles of long-beaked common ... |
author2 |
Shah, Saumil Mehulbhai (author) Roch, Marie A. (Advisor) Liu, Xiaobai (Committee Member) Bailey, Barbara A. (Committee Member) Computer Science |
format |
Thesis |
title |
Learning to detect odontocete whistles from generative samples: |
spellingShingle |
Learning to detect odontocete whistles from generative samples: |
title_short |
Learning to detect odontocete whistles from generative samples: |
title_full |
Learning to detect odontocete whistles from generative samples: |
title_fullStr |
Learning to detect odontocete whistles from generative samples: |
title_full_unstemmed |
Learning to detect odontocete whistles from generative samples: |
title_sort |
learning to detect odontocete whistles from generative samples: |
publishDate |
2020 |
url |
https://hdl.handle.net/20.500.11929/sdsu:89727 |
long_lat |
ENVELOPE(-67.593,-67.593,-67.497,-67.497) |
geographic |
Silbido |
geographic_facet |
Silbido |
genre |
toothed whale |
genre_facet |
toothed whale |
op_relation |
sdsu:89727 local: uri: doi: http://hdl.handle.net/20.500.11929/sdsu:89727 |
op_doi |
https://doi.org/20.500.11929/sdsu:89727 |
_version_ |
1766218036078968832 |