North Atlantic Right Whales Up-call Detection Using Multimodel Deep Learning
A new method for North Atlantic Right Whales (NARW) up-call detection using Multimodel Deep Learning (MMDL) is presented in this paper. In this approach, signals from passive acoustic sensors are first converted to spectrogram and scalogram images, which are time-frequency representations of the sig...
Main Authors: | , , , , , |
---|---|
Format: | Article in Journal/Newspaper |
Language: | unknown |
Published: |
arXiv
2020
|
Subjects: | |
Online Access: | https://dx.doi.org/10.48550/arxiv.2005.08356 https://arxiv.org/abs/2005.08356 |
id |
ftdatacite:10.48550/arxiv.2005.08356 |
---|---|
record_format |
openpolar |
spelling |
ftdatacite:10.48550/arxiv.2005.08356 2023-05-15T17:30:22+02:00 North Atlantic Right Whales Up-call Detection Using Multimodel Deep Learning Ibrahim, Ali K Zhuang, Hanqi Ch'erubin, Laurent M. Erdol, Nurgun Corry-Crowe, Gregory O Ali, Ali Muhamed 2020 https://dx.doi.org/10.48550/arxiv.2005.08356 https://arxiv.org/abs/2005.08356 unknown arXiv Creative Commons Attribution Non Commercial Share Alike 4.0 International https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode cc-by-nc-sa-4.0 CC-BY-NC-SA Audio and Speech Processing eess.AS Sound cs.SD FOS Electrical engineering, electronic engineering, information engineering FOS Computer and information sciences Article CreativeWork article Preprint 2020 ftdatacite https://doi.org/10.48550/arxiv.2005.08356 2022-03-10T15:43:30Z A new method for North Atlantic Right Whales (NARW) up-call detection using Multimodel Deep Learning (MMDL) is presented in this paper. In this approach, signals from passive acoustic sensors are first converted to spectrogram and scalogram images, which are time-frequency representations of the signals. These images are in turn used to train an MMDL detec-tor, consisting of Convolutional Neural Networks (CNNs) and Stacked Auto Encoders (SAEs). Our experimental studies revealed that CNNs work better with spectrograms and SAEs with sca-lograms. Therefore in our experimental design, the CNNs are trained by using spectrogram im-ages, and the SAEs are trained by using scalogram images. A fusion mechanism is used to fuse the results from individual neural networks. In this paper, the results obtained from the MMDL detector are compared with those obtained from conventional machine learning algorithms trained with handcraft features. It is shown that the performance of the MMDL detector is sig-nificantly better than those of the representative conventional machine learning methods in terms of up-call detection rate, non-up-call detection rate, and false alarm rate. Article in Journal/Newspaper North Atlantic DataCite Metadata Store (German National Library of Science and Technology) |
institution |
Open Polar |
collection |
DataCite Metadata Store (German National Library of Science and Technology) |
op_collection_id |
ftdatacite |
language |
unknown |
topic |
Audio and Speech Processing eess.AS Sound cs.SD FOS Electrical engineering, electronic engineering, information engineering FOS Computer and information sciences |
spellingShingle |
Audio and Speech Processing eess.AS Sound cs.SD FOS Electrical engineering, electronic engineering, information engineering FOS Computer and information sciences Ibrahim, Ali K Zhuang, Hanqi Ch'erubin, Laurent M. Erdol, Nurgun Corry-Crowe, Gregory O Ali, Ali Muhamed North Atlantic Right Whales Up-call Detection Using Multimodel Deep Learning |
topic_facet |
Audio and Speech Processing eess.AS Sound cs.SD FOS Electrical engineering, electronic engineering, information engineering FOS Computer and information sciences |
description |
A new method for North Atlantic Right Whales (NARW) up-call detection using Multimodel Deep Learning (MMDL) is presented in this paper. In this approach, signals from passive acoustic sensors are first converted to spectrogram and scalogram images, which are time-frequency representations of the signals. These images are in turn used to train an MMDL detec-tor, consisting of Convolutional Neural Networks (CNNs) and Stacked Auto Encoders (SAEs). Our experimental studies revealed that CNNs work better with spectrograms and SAEs with sca-lograms. Therefore in our experimental design, the CNNs are trained by using spectrogram im-ages, and the SAEs are trained by using scalogram images. A fusion mechanism is used to fuse the results from individual neural networks. In this paper, the results obtained from the MMDL detector are compared with those obtained from conventional machine learning algorithms trained with handcraft features. It is shown that the performance of the MMDL detector is sig-nificantly better than those of the representative conventional machine learning methods in terms of up-call detection rate, non-up-call detection rate, and false alarm rate. |
format |
Article in Journal/Newspaper |
author |
Ibrahim, Ali K Zhuang, Hanqi Ch'erubin, Laurent M. Erdol, Nurgun Corry-Crowe, Gregory O Ali, Ali Muhamed |
author_facet |
Ibrahim, Ali K Zhuang, Hanqi Ch'erubin, Laurent M. Erdol, Nurgun Corry-Crowe, Gregory O Ali, Ali Muhamed |
author_sort |
Ibrahim, Ali K |
title |
North Atlantic Right Whales Up-call Detection Using Multimodel Deep Learning |
title_short |
North Atlantic Right Whales Up-call Detection Using Multimodel Deep Learning |
title_full |
North Atlantic Right Whales Up-call Detection Using Multimodel Deep Learning |
title_fullStr |
North Atlantic Right Whales Up-call Detection Using Multimodel Deep Learning |
title_full_unstemmed |
North Atlantic Right Whales Up-call Detection Using Multimodel Deep Learning |
title_sort |
north atlantic right whales up-call detection using multimodel deep learning |
publisher |
arXiv |
publishDate |
2020 |
url |
https://dx.doi.org/10.48550/arxiv.2005.08356 https://arxiv.org/abs/2005.08356 |
genre |
North Atlantic |
genre_facet |
North Atlantic |
op_rights |
Creative Commons Attribution Non Commercial Share Alike 4.0 International https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode cc-by-nc-sa-4.0 |
op_rightsnorm |
CC-BY-NC-SA |
op_doi |
https://doi.org/10.48550/arxiv.2005.08356 |
_version_ |
1766126725919408128 |