Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge
The INTERSPEECH 2019 Orca Activity Challenge consists in the detection of the Orca sounds from underwater audio signal. Orca can produce a wide variety of sounds categorized in clicks, whistles and pulsed calls. Clicks are useful for echolocation, whistles and pulsed calls are used as social signals...
Published in: | Interspeech 2019 |
---|---|
Main Authors: | , |
Other Authors: | , , |
Format: | Conference Object |
Language: | English |
Published: |
HAL CCSD
2019
|
Subjects: | |
Online Access: | https://hal.science/hal-03946087 https://doi.org/10.21437/Interspeech.2019-1693 |
id |
ftunivnantes:oai:HAL:hal-03946087v1 |
---|---|
record_format |
openpolar |
spelling |
ftunivnantes:oai:HAL:hal-03946087v1 2023-05-15T17:53:08+02:00 Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge Caraty, Marie-José Montacié, Claude Équipe Linguistique computationnelle (STIH-LC) Sens, Texte, Informatique, Histoire (STIH) Sorbonne Université (SU)-Sorbonne Université (SU) Graz, Austria, France 2019-09-15 https://hal.science/hal-03946087 https://doi.org/10.21437/Interspeech.2019-1693 en eng HAL CCSD ISCA info:eu-repo/semantics/altIdentifier/doi/10.21437/Interspeech.2019-1693 hal-03946087 https://hal.science/hal-03946087 doi:10.21437/Interspeech.2019-1693 Interspeech 2019 https://hal.science/hal-03946087 Interspeech 2019, Sep 2019, Graz, Austria, France. pp.2428-2432, ⟨10.21437/Interspeech.2019-1693⟩ [INFO]Computer Science [cs] info:eu-repo/semantics/conferenceObject Conference papers 2019 ftunivnantes https://doi.org/10.21437/Interspeech.2019-1693 2023-02-08T01:50:37Z The INTERSPEECH 2019 Orca Activity Challenge consists in the detection of the Orca sounds from underwater audio signal. Orca can produce a wide variety of sounds categorized in clicks, whistles and pulsed calls. Clicks are useful for echolocation, whistles and pulsed calls are used as social signals. Experiments were conducted on DeepAL Fieldwork Data (DLFD). Underwater sounds were recorded in northern British Columbia by a hydrophones array. Recordings were labeled by marine biologists in Orca sounds or Noise. We have investigated multiresolution analysis according to the three main relevant acoustic levels: spatial, temporal and spectral. For this purpose, we studied the beamforming array analysis, the multitemporal resolution and the multilevel wavelet decomposition. For the spatial level, a beamforming algorithm was used for denoising the underwater audio signal.For the temporal level, two sets of multitemporal three-level features were extracted using pyramidal representation. For the spectral level, in order to detect transient sound, waveletanalysis was computed using various wavelet families. At last, an Orca Activity detector was designed combining ComParE set with multitemporal and multilevel wavelet features.Experiments on the Test set have shown a significant improvement of 0.051, compared to the baseline performance of the Challenge (0.866) Conference Object Orca Université de Nantes: HAL-UNIV-NANTES Interspeech 2019 2428 2432 |
institution |
Open Polar |
collection |
Université de Nantes: HAL-UNIV-NANTES |
op_collection_id |
ftunivnantes |
language |
English |
topic |
[INFO]Computer Science [cs] |
spellingShingle |
[INFO]Computer Science [cs] Caraty, Marie-José Montacié, Claude Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge |
topic_facet |
[INFO]Computer Science [cs] |
description |
The INTERSPEECH 2019 Orca Activity Challenge consists in the detection of the Orca sounds from underwater audio signal. Orca can produce a wide variety of sounds categorized in clicks, whistles and pulsed calls. Clicks are useful for echolocation, whistles and pulsed calls are used as social signals. Experiments were conducted on DeepAL Fieldwork Data (DLFD). Underwater sounds were recorded in northern British Columbia by a hydrophones array. Recordings were labeled by marine biologists in Orca sounds or Noise. We have investigated multiresolution analysis according to the three main relevant acoustic levels: spatial, temporal and spectral. For this purpose, we studied the beamforming array analysis, the multitemporal resolution and the multilevel wavelet decomposition. For the spatial level, a beamforming algorithm was used for denoising the underwater audio signal.For the temporal level, two sets of multitemporal three-level features were extracted using pyramidal representation. For the spectral level, in order to detect transient sound, waveletanalysis was computed using various wavelet families. At last, an Orca Activity detector was designed combining ComParE set with multitemporal and multilevel wavelet features.Experiments on the Test set have shown a significant improvement of 0.051, compared to the baseline performance of the Challenge (0.866) |
author2 |
Équipe Linguistique computationnelle (STIH-LC) Sens, Texte, Informatique, Histoire (STIH) Sorbonne Université (SU)-Sorbonne Université (SU) |
format |
Conference Object |
author |
Caraty, Marie-José Montacié, Claude |
author_facet |
Caraty, Marie-José Montacié, Claude |
author_sort |
Caraty, Marie-José |
title |
Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge |
title_short |
Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge |
title_full |
Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge |
title_fullStr |
Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge |
title_full_unstemmed |
Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge |
title_sort |
spatial, temporal and spectral multiresolution analysis for the interspeech 2019 compare challenge |
publisher |
HAL CCSD |
publishDate |
2019 |
url |
https://hal.science/hal-03946087 https://doi.org/10.21437/Interspeech.2019-1693 |
op_coverage |
Graz, Austria, France |
genre |
Orca |
genre_facet |
Orca |
op_source |
Interspeech 2019 https://hal.science/hal-03946087 Interspeech 2019, Sep 2019, Graz, Austria, France. pp.2428-2432, ⟨10.21437/Interspeech.2019-1693⟩ |
op_relation |
info:eu-repo/semantics/altIdentifier/doi/10.21437/Interspeech.2019-1693 hal-03946087 https://hal.science/hal-03946087 doi:10.21437/Interspeech.2019-1693 |
op_doi |
https://doi.org/10.21437/Interspeech.2019-1693 |
container_title |
Interspeech 2019 |
container_start_page |
2428 |
op_container_end_page |
2432 |
_version_ |
1766160862993711104 |