Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge

The INTERSPEECH 2019 Orca Activity Challenge consists in the detection of the Orca sounds from underwater audio signal. Orca can produce a wide variety of sounds categorized in clicks, whistles and pulsed calls. Clicks are useful for echolocation, whistles and pulsed calls are used as social signals...

Full description

Bibliographic Details
Published in:Interspeech 2019
Main Authors: Caraty, Marie-José, Montacié, Claude
Other Authors: Équipe Linguistique computationnelle (STIH-LC), Sens, Texte, Informatique, Histoire (STIH), Sorbonne Université (SU)-Sorbonne Université (SU)
Format: Conference Object
Language:English
Published: HAL CCSD 2019
Subjects:
Online Access:https://hal.science/hal-03946087
https://doi.org/10.21437/Interspeech.2019-1693
id ftccsdartic:oai:HAL:hal-03946087v1
record_format openpolar
spelling ftccsdartic:oai:HAL:hal-03946087v1 2023-11-05T03:44:31+01:00 Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge Caraty, Marie-José Montacié, Claude Équipe Linguistique computationnelle (STIH-LC) Sens, Texte, Informatique, Histoire (STIH) Sorbonne Université (SU)-Sorbonne Université (SU) Graz, Austria, France 2019-09-15 https://hal.science/hal-03946087 https://doi.org/10.21437/Interspeech.2019-1693 en eng HAL CCSD ISCA info:eu-repo/semantics/altIdentifier/doi/10.21437/Interspeech.2019-1693 hal-03946087 https://hal.science/hal-03946087 doi:10.21437/Interspeech.2019-1693 Interspeech 2019 https://hal.science/hal-03946087 Interspeech 2019, Sep 2019, Graz, Austria, France. pp.2428-2432, ⟨10.21437/Interspeech.2019-1693⟩ [INFO]Computer Science [cs] info:eu-repo/semantics/conferenceObject Conference papers 2019 ftccsdartic https://doi.org/10.21437/Interspeech.2019-1693 2023-10-07T22:45:31Z The INTERSPEECH 2019 Orca Activity Challenge consists in the detection of the Orca sounds from underwater audio signal. Orca can produce a wide variety of sounds categorized in clicks, whistles and pulsed calls. Clicks are useful for echolocation, whistles and pulsed calls are used as social signals. Experiments were conducted on DeepAL Fieldwork Data (DLFD). Underwater sounds were recorded in northern British Columbia by a hydrophones array. Recordings were labeled by marine biologists in Orca sounds or Noise. We have investigated multiresolution analysis according to the three main relevant acoustic levels: spatial, temporal and spectral. For this purpose, we studied the beamforming array analysis, the multitemporal resolution and the multilevel wavelet decomposition. For the spatial level, a beamforming algorithm was used for denoising the underwater audio signal.For the temporal level, two sets of multitemporal three-level features were extracted using pyramidal representation. For the spectral level, in order to detect transient sound, waveletanalysis was computed using various wavelet families. At last, an Orca Activity detector was designed combining ComParE set with multitemporal and multilevel wavelet features.Experiments on the Test set have shown a significant improvement of 0.051, compared to the baseline performance of the Challenge (0.866) Conference Object Orca Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe) Interspeech 2019 2428 2432
institution Open Polar
collection Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe)
op_collection_id ftccsdartic
language English
topic [INFO]Computer Science [cs]
spellingShingle [INFO]Computer Science [cs]
Caraty, Marie-José
Montacié, Claude
Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge
topic_facet [INFO]Computer Science [cs]
description The INTERSPEECH 2019 Orca Activity Challenge consists in the detection of the Orca sounds from underwater audio signal. Orca can produce a wide variety of sounds categorized in clicks, whistles and pulsed calls. Clicks are useful for echolocation, whistles and pulsed calls are used as social signals. Experiments were conducted on DeepAL Fieldwork Data (DLFD). Underwater sounds were recorded in northern British Columbia by a hydrophones array. Recordings were labeled by marine biologists in Orca sounds or Noise. We have investigated multiresolution analysis according to the three main relevant acoustic levels: spatial, temporal and spectral. For this purpose, we studied the beamforming array analysis, the multitemporal resolution and the multilevel wavelet decomposition. For the spatial level, a beamforming algorithm was used for denoising the underwater audio signal.For the temporal level, two sets of multitemporal three-level features were extracted using pyramidal representation. For the spectral level, in order to detect transient sound, waveletanalysis was computed using various wavelet families. At last, an Orca Activity detector was designed combining ComParE set with multitemporal and multilevel wavelet features.Experiments on the Test set have shown a significant improvement of 0.051, compared to the baseline performance of the Challenge (0.866)
author2 Équipe Linguistique computationnelle (STIH-LC)
Sens, Texte, Informatique, Histoire (STIH)
Sorbonne Université (SU)-Sorbonne Université (SU)
format Conference Object
author Caraty, Marie-José
Montacié, Claude
author_facet Caraty, Marie-José
Montacié, Claude
author_sort Caraty, Marie-José
title Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge
title_short Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge
title_full Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge
title_fullStr Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge
title_full_unstemmed Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge
title_sort spatial, temporal and spectral multiresolution analysis for the interspeech 2019 compare challenge
publisher HAL CCSD
publishDate 2019
url https://hal.science/hal-03946087
https://doi.org/10.21437/Interspeech.2019-1693
op_coverage Graz, Austria, France
genre Orca
genre_facet Orca
op_source Interspeech 2019
https://hal.science/hal-03946087
Interspeech 2019, Sep 2019, Graz, Austria, France. pp.2428-2432, ⟨10.21437/Interspeech.2019-1693⟩
op_relation info:eu-repo/semantics/altIdentifier/doi/10.21437/Interspeech.2019-1693
hal-03946087
https://hal.science/hal-03946087
doi:10.21437/Interspeech.2019-1693
op_doi https://doi.org/10.21437/Interspeech.2019-1693
container_title Interspeech 2019
container_start_page 2428
op_container_end_page 2432
_version_ 1781704566645981184