Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge

International audience The INTERSPEECH 2019 Orca Activity Challenge consists in the detection of the Orca sounds from underwater audio signal. Orca can produce a wide variety of sounds categorized in clicks, whistles and pulsed calls. Clicks are useful for echolocation, whistles and pulsed calls are...

Full description

Bibliographic Details
Published in:Interspeech 2019
Main Authors: Caraty, Marie-José, Montacié, Claude
Other Authors: Équipe Linguistique computationnelle (STIH-LC), Sens, Texte, Informatique, Histoire (STIH), Sorbonne Université (SU)-Sorbonne Université (SU)
Format: Conference Object
Language:English
Published: HAL CCSD 2019
Subjects:
Online Access:https://hal.science/hal-03946087
https://doi.org/10.21437/Interspeech.2019-1693
id ftsorbonneuniv:oai:HAL:hal-03946087v1
record_format openpolar
spelling ftsorbonneuniv:oai:HAL:hal-03946087v1 2024-09-15T18:28:46+00:00 Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge Caraty, Marie-José Montacié, Claude Équipe Linguistique computationnelle (STIH-LC) Sens, Texte, Informatique, Histoire (STIH) Sorbonne Université (SU)-Sorbonne Université (SU) Graz, Austria, France 2019-09-15 https://hal.science/hal-03946087 https://doi.org/10.21437/Interspeech.2019-1693 en eng HAL CCSD ISCA info:eu-repo/semantics/altIdentifier/doi/10.21437/Interspeech.2019-1693 hal-03946087 https://hal.science/hal-03946087 doi:10.21437/Interspeech.2019-1693 Interspeech 2019 https://hal.science/hal-03946087 Interspeech 2019, Sep 2019, Graz, Austria, France. pp.2428-2432, ⟨10.21437/Interspeech.2019-1693⟩ [INFO]Computer Science [cs] info:eu-repo/semantics/conferenceObject Conference papers 2019 ftsorbonneuniv https://doi.org/10.21437/Interspeech.2019-1693 2024-07-25T23:47:45Z International audience The INTERSPEECH 2019 Orca Activity Challenge consists in the detection of the Orca sounds from underwater audio signal. Orca can produce a wide variety of sounds categorized in clicks, whistles and pulsed calls. Clicks are useful for echolocation, whistles and pulsed calls are used as social signals. Experiments were conducted on DeepAL Fieldwork Data (DLFD). Underwater sounds were recorded in northern British Columbia by a hydrophones array. Recordings were labeled by marine biologists in Orca sounds or Noise. We have investigated multiresolution analysis according to the three main relevant acoustic levels: spatial, temporal and spectral. For this purpose, we studied the beamforming array analysis, the multitemporal resolution and the multilevel wavelet decomposition. For the spatial level, a beamforming algorithm was used for denoising the underwater audio signal.For the temporal level, two sets of multitemporal three-level features were extracted using pyramidal representation. For the spectral level, in order to detect transient sound, waveletanalysis was computed using various wavelet families. At last, an Orca Activity detector was designed combining ComParE set with multitemporal and multilevel wavelet features.Experiments on the Test set have shown a significant improvement of 0.051, compared to the baseline performance of the Challenge (0.866) Conference Object Orca HAL Sorbonne Université Interspeech 2019 2428 2432
institution Open Polar
collection HAL Sorbonne Université
op_collection_id ftsorbonneuniv
language English
topic [INFO]Computer Science [cs]
spellingShingle [INFO]Computer Science [cs]
Caraty, Marie-José
Montacié, Claude
Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge
topic_facet [INFO]Computer Science [cs]
description International audience The INTERSPEECH 2019 Orca Activity Challenge consists in the detection of the Orca sounds from underwater audio signal. Orca can produce a wide variety of sounds categorized in clicks, whistles and pulsed calls. Clicks are useful for echolocation, whistles and pulsed calls are used as social signals. Experiments were conducted on DeepAL Fieldwork Data (DLFD). Underwater sounds were recorded in northern British Columbia by a hydrophones array. Recordings were labeled by marine biologists in Orca sounds or Noise. We have investigated multiresolution analysis according to the three main relevant acoustic levels: spatial, temporal and spectral. For this purpose, we studied the beamforming array analysis, the multitemporal resolution and the multilevel wavelet decomposition. For the spatial level, a beamforming algorithm was used for denoising the underwater audio signal.For the temporal level, two sets of multitemporal three-level features were extracted using pyramidal representation. For the spectral level, in order to detect transient sound, waveletanalysis was computed using various wavelet families. At last, an Orca Activity detector was designed combining ComParE set with multitemporal and multilevel wavelet features.Experiments on the Test set have shown a significant improvement of 0.051, compared to the baseline performance of the Challenge (0.866)
author2 Équipe Linguistique computationnelle (STIH-LC)
Sens, Texte, Informatique, Histoire (STIH)
Sorbonne Université (SU)-Sorbonne Université (SU)
format Conference Object
author Caraty, Marie-José
Montacié, Claude
author_facet Caraty, Marie-José
Montacié, Claude
author_sort Caraty, Marie-José
title Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge
title_short Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge
title_full Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge
title_fullStr Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge
title_full_unstemmed Spatial, Temporal and Spectral Multiresolution Analysis for the INTERSPEECH 2019 ComParE Challenge
title_sort spatial, temporal and spectral multiresolution analysis for the interspeech 2019 compare challenge
publisher HAL CCSD
publishDate 2019
url https://hal.science/hal-03946087
https://doi.org/10.21437/Interspeech.2019-1693
op_coverage Graz, Austria, France
genre Orca
genre_facet Orca
op_source Interspeech 2019
https://hal.science/hal-03946087
Interspeech 2019, Sep 2019, Graz, Austria, France. pp.2428-2432, ⟨10.21437/Interspeech.2019-1693⟩
op_relation info:eu-repo/semantics/altIdentifier/doi/10.21437/Interspeech.2019-1693
hal-03946087
https://hal.science/hal-03946087
doi:10.21437/Interspeech.2019-1693
op_doi https://doi.org/10.21437/Interspeech.2019-1693
container_title Interspeech 2019
container_start_page 2428
op_container_end_page 2432
_version_ 1810470208010190848