The Orchive : Data mining a massive bioacoustic archive

The Orchive is a large collection of over 20,000 hours of audio recordings from the OrcaLab research facility located off the northern tip of Vancouver Island. It contains recorded orca vocalizations from the 1980 to the present time and is one of the largest resources of bioacoustic data in the wor...

Full description

Bibliographic Details
Main Authors: Ness, Steven, Symonds, Helena, Spong, Paul, Tzanetakis, George
Format: Text
Language:unknown
Published: 2013
Subjects:
Online Access:http://arxiv.org/abs/1307.0589
id ftarxivpreprints:oai:arXiv.org:1307.0589
record_format openpolar
spelling ftarxivpreprints:oai:arXiv.org:1307.0589 2023-09-05T13:22:20+02:00 The Orchive : Data mining a massive bioacoustic archive Ness, Steven Symonds, Helena Spong, Paul Tzanetakis, George 2013-07-02 http://arxiv.org/abs/1307.0589 unknown http://arxiv.org/abs/1307.0589 Computer Science - Machine Learning Computer Science - Databases Computer Science - Sound text 2013 ftarxivpreprints 2023-08-16T13:03:52Z The Orchive is a large collection of over 20,000 hours of audio recordings from the OrcaLab research facility located off the northern tip of Vancouver Island. It contains recorded orca vocalizations from the 1980 to the present time and is one of the largest resources of bioacoustic data in the world. We have developed a web-based interface that allows researchers to listen to these recordings, view waveform and spectral representations of the audio, label clips with annotations, and view the results of machine learning classifiers based on automatic audio features extraction. In this paper we describe such classifiers that discriminate between background noise, orca calls, and the voice notes that are present in most of the tapes. Furthermore we show classification results for individual calls based on a previously existing orca call catalog. We have also experimentally investigated the scalability of classifiers over the entire Orchive. Comment: ICML 2013 Workshop on Machine Learning for Bioacoustics Text Orca ArXiv.org (Cornell University Library)
institution Open Polar
collection ArXiv.org (Cornell University Library)
op_collection_id ftarxivpreprints
language unknown
topic Computer Science - Machine Learning
Computer Science - Databases
Computer Science - Sound
spellingShingle Computer Science - Machine Learning
Computer Science - Databases
Computer Science - Sound
Ness, Steven
Symonds, Helena
Spong, Paul
Tzanetakis, George
The Orchive : Data mining a massive bioacoustic archive
topic_facet Computer Science - Machine Learning
Computer Science - Databases
Computer Science - Sound
description The Orchive is a large collection of over 20,000 hours of audio recordings from the OrcaLab research facility located off the northern tip of Vancouver Island. It contains recorded orca vocalizations from the 1980 to the present time and is one of the largest resources of bioacoustic data in the world. We have developed a web-based interface that allows researchers to listen to these recordings, view waveform and spectral representations of the audio, label clips with annotations, and view the results of machine learning classifiers based on automatic audio features extraction. In this paper we describe such classifiers that discriminate between background noise, orca calls, and the voice notes that are present in most of the tapes. Furthermore we show classification results for individual calls based on a previously existing orca call catalog. We have also experimentally investigated the scalability of classifiers over the entire Orchive. Comment: ICML 2013 Workshop on Machine Learning for Bioacoustics
format Text
author Ness, Steven
Symonds, Helena
Spong, Paul
Tzanetakis, George
author_facet Ness, Steven
Symonds, Helena
Spong, Paul
Tzanetakis, George
author_sort Ness, Steven
title The Orchive : Data mining a massive bioacoustic archive
title_short The Orchive : Data mining a massive bioacoustic archive
title_full The Orchive : Data mining a massive bioacoustic archive
title_fullStr The Orchive : Data mining a massive bioacoustic archive
title_full_unstemmed The Orchive : Data mining a massive bioacoustic archive
title_sort orchive : data mining a massive bioacoustic archive
publishDate 2013
url http://arxiv.org/abs/1307.0589
genre Orca
genre_facet Orca
op_relation http://arxiv.org/abs/1307.0589
_version_ 1776202860404408320