The Orchive : Data mining a massive bioacoustic archive
The Orchive is a large collection of over 20,000 hours of audio recordings from the OrcaLab research facility located off the northern tip of Vancouver Island. It contains recorded orca vocalizations from the 1980 to the present time and is one of the largest resources of bioacoustic data in the wor...
Main Authors: | , , , |
---|---|
Format: | Text |
Language: | unknown |
Published: |
2013
|
Subjects: | |
Online Access: | http://arxiv.org/abs/1307.0589 |
id |
ftarxivpreprints:oai:arXiv.org:1307.0589 |
---|---|
record_format |
openpolar |
spelling |
ftarxivpreprints:oai:arXiv.org:1307.0589 2023-09-05T13:22:20+02:00 The Orchive : Data mining a massive bioacoustic archive Ness, Steven Symonds, Helena Spong, Paul Tzanetakis, George 2013-07-02 http://arxiv.org/abs/1307.0589 unknown http://arxiv.org/abs/1307.0589 Computer Science - Machine Learning Computer Science - Databases Computer Science - Sound text 2013 ftarxivpreprints 2023-08-16T13:03:52Z The Orchive is a large collection of over 20,000 hours of audio recordings from the OrcaLab research facility located off the northern tip of Vancouver Island. It contains recorded orca vocalizations from the 1980 to the present time and is one of the largest resources of bioacoustic data in the world. We have developed a web-based interface that allows researchers to listen to these recordings, view waveform and spectral representations of the audio, label clips with annotations, and view the results of machine learning classifiers based on automatic audio features extraction. In this paper we describe such classifiers that discriminate between background noise, orca calls, and the voice notes that are present in most of the tapes. Furthermore we show classification results for individual calls based on a previously existing orca call catalog. We have also experimentally investigated the scalability of classifiers over the entire Orchive. Comment: ICML 2013 Workshop on Machine Learning for Bioacoustics Text Orca ArXiv.org (Cornell University Library) |
institution |
Open Polar |
collection |
ArXiv.org (Cornell University Library) |
op_collection_id |
ftarxivpreprints |
language |
unknown |
topic |
Computer Science - Machine Learning Computer Science - Databases Computer Science - Sound |
spellingShingle |
Computer Science - Machine Learning Computer Science - Databases Computer Science - Sound Ness, Steven Symonds, Helena Spong, Paul Tzanetakis, George The Orchive : Data mining a massive bioacoustic archive |
topic_facet |
Computer Science - Machine Learning Computer Science - Databases Computer Science - Sound |
description |
The Orchive is a large collection of over 20,000 hours of audio recordings from the OrcaLab research facility located off the northern tip of Vancouver Island. It contains recorded orca vocalizations from the 1980 to the present time and is one of the largest resources of bioacoustic data in the world. We have developed a web-based interface that allows researchers to listen to these recordings, view waveform and spectral representations of the audio, label clips with annotations, and view the results of machine learning classifiers based on automatic audio features extraction. In this paper we describe such classifiers that discriminate between background noise, orca calls, and the voice notes that are present in most of the tapes. Furthermore we show classification results for individual calls based on a previously existing orca call catalog. We have also experimentally investigated the scalability of classifiers over the entire Orchive. Comment: ICML 2013 Workshop on Machine Learning for Bioacoustics |
format |
Text |
author |
Ness, Steven Symonds, Helena Spong, Paul Tzanetakis, George |
author_facet |
Ness, Steven Symonds, Helena Spong, Paul Tzanetakis, George |
author_sort |
Ness, Steven |
title |
The Orchive : Data mining a massive bioacoustic archive |
title_short |
The Orchive : Data mining a massive bioacoustic archive |
title_full |
The Orchive : Data mining a massive bioacoustic archive |
title_fullStr |
The Orchive : Data mining a massive bioacoustic archive |
title_full_unstemmed |
The Orchive : Data mining a massive bioacoustic archive |
title_sort |
orchive : data mining a massive bioacoustic archive |
publishDate |
2013 |
url |
http://arxiv.org/abs/1307.0589 |
genre |
Orca |
genre_facet |
Orca |
op_relation |
http://arxiv.org/abs/1307.0589 |
_version_ |
1776202860404408320 |