The Orchive : Data mining a massive bioacoustic archive

The Orchive is a large collection of over 20,000 hours of audio recordings from the OrcaLab research facility located off the northern tip of Vancouver Island. It contains recorded orca vocalizations from the 1980 to the present time and is one of the largest resources of bioacoustic data in the wor...

Full description

Bibliographic Details
Main Authors:	Ness, Steven, Symonds, Helena, Spong, Paul, Tzanetakis, George
Format:	Text
Language:	unknown
Published:	2013
Subjects:	Computer Science - Machine Learning Computer Science - Databases Computer Science - Sound Orca
Online Access:	http://arxiv.org/abs/1307.0589

id	ftarxivpreprints:oai:arXiv.org:1307.0589
record_format	openpolar
spelling	ftarxivpreprints:oai:arXiv.org:1307.0589 2023-09-05T13:22:20+02:00 The Orchive : Data mining a massive bioacoustic archive Ness, Steven Symonds, Helena Spong, Paul Tzanetakis, George 2013-07-02 http://arxiv.org/abs/1307.0589 unknown http://arxiv.org/abs/1307.0589 Computer Science - Machine Learning Computer Science - Databases Computer Science - Sound text 2013 ftarxivpreprints 2023-08-16T13:03:52Z The Orchive is a large collection of over 20,000 hours of audio recordings from the OrcaLab research facility located off the northern tip of Vancouver Island. It contains recorded orca vocalizations from the 1980 to the present time and is one of the largest resources of bioacoustic data in the world. We have developed a web-based interface that allows researchers to listen to these recordings, view waveform and spectral representations of the audio, label clips with annotations, and view the results of machine learning classifiers based on automatic audio features extraction. In this paper we describe such classifiers that discriminate between background noise, orca calls, and the voice notes that are present in most of the tapes. Furthermore we show classification results for individual calls based on a previously existing orca call catalog. We have also experimentally investigated the scalability of classifiers over the entire Orchive. Comment: ICML 2013 Workshop on Machine Learning for Bioacoustics Text Orca ArXiv.org (Cornell University Library)
institution	Open Polar
collection	ArXiv.org (Cornell University Library)
op_collection_id	ftarxivpreprints
language	unknown
topic	Computer Science - Machine Learning Computer Science - Databases Computer Science - Sound
spellingShingle	Computer Science - Machine Learning Computer Science - Databases Computer Science - Sound Ness, Steven Symonds, Helena Spong, Paul Tzanetakis, George The Orchive : Data mining a massive bioacoustic archive
topic_facet	Computer Science - Machine Learning Computer Science - Databases Computer Science - Sound
description	The Orchive is a large collection of over 20,000 hours of audio recordings from the OrcaLab research facility located off the northern tip of Vancouver Island. It contains recorded orca vocalizations from the 1980 to the present time and is one of the largest resources of bioacoustic data in the world. We have developed a web-based interface that allows researchers to listen to these recordings, view waveform and spectral representations of the audio, label clips with annotations, and view the results of machine learning classifiers based on automatic audio features extraction. In this paper we describe such classifiers that discriminate between background noise, orca calls, and the voice notes that are present in most of the tapes. Furthermore we show classification results for individual calls based on a previously existing orca call catalog. We have also experimentally investigated the scalability of classifiers over the entire Orchive. Comment: ICML 2013 Workshop on Machine Learning for Bioacoustics
format	Text
author	Ness, Steven Symonds, Helena Spong, Paul Tzanetakis, George
author_facet	Ness, Steven Symonds, Helena Spong, Paul Tzanetakis, George
author_sort	Ness, Steven
title	The Orchive : Data mining a massive bioacoustic archive
title_short	The Orchive : Data mining a massive bioacoustic archive
title_full	The Orchive : Data mining a massive bioacoustic archive
title_fullStr	The Orchive : Data mining a massive bioacoustic archive
title_full_unstemmed	The Orchive : Data mining a massive bioacoustic archive
title_sort	orchive : data mining a massive bioacoustic archive
publishDate	2013
url	http://arxiv.org/abs/1307.0589
genre	Orca
genre_facet	Orca
op_relation	http://arxiv.org/abs/1307.0589
_version_	1776202860404408320

The Orchive : Data mining a massive bioacoustic archive

Similar Items