Do Orcas Have Semantic Language? Machine Learning to Predict Orca Behaviors Using Partially Labeled Vocalization Data

Orcinus orca (killer whales) exhibit complex calls. They last about a second. In a call, an orca typically uses multiple frequencies simultaneously, varies the frequencies, and varies their volumes. Behavior data is hard to obtain because orcas live under water and travel quickly. Sound data is rela...


Bibliographic Details
Main Author: Sandholm, Sophia
Format: Text
Language: unknown
Published: 2023
Subjects:
Online Access: http://arxiv.org/abs/2302.10983
id ftarxivpreprints:oai:arXiv.org:2302.10983
record_format openpolar
spelling ftarxivpreprints:oai:arXiv.org:2302.10983 2023-09-05T13:21:01+02:00 Sandholm, Sophia 2023-01-28 http://arxiv.org/abs/2302.10983 2023-08-16T17:33:04Z
institution Open Polar
collection ArXiv.org (Cornell University Library)
op_collection_id ftarxivpreprints
language unknown
topic Computer Science - Sound
Computer Science - Computation and Language
Computer Science - Machine Learning
Electrical Engineering and Systems Science - Audio and Speech Processing
description Orcinus orca (killer whales) produce complex calls, each lasting about a second. Within a call, an orca typically uses multiple frequencies simultaneously and varies both the frequencies and their volumes. Behavior data is hard to obtain because orcas live underwater and travel quickly, whereas sound data is relatively easy to capture. As a science goal, we would like to know whether orca vocalizations constitute a semantic language. We investigate this by studying whether machine learning can predict behavior from vocalizations. Such prediction would also help scientific research and safety applications, because behavior could then be predicted from sound alone. A significant challenge in this process is the lack of labeled data. We work with recent recordings of McMurdo Sound orcas [Wellard et al. 2020], where each recording is labeled with the behaviors observed during the recording. This yields a dataset in which sound segments (continuous vocalizations that can be thought of as call sequences or more general structures) within the recordings are labeled with superfluous behaviors. Despite that, with a careful combination of recent machine learning techniques, we achieve 96.4% classification accuracy. This suggests that orcas do use a semantic language. It is also promising for research and applications.
format Text
author Sandholm, Sophia
title Do Orcas Have Semantic Language? Machine Learning to Predict Orca Behaviors Using Partially Labeled Vocalization Data
publishDate 2023
url http://arxiv.org/abs/2302.10983
geographic McMurdo Sound
genre McMurdo Sound
Orca
Orcinus orca
op_relation http://arxiv.org/abs/2302.10983