Orcasound Workflow

The Ocean Observatories Initiative(OOI) through a network of sensors, supports critical research in ocean science and marine life. Orcasound is a community driven project that leverages hydrophone sensors deployed in three locations in the state of Washington (SanJuan Island, Point Bush, and Port To...

Full description

Bibliographic Details
Main Author: , George
Format: Article in Journal/Newspaper
Language:unknown
Published: Zenodo 2022
Subjects:
Online Access:https://dx.doi.org/10.5281/zenodo.5889225
https://zenodo.org/record/5889225
id ftdatacite:10.5281/zenodo.5889225
record_format openpolar
spelling ftdatacite:10.5281/zenodo.5889225 2023-05-15T17:53:38+02:00 Orcasound Workflow , George 2022 https://dx.doi.org/10.5281/zenodo.5889225 https://zenodo.org/record/5889225 unknown Zenodo https://dx.doi.org/10.5281/zenodo.5889224 Open Access Creative Commons Attribution 4.0 International https://creativecommons.org/licenses/by/4.0/legalcode cc-by-4.0 info:eu-repo/semantics/openAccess CC-BY article Software SoftwareSourceCode 2022 ftdatacite https://doi.org/10.5281/zenodo.5889225 https://doi.org/10.5281/zenodo.5889224 2022-02-09T13:10:00Z The Ocean Observatories Initiative(OOI) through a network of sensors, supports critical research in ocean science and marine life. Orcasound is a community driven project that leverages hydrophone sensors deployed in three locations in the state of Washington (SanJuan Island, Point Bush, and Port Townsend) in order to study Orca whales in the Pacific Northwest region. Throughout the course of this project, code to process and analyze the hydrophone data has been developed, and machine learning models have been trained to automatically identify the whistles of the Orcas. All of the code is available publicly on GitHub, and the hydrophone data are free to access, stored in an AWS bucket. In this paper, we have developed an Orcasound pipeline using Pegasus. This version of the pipeline is based on the GitHub Actions Orcasound workflow ,and incorporates inference components of the OrcaHello AI notification system. The Orcasound Pegasus workflow processes the hydrophone data of one or more sensors in batches for each timestamp, and converts them to a WAV format. Using the WAV output it creates spectrogram images that are stored in the final output location. Furthermore, using the pre trained Orca sound model, the workflow scans the WAV files to identify potential sounds produced by the orcas. These predictions are merged in a JSON file for each sensor, and if data from more than one sensor are being processed the workflow will create a final merged JSON output for all.In our experiments we used data from a single hydrophone sensor over the span of a day. The workflow consumed 8641recordings with a total size of 1.5GBs and median size of 181KB/s. Article in Journal/Newspaper Orca DataCite Metadata Store (German National Library of Science and Technology) Orca Sound ENVELOPE(-58.000,-58.000,-61.883,-61.883) Pacific
institution Open Polar
collection DataCite Metadata Store (German National Library of Science and Technology)
op_collection_id ftdatacite
language unknown
description The Ocean Observatories Initiative(OOI) through a network of sensors, supports critical research in ocean science and marine life. Orcasound is a community driven project that leverages hydrophone sensors deployed in three locations in the state of Washington (SanJuan Island, Point Bush, and Port Townsend) in order to study Orca whales in the Pacific Northwest region. Throughout the course of this project, code to process and analyze the hydrophone data has been developed, and machine learning models have been trained to automatically identify the whistles of the Orcas. All of the code is available publicly on GitHub, and the hydrophone data are free to access, stored in an AWS bucket. In this paper, we have developed an Orcasound pipeline using Pegasus. This version of the pipeline is based on the GitHub Actions Orcasound workflow ,and incorporates inference components of the OrcaHello AI notification system. The Orcasound Pegasus workflow processes the hydrophone data of one or more sensors in batches for each timestamp, and converts them to a WAV format. Using the WAV output it creates spectrogram images that are stored in the final output location. Furthermore, using the pre trained Orca sound model, the workflow scans the WAV files to identify potential sounds produced by the orcas. These predictions are merged in a JSON file for each sensor, and if data from more than one sensor are being processed the workflow will create a final merged JSON output for all.In our experiments we used data from a single hydrophone sensor over the span of a day. The workflow consumed 8641recordings with a total size of 1.5GBs and median size of 181KB/s.
format Article in Journal/Newspaper
author , George
spellingShingle , George
Orcasound Workflow
author_facet , George
author_sort , George
title Orcasound Workflow
title_short Orcasound Workflow
title_full Orcasound Workflow
title_fullStr Orcasound Workflow
title_full_unstemmed Orcasound Workflow
title_sort orcasound workflow
publisher Zenodo
publishDate 2022
url https://dx.doi.org/10.5281/zenodo.5889225
https://zenodo.org/record/5889225
long_lat ENVELOPE(-58.000,-58.000,-61.883,-61.883)
geographic Orca Sound
Pacific
geographic_facet Orca Sound
Pacific
genre Orca
genre_facet Orca
op_relation https://dx.doi.org/10.5281/zenodo.5889224
op_rights Open Access
Creative Commons Attribution 4.0 International
https://creativecommons.org/licenses/by/4.0/legalcode
cc-by-4.0
info:eu-repo/semantics/openAccess
op_rightsnorm CC-BY
op_doi https://doi.org/10.5281/zenodo.5889225
https://doi.org/10.5281/zenodo.5889224
_version_ 1766161334991323136