Orcasound Workflow

The Ocean Observatories Initiative(OOI) through a network of sensors, supports critical research in ocean science and marine life. Orcasoundis a community driven project that leverages hydrophone sensors deployed in three locations in the state of Washington (SanJuan Island, Point Bush, and Port Tow...

Full description

Bibliographic Details
Main Author: George
Format: Other/Unknown Material
Language:unknown
Published: Zenodo 2022
Subjects:
Online Access:https://doi.org/10.5281/zenodo.5889225
id ftzenodo:oai:zenodo.org:5889225
record_format openpolar
spelling ftzenodo:oai:zenodo.org:5889225 2024-09-15T18:28:53+00:00 Orcasound Workflow George 2022-01-21 https://doi.org/10.5281/zenodo.5889225 unknown Zenodo https://doi.org/10.5281/zenodo.5889224 https://doi.org/10.5281/zenodo.5889225 oai:zenodo.org:5889225 info:eu-repo/semantics/openAccess Creative Commons Attribution 4.0 International https://creativecommons.org/licenses/by/4.0/legalcode info:eu-repo/semantics/other 2022 ftzenodo https://doi.org/10.5281/zenodo.588922510.5281/zenodo.5889224 2024-07-26T13:38:57Z The Ocean Observatories Initiative(OOI) through a network of sensors, supports critical research in ocean science and marine life. Orcasoundis a community driven project that leverages hydrophone sensors deployed in three locations in the state of Washington (SanJuan Island, Point Bush, and Port Townsend) in order to study Orca whales in the Pacific Northwest region. Throughout the course of this project, code to process and analyze the hydrophone data has been developed, and machine learning models have been trained to automatically identify the whistles of the Orcas. All of the code is available publicly onGitHub, and the hydrophone data are free to access, stored in an AWS bucket. In this paper, we have developed an Orcasound pipeline using Pegasus. This version of the pipeline is based on the GitHub Actions Orcasound workflow ,and incorporates inference components of the OrcaHello AI notification system.The Orcasound Pegasus workflow processes the hydrophone data of one or more sensors in batches for each timestamp, and converts them to a WAV format. Using the WAV output it creates spectrogram images that are stored in the final output location. Furthermore, using the pre trained Orca sound model, the workflow scans the WAV files to identify potential sounds produced by the orcas. These predictions are merged in a JSON file for each sensor, and if data from more than one sensor are being processed the workflow will create a final merged JSON output for all.In our experiments we used data from a single hydrophone sensor over the span of a day. The workflow consumed 8641recordings with a total size of 1.5GBs and median size of 181KB/s. Other/Unknown Material Orca Zenodo
institution Open Polar
collection Zenodo
op_collection_id ftzenodo
language unknown
description The Ocean Observatories Initiative(OOI) through a network of sensors, supports critical research in ocean science and marine life. Orcasoundis a community driven project that leverages hydrophone sensors deployed in three locations in the state of Washington (SanJuan Island, Point Bush, and Port Townsend) in order to study Orca whales in the Pacific Northwest region. Throughout the course of this project, code to process and analyze the hydrophone data has been developed, and machine learning models have been trained to automatically identify the whistles of the Orcas. All of the code is available publicly onGitHub, and the hydrophone data are free to access, stored in an AWS bucket. In this paper, we have developed an Orcasound pipeline using Pegasus. This version of the pipeline is based on the GitHub Actions Orcasound workflow ,and incorporates inference components of the OrcaHello AI notification system.The Orcasound Pegasus workflow processes the hydrophone data of one or more sensors in batches for each timestamp, and converts them to a WAV format. Using the WAV output it creates spectrogram images that are stored in the final output location. Furthermore, using the pre trained Orca sound model, the workflow scans the WAV files to identify potential sounds produced by the orcas. These predictions are merged in a JSON file for each sensor, and if data from more than one sensor are being processed the workflow will create a final merged JSON output for all.In our experiments we used data from a single hydrophone sensor over the span of a day. The workflow consumed 8641recordings with a total size of 1.5GBs and median size of 181KB/s.
format Other/Unknown Material
author George
spellingShingle George
Orcasound Workflow
author_facet George
author_sort George
title Orcasound Workflow
title_short Orcasound Workflow
title_full Orcasound Workflow
title_fullStr Orcasound Workflow
title_full_unstemmed Orcasound Workflow
title_sort orcasound workflow
publisher Zenodo
publishDate 2022
url https://doi.org/10.5281/zenodo.5889225
genre Orca
genre_facet Orca
op_relation https://doi.org/10.5281/zenodo.5889224
https://doi.org/10.5281/zenodo.5889225
oai:zenodo.org:5889225
op_rights info:eu-repo/semantics/openAccess
Creative Commons Attribution 4.0 International
https://creativecommons.org/licenses/by/4.0/legalcode
op_doi https://doi.org/10.5281/zenodo.588922510.5281/zenodo.5889224
_version_ 1810470315711528960