Orcasound Workflow
The Ocean Observatories Initiative(OOI) through a network of sensors, supports critical research in ocean science and marine life. Orcasound is a community driven project that leverages hydrophone sensors deployed in three locations in the state of Washington (SanJuan Island, Point Bush, and Port To...
Main Author: | |
---|---|
Format: | Article in Journal/Newspaper |
Language: | unknown |
Published: |
Zenodo
2022
|
Subjects: | |
Online Access: | https://dx.doi.org/10.5281/zenodo.5889224 https://zenodo.org/record/5889224 |
id |
ftdatacite:10.5281/zenodo.5889224 |
---|---|
record_format |
openpolar |
spelling |
ftdatacite:10.5281/zenodo.5889224 2023-05-15T17:53:38+02:00 Orcasound Workflow , George 2022 https://dx.doi.org/10.5281/zenodo.5889224 https://zenodo.org/record/5889224 unknown Zenodo https://dx.doi.org/10.5281/zenodo.5889225 Open Access Creative Commons Attribution 4.0 International https://creativecommons.org/licenses/by/4.0/legalcode cc-by-4.0 info:eu-repo/semantics/openAccess CC-BY article Software SoftwareSourceCode 2022 ftdatacite https://doi.org/10.5281/zenodo.5889224 https://doi.org/10.5281/zenodo.5889225 2022-02-09T13:10:00Z The Ocean Observatories Initiative(OOI) through a network of sensors, supports critical research in ocean science and marine life. Orcasound is a community driven project that leverages hydrophone sensors deployed in three locations in the state of Washington (SanJuan Island, Point Bush, and Port Townsend) in order to study Orca whales in the Pacific Northwest region. Throughout the course of this project, code to process and analyze the hydrophone data has been developed, and machine learning models have been trained to automatically identify the whistles of the Orcas. All of the code is available publicly on GitHub, and the hydrophone data are free to access, stored in an AWS bucket. In this paper, we have developed an Orcasound pipeline using Pegasus. This version of the pipeline is based on the GitHub Actions Orcasound workflow ,and incorporates inference components of the OrcaHello AI notification system. The Orcasound Pegasus workflow processes the hydrophone data of one or more sensors in batches for each timestamp, and converts them to a WAV format. Using the WAV output it creates spectrogram images that are stored in the final output location. Furthermore, using the pre trained Orca sound model, the workflow scans the WAV files to identify potential sounds produced by the orcas. These predictions are merged in a JSON file for each sensor, and if data from more than one sensor are being processed the workflow will create a final merged JSON output for all.In our experiments we used data from a single hydrophone sensor over the span of a day. The workflow consumed 8641recordings with a total size of 1.5GBs and median size of 181KB/s. Article in Journal/Newspaper Orca DataCite Metadata Store (German National Library of Science and Technology) Orca Sound ENVELOPE(-58.000,-58.000,-61.883,-61.883) Pacific |
institution |
Open Polar |
collection |
DataCite Metadata Store (German National Library of Science and Technology) |
op_collection_id |
ftdatacite |
language |
unknown |
description |
The Ocean Observatories Initiative(OOI) through a network of sensors, supports critical research in ocean science and marine life. Orcasound is a community driven project that leverages hydrophone sensors deployed in three locations in the state of Washington (SanJuan Island, Point Bush, and Port Townsend) in order to study Orca whales in the Pacific Northwest region. Throughout the course of this project, code to process and analyze the hydrophone data has been developed, and machine learning models have been trained to automatically identify the whistles of the Orcas. All of the code is available publicly on GitHub, and the hydrophone data are free to access, stored in an AWS bucket. In this paper, we have developed an Orcasound pipeline using Pegasus. This version of the pipeline is based on the GitHub Actions Orcasound workflow ,and incorporates inference components of the OrcaHello AI notification system. The Orcasound Pegasus workflow processes the hydrophone data of one or more sensors in batches for each timestamp, and converts them to a WAV format. Using the WAV output it creates spectrogram images that are stored in the final output location. Furthermore, using the pre trained Orca sound model, the workflow scans the WAV files to identify potential sounds produced by the orcas. These predictions are merged in a JSON file for each sensor, and if data from more than one sensor are being processed the workflow will create a final merged JSON output for all.In our experiments we used data from a single hydrophone sensor over the span of a day. The workflow consumed 8641recordings with a total size of 1.5GBs and median size of 181KB/s. |
format |
Article in Journal/Newspaper |
author |
, George |
spellingShingle |
, George Orcasound Workflow |
author_facet |
, George |
author_sort |
, George |
title |
Orcasound Workflow |
title_short |
Orcasound Workflow |
title_full |
Orcasound Workflow |
title_fullStr |
Orcasound Workflow |
title_full_unstemmed |
Orcasound Workflow |
title_sort |
orcasound workflow |
publisher |
Zenodo |
publishDate |
2022 |
url |
https://dx.doi.org/10.5281/zenodo.5889224 https://zenodo.org/record/5889224 |
long_lat |
ENVELOPE(-58.000,-58.000,-61.883,-61.883) |
geographic |
Orca Sound Pacific |
geographic_facet |
Orca Sound Pacific |
genre |
Orca |
genre_facet |
Orca |
op_relation |
https://dx.doi.org/10.5281/zenodo.5889225 |
op_rights |
Open Access Creative Commons Attribution 4.0 International https://creativecommons.org/licenses/by/4.0/legalcode cc-by-4.0 info:eu-repo/semantics/openAccess |
op_rightsnorm |
CC-BY |
op_doi |
https://doi.org/10.5281/zenodo.5889224 https://doi.org/10.5281/zenodo.5889225 |
_version_ |
1766161335158046720 |