Automatic sorting of Dwarf Minke Whale underwater images

Abstract: Apredictableaggregationofdwarfminkewhales(Balaenopteraacutorostratasubspecies) occurs annually in the Australian waters of the northern Great Barrier Reef in June–July, which has been the subject of a long-term photo-identification study. Researchers from the Minke Whale Project (MWP) at J...

Full description

Bibliographic Details
Published in:Information
Main Authors: Konovalov, Dmitry A., Swinhoe, Natalie, Efremova, Dina B., Birtles, R. Alastair, Kusetic, Martha, Hillcoat, Suzanne, Curnock, Matthew I., Williams, Genevieve, Sheaves, Marcus
Format: Article in Journal/Newspaper
Language:unknown
Published: Molecular Diversity Preservation International (MDPI) 2020
Subjects:
Online Access:https://researchonline.jcu.edu.au/62875/1/information-11-00200.pdf
Description
Summary:Abstract: Apredictableaggregationofdwarfminkewhales(Balaenopteraacutorostratasubspecies) occurs annually in the Australian waters of the northern Great Barrier Reef in June–July, which has been the subject of a long-term photo-identification study. Researchers from the Minke Whale Project (MWP) at James Cook University collect large volumes of underwater digital imagery each season (e.g., 1.8TB in 2018), much of which is contributed by citizen scientists. Manual processing and analysis of this quantity of data had become infeasible, and Convolutional Neural Networks (CNNs) offered a potential solution. Our study sought to design and train a CNN that could detect whales from video footage in complex near-surface underwater surroundings and differentiate the whales from people, boats and recreational gear. We modified known classification CNNs to localise whales in video frames and digital still images. The required high classification accuracy was achieved by discovering an effective negative-labelling training technique. This resulted in a less than 1% false-positive classification rate and below 0.1% false-negative rate. The final operation-version CNN-pipeline processed all videos (with the interval of 10 frames) in approximately four days (running on two GPUs) delivering 1.95 million sorted images.