Training dataset for object detection - Penguins from UAV

Progress Code: completed Statement: The dataset has an overall good quality, However, when detecting penguins from UAV, it is sometimes difficult to distinguish grey-feathered chicks from rocks, which present similar colours. In this sense, the model can predict rocky coasts as a penguin-abundant zo...

Full description

Bibliographic Details
Format: Dataset
Language:unknown
Published: Australian Ocean Data Network
Subjects:
AMD
Online Access:https://researchdata.edu.au/training-dataset-object-penguins-uav/2823165
Description
Summary:Progress Code: completed Statement: The dataset has an overall good quality, However, when detecting penguins from UAV, it is sometimes difficult to distinguish grey-feathered chicks from rocks, which present similar colours. In this sense, the model can predict rocky coasts as a penguin-abundant zones. Purpose This dataset was created to perform Chinstrap penguin detection from Vapour Col colony, Deception Island, enabling an efficient quantification of the population of this species within the colony. Subsequently it was used in the article "The contribution of penguin guano to the Southern Ocean iron pool". On February 8, 2021, Deception Island Chinstrap penguin colonies were photographed during the PiMetAn Project XXXIV Spanish Antarctic campaign using unmanned aerial vehicles (UAV) at a height of 30m. From the obtained imagery, a training dataset for penguin detection from aerial perspective was generated. The penguin species is the Chinstrap penguin (Pygoscelis antarcticus). The dataset consists of three folders: "train", containing 531 images, intended for model training; "valid", containing 50 images, intended for model validation; and "test", containing 25 images, intended for model testing. In each of the three folders, an additional .csv file is located, containing labels (x,y positions and class names for every penguin in the images), annotated in Tensorflow Object Detection format. There is only one annotation class: Penguin. All 606 images are 224x224 px in size, and 96 dpi. The following augmentation was applied to create 3 versions of each source image: * Random shear of between -18° to +18° horizontally and -11° to +11° vertically This dataset was annotated and exported via www.roboflow.com The model Faster R-CNN64 with ResNet-101 backbone was used to perform object detection tasks. Training and evaluation tasks were performed using the TensorFlow 2.0 machine learning platform by Google.