Data from: Processing citizen science- and machine-annotated time-lapse imagery for biologically meaningful metrics

Time-lapse cameras facilitate remote and high-resolution monitoring of wild animal and plant communities, but the image data produced require further processing to be useful. Here we publish pipelines to process raw time-lapse imagery, resulting in count data (number of penguins per image) and '...

Full description

Bibliographic Details
Main Authors: Jones, Fiona M., Arteta, Carlos, Zisserman, Andrew, Lempitsky, Victor, Lintott, Chris J., Hart, Tom
Format: Dataset
Language:unknown
Published: 2020
Subjects:
Online Access:https://zenodo.org/record/5105142
https://doi.org/10.5061/dryad.94sp17b
Description
Summary:Time-lapse cameras facilitate remote and high-resolution monitoring of wild animal and plant communities, but the image data produced require further processing to be useful. Here we publish pipelines to process raw time-lapse imagery, resulting in count data (number of penguins per image) and 'nearest neighbour distance' measurements. The latter provide useful summaries of colony spatial structure (which can indicate phenological stage) and can be used to detect movement – metrics which could be valuable for a number of different monitoring scenarios, including image capture during aerial surveys. We present two alternative pathways for producing counts: 1) via the Zooniverse citizen science project Penguin Watch and 2) via a computer vision algorithm (Pengbot), and share a comparison of citizen science-, machine learning-, and expert- derived counts. We provide example files for 14 Penguin Watch cameras, generated from 63,070 raw images annotated by 50,445 volunteers. We encourage the use of this large open-source dataset, and the associated processing methodologies, for both ecological studies and continued machine learning and computer vision development. Penguin Watch ManifestThis file contains metadata for each of the 63,070 different images, captured by 14 Penguin Watch time-lapse cameras, included in the repository found at: DOI: https://doi.org/10.5061/dryad.vv36g. The following variables are provided: image name, date/time, Zooniverse ID, path, classification count, state, temperature in Fahrenheit, lunar phase, and a URL link to a thumbnail image. Please see the associated paper for an explanation of these variables.PW_Manifest.csvKraken FilesThis folder contains 14 'Kraken Files' - one for each of the time-lapse cameras described in Jones et al. (2018); DOI: https://doi.org/10.1038/sdata.2018.124. These 'Kraken Files' contain filtered Penguin Watch 'consensus click' data combined with metadata (date/time, temperature in Fahrenheit, lunar phase, and a URL link to a thumbnail image) for the 63,070 ...