Endless Forams: >34,000 modern planktonic foraminiferal images for taxonomic training and automated species recognition using convolutional neural networks

Planktonic foraminiferal species identification is central to many paleoceanographic studies, from selecting species for geochemical research to elucidating the biotic dynamics of microfossil communities relevant to physical oceanographic processes and interconnected phenomena such as climate change...

Full description

Bibliographic Details
Published in:Paleoceanography and Paleoclimatology
Main Authors: Hsiang, Allison Y., Brombacher, Anieke, Rillo, Marina C., Mleneck‐vautravers, Maryline J., Conn, Stephen, Lordsmith, Sian, Jentzen, Anna, Henehan, Michael J., Metcalfe, Brett, Fenton, Isabel S., Wade, Bridget S., Fox, Lyndsey, Meilland, Julie, Davis, Catherine V., Baranowski, Ulrike, Groeneveld, Jeroen, Edgar, Kirsty M., Movellan, Aurore, Aze, Tracy, Dowsett, Harry J., Miller, C. Giles, Rios, Nelson, Hull, Pincelli M.
Format: Article in Journal/Newspaper
Language:English
Published: 2019
Subjects:
Online Access:https://eprints.soton.ac.uk/433625/
https://eprints.soton.ac.uk/433625/1/Hsiang_et_al_2019_Paleoceanography_and_Paleoclimatology.pdf
id ftsouthampton:oai:eprints.soton.ac.uk:433625
record_format openpolar
spelling ftsouthampton:oai:eprints.soton.ac.uk:433625 2024-02-11T10:08:01+01:00 Endless Forams: >34,000 modern planktonic foraminiferal images for taxonomic training and automated species recognition using convolutional neural networks Hsiang, Allison Y. Brombacher, Anieke Rillo, Marina C. Mleneck‐vautravers, Maryline J. Conn, Stephen Lordsmith, Sian Jentzen, Anna Henehan, Michael J. Metcalfe, Brett Fenton, Isabel S. Wade, Bridget S. Fox, Lyndsey Meilland, Julie Davis, Catherine V. Baranowski, Ulrike Groeneveld, Jeroen Edgar, Kirsty M. Movellan, Aurore Aze, Tracy Dowsett, Harry J. Miller, C. Giles Rios, Nelson Hull, Pincelli M. 2019-08-13 text https://eprints.soton.ac.uk/433625/ https://eprints.soton.ac.uk/433625/1/Hsiang_et_al_2019_Paleoceanography_and_Paleoclimatology.pdf en English eng https://eprints.soton.ac.uk/433625/1/Hsiang_et_al_2019_Paleoceanography_and_Paleoclimatology.pdf Hsiang, Allison Y., Brombacher, Anieke, Rillo, Marina C., Mleneck‐vautravers, Maryline J., Conn, Stephen, Lordsmith, Sian, Jentzen, Anna, Henehan, Michael J., Metcalfe, Brett, Fenton, Isabel S., Wade, Bridget S., Fox, Lyndsey, Meilland, Julie, Davis, Catherine V., Baranowski, Ulrike, Groeneveld, Jeroen, Edgar, Kirsty M., Movellan, Aurore, Aze, Tracy, Dowsett, Harry J., Miller, C. Giles, Rios, Nelson and Hull, Pincelli M. (2019) Endless Forams: >34,000 modern planktonic foraminiferal images for taxonomic training and automated species recognition using convolutional neural networks. Paleoceanography and Paleoclimatology, 34 (7), 1157-1177. (doi:10.1029/2019PA003612 <http://dx.doi.org/10.1029/2019PA003612>). cc_by_nc_nd_4 Article PeerReviewed 2019 ftsouthampton https://doi.org/10.1029/2019PA003612 2024-01-25T23:19:37Z Planktonic foraminiferal species identification is central to many paleoceanographic studies, from selecting species for geochemical research to elucidating the biotic dynamics of microfossil communities relevant to physical oceanographic processes and interconnected phenomena such as climate change. However, few resources exist to train students in the difficult task of discerning amongst closely related species, resulting in diverging taxonomic schools that differ in species concepts and boundaries. This problem is exacerbated by the limited number of taxonomic experts. Here we document our initial progress toward removing these confounding and/or rate‐limiting factors by generating the first extensive image library of modern planktonic foraminifera, providing digital taxonomic training tools and resources, and automating species‐level taxonomic identification of planktonic foraminifera via machine learning using convolution neural networks. Experts identified 34,640 images of modern (extant) planktonic foraminifera to the species level. These images are served as species exemplars through the online portal Endless Forams (endlessforams.org) and a taxonomic training portal hosted on the citizen science platform Zooniverse (zooniverse.org/projects/ahsiang/endless‐forams/). A supervised machine learning classifier was then trained with ~27,000 images of these identified planktonic foraminifera. The best‐performing model provided the correct species name for an image in the validation set 87.4% of the time and included the correct name in its top three guesses 97.7% of the time. Together, these resources provide a rigorous set of training tools in modern planktonic foraminiferal taxonomy and a means of rapidly generating assemblage data via machine learning in future studies for applications such as paleotemperature reconstruction. Article in Journal/Newspaper Planktonic foraminifera University of Southampton: e-Prints Soton Paleoceanography and Paleoclimatology 34 7 1157 1177
institution Open Polar
collection University of Southampton: e-Prints Soton
op_collection_id ftsouthampton
language English
description Planktonic foraminiferal species identification is central to many paleoceanographic studies, from selecting species for geochemical research to elucidating the biotic dynamics of microfossil communities relevant to physical oceanographic processes and interconnected phenomena such as climate change. However, few resources exist to train students in the difficult task of discerning amongst closely related species, resulting in diverging taxonomic schools that differ in species concepts and boundaries. This problem is exacerbated by the limited number of taxonomic experts. Here we document our initial progress toward removing these confounding and/or rate‐limiting factors by generating the first extensive image library of modern planktonic foraminifera, providing digital taxonomic training tools and resources, and automating species‐level taxonomic identification of planktonic foraminifera via machine learning using convolution neural networks. Experts identified 34,640 images of modern (extant) planktonic foraminifera to the species level. These images are served as species exemplars through the online portal Endless Forams (endlessforams.org) and a taxonomic training portal hosted on the citizen science platform Zooniverse (zooniverse.org/projects/ahsiang/endless‐forams/). A supervised machine learning classifier was then trained with ~27,000 images of these identified planktonic foraminifera. The best‐performing model provided the correct species name for an image in the validation set 87.4% of the time and included the correct name in its top three guesses 97.7% of the time. Together, these resources provide a rigorous set of training tools in modern planktonic foraminiferal taxonomy and a means of rapidly generating assemblage data via machine learning in future studies for applications such as paleotemperature reconstruction.
format Article in Journal/Newspaper
author Hsiang, Allison Y.
Brombacher, Anieke
Rillo, Marina C.
Mleneck‐vautravers, Maryline J.
Conn, Stephen
Lordsmith, Sian
Jentzen, Anna
Henehan, Michael J.
Metcalfe, Brett
Fenton, Isabel S.
Wade, Bridget S.
Fox, Lyndsey
Meilland, Julie
Davis, Catherine V.
Baranowski, Ulrike
Groeneveld, Jeroen
Edgar, Kirsty M.
Movellan, Aurore
Aze, Tracy
Dowsett, Harry J.
Miller, C. Giles
Rios, Nelson
Hull, Pincelli M.
spellingShingle Hsiang, Allison Y.
Brombacher, Anieke
Rillo, Marina C.
Mleneck‐vautravers, Maryline J.
Conn, Stephen
Lordsmith, Sian
Jentzen, Anna
Henehan, Michael J.
Metcalfe, Brett
Fenton, Isabel S.
Wade, Bridget S.
Fox, Lyndsey
Meilland, Julie
Davis, Catherine V.
Baranowski, Ulrike
Groeneveld, Jeroen
Edgar, Kirsty M.
Movellan, Aurore
Aze, Tracy
Dowsett, Harry J.
Miller, C. Giles
Rios, Nelson
Hull, Pincelli M.
Endless Forams: >34,000 modern planktonic foraminiferal images for taxonomic training and automated species recognition using convolutional neural networks
author_facet Hsiang, Allison Y.
Brombacher, Anieke
Rillo, Marina C.
Mleneck‐vautravers, Maryline J.
Conn, Stephen
Lordsmith, Sian
Jentzen, Anna
Henehan, Michael J.
Metcalfe, Brett
Fenton, Isabel S.
Wade, Bridget S.
Fox, Lyndsey
Meilland, Julie
Davis, Catherine V.
Baranowski, Ulrike
Groeneveld, Jeroen
Edgar, Kirsty M.
Movellan, Aurore
Aze, Tracy
Dowsett, Harry J.
Miller, C. Giles
Rios, Nelson
Hull, Pincelli M.
author_sort Hsiang, Allison Y.
title Endless Forams: >34,000 modern planktonic foraminiferal images for taxonomic training and automated species recognition using convolutional neural networks
title_short Endless Forams: >34,000 modern planktonic foraminiferal images for taxonomic training and automated species recognition using convolutional neural networks
title_full Endless Forams: >34,000 modern planktonic foraminiferal images for taxonomic training and automated species recognition using convolutional neural networks
title_fullStr Endless Forams: >34,000 modern planktonic foraminiferal images for taxonomic training and automated species recognition using convolutional neural networks
title_full_unstemmed Endless Forams: >34,000 modern planktonic foraminiferal images for taxonomic training and automated species recognition using convolutional neural networks
title_sort endless forams: >34,000 modern planktonic foraminiferal images for taxonomic training and automated species recognition using convolutional neural networks
publishDate 2019
url https://eprints.soton.ac.uk/433625/
https://eprints.soton.ac.uk/433625/1/Hsiang_et_al_2019_Paleoceanography_and_Paleoclimatology.pdf
genre Planktonic foraminifera
genre_facet Planktonic foraminifera
op_relation https://eprints.soton.ac.uk/433625/1/Hsiang_et_al_2019_Paleoceanography_and_Paleoclimatology.pdf
Hsiang, Allison Y., Brombacher, Anieke, Rillo, Marina C., Mleneck‐vautravers, Maryline J., Conn, Stephen, Lordsmith, Sian, Jentzen, Anna, Henehan, Michael J., Metcalfe, Brett, Fenton, Isabel S., Wade, Bridget S., Fox, Lyndsey, Meilland, Julie, Davis, Catherine V., Baranowski, Ulrike, Groeneveld, Jeroen, Edgar, Kirsty M., Movellan, Aurore, Aze, Tracy, Dowsett, Harry J., Miller, C. Giles, Rios, Nelson and Hull, Pincelli M. (2019) Endless Forams: >34,000 modern planktonic foraminiferal images for taxonomic training and automated species recognition using convolutional neural networks. Paleoceanography and Paleoclimatology, 34 (7), 1157-1177. (doi:10.1029/2019PA003612 <http://dx.doi.org/10.1029/2019PA003612>).
op_rights cc_by_nc_nd_4
op_doi https://doi.org/10.1029/2019PA003612
container_title Paleoceanography and Paleoclimatology
container_volume 34
container_issue 7
container_start_page 1157
op_container_end_page 1177
_version_ 1790606930966740992