Exhaustive reanalysis of barcode sequences from public repositories highlights ongoing misidentifications and impacts taxa diversity and distribution.
International audience Accurate species identification often relies on public repositories to compare the barcode sequences of the investigated individual(s) with taxonomically assigned sequences. However, the accuracy of identifications in public repositories is often questionable, and the names or...
Published in: | Molecular Ecology Resources |
---|---|
Main Authors: | , , , , , , , , , , , , , , , , , |
Other Authors: | , , , , , , , , , , , , , , , , , , , |
Format: | Article in Journal/Newspaper |
Language: | English |
Published: |
HAL CCSD
2021
|
Subjects: | |
Online Access: | https://hal.sorbonne-universite.fr/hal-03268583 https://hal.sorbonne-universite.fr/hal-03268583/document https://hal.sorbonne-universite.fr/hal-03268583/file/1755-0998.13453.pdf https://doi.org/10.1111/1755-0998.13453 |
id |
ftunivnantes:oai:HAL:hal-03268583v1 |
---|---|
record_format |
openpolar |
institution |
Open Polar |
collection |
Université de Nantes: HAL-UNIV-NANTES |
op_collection_id |
ftunivnantes |
language |
English |
topic |
Ulva Aquaculture DNA barcoding Phylogeny Sea lettuce [SDE]Environmental Sciences |
spellingShingle |
Ulva Aquaculture DNA barcoding Phylogeny Sea lettuce [SDE]Environmental Sciences Fort, Antoine Mchale, Marcus Cascella, Kevin Potin, Philippe Perrineau, Marie-Mathilde Kerrison, Philip, da Costa, Elisabete Calado, Ricardo Domingues, Maria, Costa Azevedo, Isabel Sousa-Pinto, Isabel Gachon, Claire van Der Werf, Adrie de Visser, Willem Beniers, Johanna, Jansen, Henrice Guiry, Michael, Sulpice, Ronan Exhaustive reanalysis of barcode sequences from public repositories highlights ongoing misidentifications and impacts taxa diversity and distribution. |
topic_facet |
Ulva Aquaculture DNA barcoding Phylogeny Sea lettuce [SDE]Environmental Sciences |
description |
International audience Accurate species identification often relies on public repositories to compare the barcode sequences of the investigated individual(s) with taxonomically assigned sequences. However, the accuracy of identifications in public repositories is often questionable, and the names originally given are rarely updated. For instance, species of the Sea Lettuce (Ulva spp.; Ulvophyceae, Ulvales, Ulvaceae) are frequently misidentified in public repositories, including herbaria and gene banks, making species identification based on traditional barcoding unreliable. We DNA barcoded 295 individual distromatic foliose strains of Ulva from the North-East Atlantic for three loci (rbcL, tufA, ITS1). Seven distinct species were found, and we compared our results with all worldwide Ulva spp sequences present in the NCBI database for the three barcodes rbcL, tufA and the ITS1. Our results demonstrate a large degree of species misidentification, where we estimate that 24 to 32% of the entries pertaining to foliose species are misannotated and provide an exhaustive list of NCBI sequences reannotations. An analysis of the global distribution of registered samples from foliose species also indicates possible geographical isolation for some species, and the absence of U. lactuca from Northern Europe. We extended our analytical framework to three other genera, Fucus, Porphyra and Pyropia and also identified erroneously labelled accessions and possibly new synonymies, albeit less than for Ulva spp. Altogether, exhaustive taxonomic clarification by aggregation of a library of barcode sequences highlights misannotations and delivers an improved representation of species diversity and distribution. |
author2 |
National University of Ireland Galway (NUI Galway) Laboratoire de Biologie Intégrative des Modèles Marins (LBI2M) Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Station biologique de Roscoff (SBR) Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Centre National de la Recherche Scientifique (CNRS) Scottish Association for Marine Science (SAMS) Laboratorio Associado para a Quimica Verde (LAQV) Requimte Universidade do Porto = University of Porto-Departamento de Química (DQ) Faculdade de Ciências e Tecnologia = School of Science & Technology (FCT NOVA) Universidade Nova de Lisboa = NOVA University Lisbon (NOVA)-Universidade Nova de Lisboa = NOVA University Lisbon (NOVA)-Faculdade de Ciências e Tecnologia = School of Science & Technology (FCT NOVA) Universidade Nova de Lisboa = NOVA University Lisbon (NOVA)-Universidade Nova de Lisboa = NOVA University Lisbon (NOVA)-Universidade do Porto = University of Porto-Departamento de Química (DQ) Universidade Nova de Lisboa = NOVA University Lisbon (NOVA)-Universidade Nova de Lisboa = NOVA University Lisbon (NOVA) Centro de Estudos do Ambiante e do Mar (CESAM) Universidade de Aveiro Interdisciplinary Centre of Marine and Environmental Research Matosinhos, Portugal (CIIMAR) Universidade do Porto = University of Porto Molécules de Communication et Adaptation des Micro-organismes (MCAM) Muséum national d'Histoire naturelle (MNHN)-Centre National de la Recherche Scientifique (CNRS) Wageningen University and Research Wageningen (WUR) Martin Ryan Institute |
format |
Article in Journal/Newspaper |
author |
Fort, Antoine Mchale, Marcus Cascella, Kevin Potin, Philippe Perrineau, Marie-Mathilde Kerrison, Philip, da Costa, Elisabete Calado, Ricardo Domingues, Maria, Costa Azevedo, Isabel Sousa-Pinto, Isabel Gachon, Claire van Der Werf, Adrie de Visser, Willem Beniers, Johanna, Jansen, Henrice Guiry, Michael, Sulpice, Ronan |
author_facet |
Fort, Antoine Mchale, Marcus Cascella, Kevin Potin, Philippe Perrineau, Marie-Mathilde Kerrison, Philip, da Costa, Elisabete Calado, Ricardo Domingues, Maria, Costa Azevedo, Isabel Sousa-Pinto, Isabel Gachon, Claire van Der Werf, Adrie de Visser, Willem Beniers, Johanna, Jansen, Henrice Guiry, Michael, Sulpice, Ronan |
author_sort |
Fort, Antoine |
title |
Exhaustive reanalysis of barcode sequences from public repositories highlights ongoing misidentifications and impacts taxa diversity and distribution. |
title_short |
Exhaustive reanalysis of barcode sequences from public repositories highlights ongoing misidentifications and impacts taxa diversity and distribution. |
title_full |
Exhaustive reanalysis of barcode sequences from public repositories highlights ongoing misidentifications and impacts taxa diversity and distribution. |
title_fullStr |
Exhaustive reanalysis of barcode sequences from public repositories highlights ongoing misidentifications and impacts taxa diversity and distribution. |
title_full_unstemmed |
Exhaustive reanalysis of barcode sequences from public repositories highlights ongoing misidentifications and impacts taxa diversity and distribution. |
title_sort |
exhaustive reanalysis of barcode sequences from public repositories highlights ongoing misidentifications and impacts taxa diversity and distribution. |
publisher |
HAL CCSD |
publishDate |
2021 |
url |
https://hal.sorbonne-universite.fr/hal-03268583 https://hal.sorbonne-universite.fr/hal-03268583/document https://hal.sorbonne-universite.fr/hal-03268583/file/1755-0998.13453.pdf https://doi.org/10.1111/1755-0998.13453 |
genre |
North East Atlantic |
genre_facet |
North East Atlantic |
op_source |
ISSN: 1755-098X EISSN: 1755-0998 Molecular Ecology Resources https://hal.sorbonne-universite.fr/hal-03268583 Molecular Ecology Resources, Wiley/Blackwell, 2021, ⟨10.1111/1755-0998.13453⟩ |
op_relation |
info:eu-repo/semantics/altIdentifier/doi/10.1111/1755-0998.13453 info:eu-repo/semantics/altIdentifier/pmid/34153167 hal-03268583 https://hal.sorbonne-universite.fr/hal-03268583 https://hal.sorbonne-universite.fr/hal-03268583/document https://hal.sorbonne-universite.fr/hal-03268583/file/1755-0998.13453.pdf doi:10.1111/1755-0998.13453 PUBMED: 34153167 |
op_rights |
info:eu-repo/semantics/OpenAccess |
op_doi |
https://doi.org/10.1111/1755-0998.13453 |
container_title |
Molecular Ecology Resources |
container_volume |
22 |
container_issue |
1 |
container_start_page |
86 |
op_container_end_page |
101 |
_version_ |
1766139059359449088 |
spelling |
ftunivnantes:oai:HAL:hal-03268583v1 2023-05-15T17:38:34+02:00 Exhaustive reanalysis of barcode sequences from public repositories highlights ongoing misidentifications and impacts taxa diversity and distribution. Fort, Antoine Mchale, Marcus Cascella, Kevin Potin, Philippe Perrineau, Marie-Mathilde Kerrison, Philip, da Costa, Elisabete Calado, Ricardo Domingues, Maria, Costa Azevedo, Isabel Sousa-Pinto, Isabel Gachon, Claire van Der Werf, Adrie de Visser, Willem Beniers, Johanna, Jansen, Henrice Guiry, Michael, Sulpice, Ronan National University of Ireland Galway (NUI Galway) Laboratoire de Biologie Intégrative des Modèles Marins (LBI2M) Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Station biologique de Roscoff (SBR) Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Centre National de la Recherche Scientifique (CNRS) Scottish Association for Marine Science (SAMS) Laboratorio Associado para a Quimica Verde (LAQV) Requimte Universidade do Porto = University of Porto-Departamento de Química (DQ) Faculdade de Ciências e Tecnologia = School of Science & Technology (FCT NOVA) Universidade Nova de Lisboa = NOVA University Lisbon (NOVA)-Universidade Nova de Lisboa = NOVA University Lisbon (NOVA)-Faculdade de Ciências e Tecnologia = School of Science & Technology (FCT NOVA) Universidade Nova de Lisboa = NOVA University Lisbon (NOVA)-Universidade Nova de Lisboa = NOVA University Lisbon (NOVA)-Universidade do Porto = University of Porto-Departamento de Química (DQ) Universidade Nova de Lisboa = NOVA University Lisbon (NOVA)-Universidade Nova de Lisboa = NOVA University Lisbon (NOVA) Centro de Estudos do Ambiante e do Mar (CESAM) Universidade de Aveiro Interdisciplinary Centre of Marine and Environmental Research Matosinhos, Portugal (CIIMAR) Universidade do Porto = University of Porto Molécules de Communication et Adaptation des Micro-organismes (MCAM) Muséum national d'Histoire naturelle (MNHN)-Centre National de la Recherche Scientifique (CNRS) Wageningen University and Research Wageningen (WUR) Martin Ryan Institute 2021-06-21 https://hal.sorbonne-universite.fr/hal-03268583 https://hal.sorbonne-universite.fr/hal-03268583/document https://hal.sorbonne-universite.fr/hal-03268583/file/1755-0998.13453.pdf https://doi.org/10.1111/1755-0998.13453 en eng HAL CCSD Wiley/Blackwell info:eu-repo/semantics/altIdentifier/doi/10.1111/1755-0998.13453 info:eu-repo/semantics/altIdentifier/pmid/34153167 hal-03268583 https://hal.sorbonne-universite.fr/hal-03268583 https://hal.sorbonne-universite.fr/hal-03268583/document https://hal.sorbonne-universite.fr/hal-03268583/file/1755-0998.13453.pdf doi:10.1111/1755-0998.13453 PUBMED: 34153167 info:eu-repo/semantics/OpenAccess ISSN: 1755-098X EISSN: 1755-0998 Molecular Ecology Resources https://hal.sorbonne-universite.fr/hal-03268583 Molecular Ecology Resources, Wiley/Blackwell, 2021, ⟨10.1111/1755-0998.13453⟩ Ulva Aquaculture DNA barcoding Phylogeny Sea lettuce [SDE]Environmental Sciences info:eu-repo/semantics/article Journal articles 2021 ftunivnantes https://doi.org/10.1111/1755-0998.13453 2022-09-06T23:24:58Z International audience Accurate species identification often relies on public repositories to compare the barcode sequences of the investigated individual(s) with taxonomically assigned sequences. However, the accuracy of identifications in public repositories is often questionable, and the names originally given are rarely updated. For instance, species of the Sea Lettuce (Ulva spp.; Ulvophyceae, Ulvales, Ulvaceae) are frequently misidentified in public repositories, including herbaria and gene banks, making species identification based on traditional barcoding unreliable. We DNA barcoded 295 individual distromatic foliose strains of Ulva from the North-East Atlantic for three loci (rbcL, tufA, ITS1). Seven distinct species were found, and we compared our results with all worldwide Ulva spp sequences present in the NCBI database for the three barcodes rbcL, tufA and the ITS1. Our results demonstrate a large degree of species misidentification, where we estimate that 24 to 32% of the entries pertaining to foliose species are misannotated and provide an exhaustive list of NCBI sequences reannotations. An analysis of the global distribution of registered samples from foliose species also indicates possible geographical isolation for some species, and the absence of U. lactuca from Northern Europe. We extended our analytical framework to three other genera, Fucus, Porphyra and Pyropia and also identified erroneously labelled accessions and possibly new synonymies, albeit less than for Ulva spp. Altogether, exhaustive taxonomic clarification by aggregation of a library of barcode sequences highlights misannotations and delivers an improved representation of species diversity and distribution. Article in Journal/Newspaper North East Atlantic Université de Nantes: HAL-UNIV-NANTES Molecular Ecology Resources 22 1 86 101 |