Maximizing the reliability and the number of species assignments in metabarcoding studies using a curated regional library and a public repository

Biodiversity assessments relying on DNA have increased rapidly over the last decade. However, the reliability of taxonomic assignments in metabarcoding studies is variable and affected by the reference databases and the assignment methods used. Species level assignments are usually considered as rel...

Full description

Bibliographic Details
Published in:Metabarcoding and Metagenomics
Main Authors: Audrey Bourret, Claude Nozères, Eric Parent, Geneviève J. Parent
Format: Article in Journal/Newspaper
Language:English
Published: Pensoft Publishers 2023
Subjects:
Online Access:https://doi.org/10.3897/mbmg.7.98539
https://doaj.org/article/e58fc55d0d1a437db9148c10b9347038
id ftdoajarticles:oai:doaj.org/article:e58fc55d0d1a437db9148c10b9347038
record_format openpolar
spelling ftdoajarticles:oai:doaj.org/article:e58fc55d0d1a437db9148c10b9347038 2023-05-15T17:45:44+02:00 Maximizing the reliability and the number of species assignments in metabarcoding studies using a curated regional library and a public repository Audrey Bourret Claude Nozères Eric Parent Geneviève J. Parent 2023-02-01T00:00:00Z https://doi.org/10.3897/mbmg.7.98539 https://doaj.org/article/e58fc55d0d1a437db9148c10b9347038 EN eng Pensoft Publishers https://mbmg.pensoft.net/article/98539/download/pdf/ https://mbmg.pensoft.net/article/98539/download/xml/ https://mbmg.pensoft.net/article/98539/ https://doaj.org/toc/2534-9708 doi:10.3897/mbmg.7.98539 2534-9708 https://doaj.org/article/e58fc55d0d1a437db9148c10b9347038 Metabarcoding and Metagenomics, Vol 7, Iss , Pp 37-49 (2023) Ecology QH540-549.5 article 2023 ftdoajarticles https://doi.org/10.3897/mbmg.7.98539 2023-02-26T01:27:20Z Biodiversity assessments relying on DNA have increased rapidly over the last decade. However, the reliability of taxonomic assignments in metabarcoding studies is variable and affected by the reference databases and the assignment methods used. Species level assignments are usually considered as reliable using regional libraries but unreliable using public repositories. In this study, we aimed to test this assumption for metazoan species detected in the Gulf of St. Lawrence in the Northwest Atlantic. We first created a regional library (GSL-rl) by data mining COI barcode sequences from BOLD, and included a reliability ranking system for species assignments. We then estimated 1) the accuracy and precision of the public repository NCBI-nt for species assignments using sequences from the regional library and 2) compared the detection and reliability of species assignments of a metabarcoding dataset using either NCBI-nt or the regional library and popular assignment methods. With NCBI-nt and sequences from the regional library, the BLAST-LCA (least common ancestor) method was the most precise method for species assignments, but the accuracy was higher with the BLAST-TopHit method (>80% over all taxa, between 70% and 90% amongst taxonomic groups). With the metabarcoding dataset, the reliability of species assignments was greater using GSL-rl compared to NCBI-nt. However, we also observed that the total number of reliable species assignments could be maximized using both GSL-rl and NCBI-nt with different optimized assignment methods. The use of a two-step approach for species assignments, i.e., using a regional library and a public repository, could improve the reliability and the number of detected species in metabarcoding studies. Article in Journal/Newspaper Northwest Atlantic Directory of Open Access Journals: DOAJ Articles Metabarcoding and Metagenomics 7
institution Open Polar
collection Directory of Open Access Journals: DOAJ Articles
op_collection_id ftdoajarticles
language English
topic Ecology
QH540-549.5
spellingShingle Ecology
QH540-549.5
Audrey Bourret
Claude Nozères
Eric Parent
Geneviève J. Parent
Maximizing the reliability and the number of species assignments in metabarcoding studies using a curated regional library and a public repository
topic_facet Ecology
QH540-549.5
description Biodiversity assessments relying on DNA have increased rapidly over the last decade. However, the reliability of taxonomic assignments in metabarcoding studies is variable and affected by the reference databases and the assignment methods used. Species level assignments are usually considered as reliable using regional libraries but unreliable using public repositories. In this study, we aimed to test this assumption for metazoan species detected in the Gulf of St. Lawrence in the Northwest Atlantic. We first created a regional library (GSL-rl) by data mining COI barcode sequences from BOLD, and included a reliability ranking system for species assignments. We then estimated 1) the accuracy and precision of the public repository NCBI-nt for species assignments using sequences from the regional library and 2) compared the detection and reliability of species assignments of a metabarcoding dataset using either NCBI-nt or the regional library and popular assignment methods. With NCBI-nt and sequences from the regional library, the BLAST-LCA (least common ancestor) method was the most precise method for species assignments, but the accuracy was higher with the BLAST-TopHit method (>80% over all taxa, between 70% and 90% amongst taxonomic groups). With the metabarcoding dataset, the reliability of species assignments was greater using GSL-rl compared to NCBI-nt. However, we also observed that the total number of reliable species assignments could be maximized using both GSL-rl and NCBI-nt with different optimized assignment methods. The use of a two-step approach for species assignments, i.e., using a regional library and a public repository, could improve the reliability and the number of detected species in metabarcoding studies.
format Article in Journal/Newspaper
author Audrey Bourret
Claude Nozères
Eric Parent
Geneviève J. Parent
author_facet Audrey Bourret
Claude Nozères
Eric Parent
Geneviève J. Parent
author_sort Audrey Bourret
title Maximizing the reliability and the number of species assignments in metabarcoding studies using a curated regional library and a public repository
title_short Maximizing the reliability and the number of species assignments in metabarcoding studies using a curated regional library and a public repository
title_full Maximizing the reliability and the number of species assignments in metabarcoding studies using a curated regional library and a public repository
title_fullStr Maximizing the reliability and the number of species assignments in metabarcoding studies using a curated regional library and a public repository
title_full_unstemmed Maximizing the reliability and the number of species assignments in metabarcoding studies using a curated regional library and a public repository
title_sort maximizing the reliability and the number of species assignments in metabarcoding studies using a curated regional library and a public repository
publisher Pensoft Publishers
publishDate 2023
url https://doi.org/10.3897/mbmg.7.98539
https://doaj.org/article/e58fc55d0d1a437db9148c10b9347038
genre Northwest Atlantic
genre_facet Northwest Atlantic
op_source Metabarcoding and Metagenomics, Vol 7, Iss , Pp 37-49 (2023)
op_relation https://mbmg.pensoft.net/article/98539/download/pdf/
https://mbmg.pensoft.net/article/98539/download/xml/
https://mbmg.pensoft.net/article/98539/
https://doaj.org/toc/2534-9708
doi:10.3897/mbmg.7.98539
2534-9708
https://doaj.org/article/e58fc55d0d1a437db9148c10b9347038
op_doi https://doi.org/10.3897/mbmg.7.98539
container_title Metabarcoding and Metagenomics
container_volume 7
_version_ 1766148976068788224