SSUnique: Detecting Sequence Novelty in Microbiome Surveys

ABSTRACT High-throughput sequencing of small-subunit (SSU) rRNA genes has revolutionized understanding of microbial communities and facilitated investigations into ecological dynamics at unprecedented scales. Such extensive SSU rRNA gene sequence libraries, constructed from DNA extracts of environme...

Full description

Bibliographic Details
Published in:mSystems
Main Authors: Michael D. J. Lynch, Josh D. Neufeld
Format: Article in Journal/Newspaper
Language:English
Published: American Society for Microbiology 2016
Subjects:
Online Access:https://doi.org/10.1128/mSystems.00133-16
https://doaj.org/article/ec8357535c4b40f2b4f0b8af56646b1b
id ftdoajarticles:oai:doaj.org/article:ec8357535c4b40f2b4f0b8af56646b1b
record_format openpolar
spelling ftdoajarticles:oai:doaj.org/article:ec8357535c4b40f2b4f0b8af56646b1b 2023-05-15T15:13:17+02:00 SSUnique: Detecting Sequence Novelty in Microbiome Surveys Michael D. J. Lynch Josh D. Neufeld 2016-12-01T00:00:00Z https://doi.org/10.1128/mSystems.00133-16 https://doaj.org/article/ec8357535c4b40f2b4f0b8af56646b1b EN eng American Society for Microbiology https://journals.asm.org/doi/10.1128/mSystems.00133-16 https://doaj.org/toc/2379-5077 doi:10.1128/mSystems.00133-16 2379-5077 https://doaj.org/article/ec8357535c4b40f2b4f0b8af56646b1b mSystems, Vol 1, Iss 6 (2016) 16S rRNA high-throughput sequencing microbial dark matter microbiome rare biosphere taxonomic blind spots Microbiology QR1-502 article 2016 ftdoajarticles https://doi.org/10.1128/mSystems.00133-16 2022-12-31T09:01:15Z ABSTRACT High-throughput sequencing of small-subunit (SSU) rRNA genes has revolutionized understanding of microbial communities and facilitated investigations into ecological dynamics at unprecedented scales. Such extensive SSU rRNA gene sequence libraries, constructed from DNA extracts of environmental or host-associated samples, often contain a substantial proportion of unclassified sequences, many representing organisms with novel taxonomy (taxonomic “blind spots”) and potentially unique ecology. Indeed, these novel taxonomic lineages are associated with so-called microbial “dark matter,” which is the genomic potential of these lineages. Unfortunately, characterization beyond “unclassified” is challenging due to relatively short read lengths and large data set sizes. Here we demonstrate how mining of phylogenetically novel sequences from microbial ecosystems can be automated using SSUnique, a software pipeline that filters unclassified and/or rare operational taxonomic units (OTUs) from 16S rRNA gene sequence libraries by screening against consensus structural models for SSU rRNA. Phylogenetic position is inferred against a reference data set, and additional characterization of novel clades is also included, such as targeted probe/primer design and mining of assembled metagenomes for genomic context. We show how SSUnique reproduced a previous analysis of phylogenetic novelty from an Arctic tundra soil and demonstrate the recovery of highly novel clades from data sets associated with both the Earth Microbiome Project (EMP) and Human Microbiome Project (HMP). We anticipate that SSUnique will add to the expanding computational toolbox supporting high-throughput sequencing approaches for the study of microbial ecology and phylogeny. IMPORTANCE Extensive SSU rRNA gene sequence libraries, constructed from DNA extracts of environmental or host-associated samples, often contain many unclassified sequences, many representing organisms with novel taxonomy (taxonomic “blind spots”) and potentially unique ecology. This ... Article in Journal/Newspaper Arctic Tundra Directory of Open Access Journals: DOAJ Articles Arctic mSystems 1 6
institution Open Polar
collection Directory of Open Access Journals: DOAJ Articles
op_collection_id ftdoajarticles
language English
topic 16S rRNA
high-throughput sequencing
microbial dark matter
microbiome
rare biosphere
taxonomic blind spots
Microbiology
QR1-502
spellingShingle 16S rRNA
high-throughput sequencing
microbial dark matter
microbiome
rare biosphere
taxonomic blind spots
Microbiology
QR1-502
Michael D. J. Lynch
Josh D. Neufeld
SSUnique: Detecting Sequence Novelty in Microbiome Surveys
topic_facet 16S rRNA
high-throughput sequencing
microbial dark matter
microbiome
rare biosphere
taxonomic blind spots
Microbiology
QR1-502
description ABSTRACT High-throughput sequencing of small-subunit (SSU) rRNA genes has revolutionized understanding of microbial communities and facilitated investigations into ecological dynamics at unprecedented scales. Such extensive SSU rRNA gene sequence libraries, constructed from DNA extracts of environmental or host-associated samples, often contain a substantial proportion of unclassified sequences, many representing organisms with novel taxonomy (taxonomic “blind spots”) and potentially unique ecology. Indeed, these novel taxonomic lineages are associated with so-called microbial “dark matter,” which is the genomic potential of these lineages. Unfortunately, characterization beyond “unclassified” is challenging due to relatively short read lengths and large data set sizes. Here we demonstrate how mining of phylogenetically novel sequences from microbial ecosystems can be automated using SSUnique, a software pipeline that filters unclassified and/or rare operational taxonomic units (OTUs) from 16S rRNA gene sequence libraries by screening against consensus structural models for SSU rRNA. Phylogenetic position is inferred against a reference data set, and additional characterization of novel clades is also included, such as targeted probe/primer design and mining of assembled metagenomes for genomic context. We show how SSUnique reproduced a previous analysis of phylogenetic novelty from an Arctic tundra soil and demonstrate the recovery of highly novel clades from data sets associated with both the Earth Microbiome Project (EMP) and Human Microbiome Project (HMP). We anticipate that SSUnique will add to the expanding computational toolbox supporting high-throughput sequencing approaches for the study of microbial ecology and phylogeny. IMPORTANCE Extensive SSU rRNA gene sequence libraries, constructed from DNA extracts of environmental or host-associated samples, often contain many unclassified sequences, many representing organisms with novel taxonomy (taxonomic “blind spots”) and potentially unique ecology. This ...
format Article in Journal/Newspaper
author Michael D. J. Lynch
Josh D. Neufeld
author_facet Michael D. J. Lynch
Josh D. Neufeld
author_sort Michael D. J. Lynch
title SSUnique: Detecting Sequence Novelty in Microbiome Surveys
title_short SSUnique: Detecting Sequence Novelty in Microbiome Surveys
title_full SSUnique: Detecting Sequence Novelty in Microbiome Surveys
title_fullStr SSUnique: Detecting Sequence Novelty in Microbiome Surveys
title_full_unstemmed SSUnique: Detecting Sequence Novelty in Microbiome Surveys
title_sort ssunique: detecting sequence novelty in microbiome surveys
publisher American Society for Microbiology
publishDate 2016
url https://doi.org/10.1128/mSystems.00133-16
https://doaj.org/article/ec8357535c4b40f2b4f0b8af56646b1b
geographic Arctic
geographic_facet Arctic
genre Arctic
Tundra
genre_facet Arctic
Tundra
op_source mSystems, Vol 1, Iss 6 (2016)
op_relation https://journals.asm.org/doi/10.1128/mSystems.00133-16
https://doaj.org/toc/2379-5077
doi:10.1128/mSystems.00133-16
2379-5077
https://doaj.org/article/ec8357535c4b40f2b4f0b8af56646b1b
op_doi https://doi.org/10.1128/mSystems.00133-16
container_title mSystems
container_volume 1
container_issue 6
_version_ 1766343858360156160