SSUnique: Detecting Sequence Novelty in Microbiome Surveys
ABSTRACT High-throughput sequencing of small-subunit (SSU) rRNA genes has revolutionized understanding of microbial communities and facilitated investigations into ecological dynamics at unprecedented scales. Such extensive SSU rRNA gene sequence libraries, constructed from DNA extracts of environme...
Published in: | mSystems |
---|---|
Main Authors: | , |
Format: | Article in Journal/Newspaper |
Language: | English |
Published: |
American Society for Microbiology
2016
|
Subjects: | |
Online Access: | https://doi.org/10.1128/mSystems.00133-16 https://doaj.org/article/ec8357535c4b40f2b4f0b8af56646b1b |
id |
ftdoajarticles:oai:doaj.org/article:ec8357535c4b40f2b4f0b8af56646b1b |
---|---|
record_format |
openpolar |
spelling |
ftdoajarticles:oai:doaj.org/article:ec8357535c4b40f2b4f0b8af56646b1b 2023-05-15T15:13:17+02:00 SSUnique: Detecting Sequence Novelty in Microbiome Surveys Michael D. J. Lynch Josh D. Neufeld 2016-12-01T00:00:00Z https://doi.org/10.1128/mSystems.00133-16 https://doaj.org/article/ec8357535c4b40f2b4f0b8af56646b1b EN eng American Society for Microbiology https://journals.asm.org/doi/10.1128/mSystems.00133-16 https://doaj.org/toc/2379-5077 doi:10.1128/mSystems.00133-16 2379-5077 https://doaj.org/article/ec8357535c4b40f2b4f0b8af56646b1b mSystems, Vol 1, Iss 6 (2016) 16S rRNA high-throughput sequencing microbial dark matter microbiome rare biosphere taxonomic blind spots Microbiology QR1-502 article 2016 ftdoajarticles https://doi.org/10.1128/mSystems.00133-16 2022-12-31T09:01:15Z ABSTRACT High-throughput sequencing of small-subunit (SSU) rRNA genes has revolutionized understanding of microbial communities and facilitated investigations into ecological dynamics at unprecedented scales. Such extensive SSU rRNA gene sequence libraries, constructed from DNA extracts of environmental or host-associated samples, often contain a substantial proportion of unclassified sequences, many representing organisms with novel taxonomy (taxonomic “blind spots”) and potentially unique ecology. Indeed, these novel taxonomic lineages are associated with so-called microbial “dark matter,” which is the genomic potential of these lineages. Unfortunately, characterization beyond “unclassified” is challenging due to relatively short read lengths and large data set sizes. Here we demonstrate how mining of phylogenetically novel sequences from microbial ecosystems can be automated using SSUnique, a software pipeline that filters unclassified and/or rare operational taxonomic units (OTUs) from 16S rRNA gene sequence libraries by screening against consensus structural models for SSU rRNA. Phylogenetic position is inferred against a reference data set, and additional characterization of novel clades is also included, such as targeted probe/primer design and mining of assembled metagenomes for genomic context. We show how SSUnique reproduced a previous analysis of phylogenetic novelty from an Arctic tundra soil and demonstrate the recovery of highly novel clades from data sets associated with both the Earth Microbiome Project (EMP) and Human Microbiome Project (HMP). We anticipate that SSUnique will add to the expanding computational toolbox supporting high-throughput sequencing approaches for the study of microbial ecology and phylogeny. IMPORTANCE Extensive SSU rRNA gene sequence libraries, constructed from DNA extracts of environmental or host-associated samples, often contain many unclassified sequences, many representing organisms with novel taxonomy (taxonomic “blind spots”) and potentially unique ecology. This ... Article in Journal/Newspaper Arctic Tundra Directory of Open Access Journals: DOAJ Articles Arctic mSystems 1 6 |
institution |
Open Polar |
collection |
Directory of Open Access Journals: DOAJ Articles |
op_collection_id |
ftdoajarticles |
language |
English |
topic |
16S rRNA high-throughput sequencing microbial dark matter microbiome rare biosphere taxonomic blind spots Microbiology QR1-502 |
spellingShingle |
16S rRNA high-throughput sequencing microbial dark matter microbiome rare biosphere taxonomic blind spots Microbiology QR1-502 Michael D. J. Lynch Josh D. Neufeld SSUnique: Detecting Sequence Novelty in Microbiome Surveys |
topic_facet |
16S rRNA high-throughput sequencing microbial dark matter microbiome rare biosphere taxonomic blind spots Microbiology QR1-502 |
description |
ABSTRACT High-throughput sequencing of small-subunit (SSU) rRNA genes has revolutionized understanding of microbial communities and facilitated investigations into ecological dynamics at unprecedented scales. Such extensive SSU rRNA gene sequence libraries, constructed from DNA extracts of environmental or host-associated samples, often contain a substantial proportion of unclassified sequences, many representing organisms with novel taxonomy (taxonomic “blind spots”) and potentially unique ecology. Indeed, these novel taxonomic lineages are associated with so-called microbial “dark matter,” which is the genomic potential of these lineages. Unfortunately, characterization beyond “unclassified” is challenging due to relatively short read lengths and large data set sizes. Here we demonstrate how mining of phylogenetically novel sequences from microbial ecosystems can be automated using SSUnique, a software pipeline that filters unclassified and/or rare operational taxonomic units (OTUs) from 16S rRNA gene sequence libraries by screening against consensus structural models for SSU rRNA. Phylogenetic position is inferred against a reference data set, and additional characterization of novel clades is also included, such as targeted probe/primer design and mining of assembled metagenomes for genomic context. We show how SSUnique reproduced a previous analysis of phylogenetic novelty from an Arctic tundra soil and demonstrate the recovery of highly novel clades from data sets associated with both the Earth Microbiome Project (EMP) and Human Microbiome Project (HMP). We anticipate that SSUnique will add to the expanding computational toolbox supporting high-throughput sequencing approaches for the study of microbial ecology and phylogeny. IMPORTANCE Extensive SSU rRNA gene sequence libraries, constructed from DNA extracts of environmental or host-associated samples, often contain many unclassified sequences, many representing organisms with novel taxonomy (taxonomic “blind spots”) and potentially unique ecology. This ... |
format |
Article in Journal/Newspaper |
author |
Michael D. J. Lynch Josh D. Neufeld |
author_facet |
Michael D. J. Lynch Josh D. Neufeld |
author_sort |
Michael D. J. Lynch |
title |
SSUnique: Detecting Sequence Novelty in Microbiome Surveys |
title_short |
SSUnique: Detecting Sequence Novelty in Microbiome Surveys |
title_full |
SSUnique: Detecting Sequence Novelty in Microbiome Surveys |
title_fullStr |
SSUnique: Detecting Sequence Novelty in Microbiome Surveys |
title_full_unstemmed |
SSUnique: Detecting Sequence Novelty in Microbiome Surveys |
title_sort |
ssunique: detecting sequence novelty in microbiome surveys |
publisher |
American Society for Microbiology |
publishDate |
2016 |
url |
https://doi.org/10.1128/mSystems.00133-16 https://doaj.org/article/ec8357535c4b40f2b4f0b8af56646b1b |
geographic |
Arctic |
geographic_facet |
Arctic |
genre |
Arctic Tundra |
genre_facet |
Arctic Tundra |
op_source |
mSystems, Vol 1, Iss 6 (2016) |
op_relation |
https://journals.asm.org/doi/10.1128/mSystems.00133-16 https://doaj.org/toc/2379-5077 doi:10.1128/mSystems.00133-16 2379-5077 https://doaj.org/article/ec8357535c4b40f2b4f0b8af56646b1b |
op_doi |
https://doi.org/10.1128/mSystems.00133-16 |
container_title |
mSystems |
container_volume |
1 |
container_issue |
6 |
_version_ |
1766343858360156160 |