Constrained ordination analysis in metagenomics microbial diversity studies

Canonical or constrained correspondence analysis (CCA) is a very popular method for the analysis of species abundance distributions (SAD), particularly when the study objective is to explain differences between the SADs at different sampling sites in terms of local environmental characteristics. The...

Full description

Bibliographic Details
Main Author: Thas, Olivier
Format: Conference Object
Language:English
Published: 2010
Subjects:
Online Access:https://biblio.ugent.be/publication/1853238
http://hdl.handle.net/1854/LU-1853238
id ftunivgent:oai:archive.ugent.be:1853238
record_format openpolar
spelling ftunivgent:oai:archive.ugent.be:1853238 2024-02-11T09:56:28+01:00 Constrained ordination analysis in metagenomics microbial diversity studies Thas, Olivier 2010 https://biblio.ugent.be/publication/1853238 http://hdl.handle.net/1854/LU-1853238 eng eng https://biblio.ugent.be/publication/1853238 http://hdl.handle.net/1854/LU-1853238 XXVth international biometrics conference, Abstracts ISBN: 9780982191910 Mathematics and Statistics conference info:eu-repo/semantics/conferenceObject info:eu-repo/semantics/publishedVersion 2010 ftunivgent 2024-01-24T23:08:39Z Canonical or constrained correspondence analysis (CCA) is a very popular method for the analysis of species abundance distributions (SAD), particularly when the study objective is to explain differences between the SADs at different sampling sites in terms of local environmental characteristics. These methods have been used successfully for moderately sized studies with several tens of sites and species. Current molecular genomics high throughput sequencing techniques allow estimation of SADs of several tens of thousands of microbial species at each sampling site. A consequence of these deep sequencing results is that the SADs are sparse, in the sense that many microbial species have very small or zero abundances at many sampling sites. Because it is well known that CCA is sensitive to these phenomena, and because CCA depends on restrictive assumptions, there is need for a more appropriate statistical method for this type of metagenomics data. We have developed a constrained ordination technique that can cope with sparse high through- put abundance data. The method is related to the statistical models of Yee (2004, Ecological Monographs, 74(4), pp. 685-701), Zhu et al. (2005, Ecological Modelling, 187, pp. 524-536) and Yee (2006, Ecology, 87(1), pp. 203-213). However, instead of assuming a Poisson model for the abundances, we consider a hurdle model with a truncated Poisson component. We also show how our methods relate to the models of rank abundance distributions (RAD) of Foster and Dunstan (2010, Biometrics, 66, pp. 186-195). The new method is applied to a study on microbial communities in Antarctic lakes. The Roche 454 sequencing technique is used to give SADs of several of thousand microbial species in samples from 50 lakes. The study objective is to estimate the relative importance of environ- mental lake characteristics and of the geographic locations of the lakes in explaining differences between the SADs. Conference Object Antarc* Antarctic Ghent University Academic Bibliography Antarctic
institution Open Polar
collection Ghent University Academic Bibliography
op_collection_id ftunivgent
language English
topic Mathematics and Statistics
spellingShingle Mathematics and Statistics
Thas, Olivier
Constrained ordination analysis in metagenomics microbial diversity studies
topic_facet Mathematics and Statistics
description Canonical or constrained correspondence analysis (CCA) is a very popular method for the analysis of species abundance distributions (SAD), particularly when the study objective is to explain differences between the SADs at different sampling sites in terms of local environmental characteristics. These methods have been used successfully for moderately sized studies with several tens of sites and species. Current molecular genomics high throughput sequencing techniques allow estimation of SADs of several tens of thousands of microbial species at each sampling site. A consequence of these deep sequencing results is that the SADs are sparse, in the sense that many microbial species have very small or zero abundances at many sampling sites. Because it is well known that CCA is sensitive to these phenomena, and because CCA depends on restrictive assumptions, there is need for a more appropriate statistical method for this type of metagenomics data. We have developed a constrained ordination technique that can cope with sparse high through- put abundance data. The method is related to the statistical models of Yee (2004, Ecological Monographs, 74(4), pp. 685-701), Zhu et al. (2005, Ecological Modelling, 187, pp. 524-536) and Yee (2006, Ecology, 87(1), pp. 203-213). However, instead of assuming a Poisson model for the abundances, we consider a hurdle model with a truncated Poisson component. We also show how our methods relate to the models of rank abundance distributions (RAD) of Foster and Dunstan (2010, Biometrics, 66, pp. 186-195). The new method is applied to a study on microbial communities in Antarctic lakes. The Roche 454 sequencing technique is used to give SADs of several of thousand microbial species in samples from 50 lakes. The study objective is to estimate the relative importance of environ- mental lake characteristics and of the geographic locations of the lakes in explaining differences between the SADs.
format Conference Object
author Thas, Olivier
author_facet Thas, Olivier
author_sort Thas, Olivier
title Constrained ordination analysis in metagenomics microbial diversity studies
title_short Constrained ordination analysis in metagenomics microbial diversity studies
title_full Constrained ordination analysis in metagenomics microbial diversity studies
title_fullStr Constrained ordination analysis in metagenomics microbial diversity studies
title_full_unstemmed Constrained ordination analysis in metagenomics microbial diversity studies
title_sort constrained ordination analysis in metagenomics microbial diversity studies
publishDate 2010
url https://biblio.ugent.be/publication/1853238
http://hdl.handle.net/1854/LU-1853238
geographic Antarctic
geographic_facet Antarctic
genre Antarc*
Antarctic
genre_facet Antarc*
Antarctic
op_source XXVth international biometrics conference, Abstracts
ISBN: 9780982191910
op_relation https://biblio.ugent.be/publication/1853238
http://hdl.handle.net/1854/LU-1853238
_version_ 1790603297972813824