Bayesian nonparametric dependent model for partially replicated data: The influence of fuel spills on species diversity

We introduce a dependent Bayesian nonparametric model for the probabilistic modeling of membership of subgroups in a community based on partially replicated data. The focus here is on species-by-site data, that is, community data where observations at different sites are classified in distinct speci...

Full description

Bibliographic Details
Published in:The Annals of Applied Statistics
Main Authors: Arbel, J, Mengersen, K, Rousseau, J
Format: Article in Journal/Newspaper
Language:unknown
Published: Institute of Mathematical Statistics 2017
Subjects:
Online Access:https://doi.org/10.1214/16-AOAS944
https://ora.ox.ac.uk/objects/uuid:bbe1b213-0c10-425b-b4c4-f9dcbedfad37
id ftuloxford:oai:ora.ox.ac.uk:uuid:bbe1b213-0c10-425b-b4c4-f9dcbedfad37
record_format openpolar
spelling ftuloxford:oai:ora.ox.ac.uk:uuid:bbe1b213-0c10-425b-b4c4-f9dcbedfad37 2023-05-15T13:41:08+02:00 Bayesian nonparametric dependent model for partially replicated data: The influence of fuel spills on species diversity Arbel, J Mengersen, K Rousseau, J 2017-12-13 https://doi.org/10.1214/16-AOAS944 https://ora.ox.ac.uk/objects/uuid:bbe1b213-0c10-425b-b4c4-f9dcbedfad37 unknown Institute of Mathematical Statistics doi:10.1214/16-AOAS944 https://ora.ox.ac.uk/objects/uuid:bbe1b213-0c10-425b-b4c4-f9dcbedfad37 https://doi.org/10.1214/16-AOAS944 info:eu-repo/semantics/openAccess Journal article 2017 ftuloxford https://doi.org/10.1214/16-AOAS944 2022-06-28T20:22:28Z We introduce a dependent Bayesian nonparametric model for the probabilistic modeling of membership of subgroups in a community based on partially replicated data. The focus here is on species-by-site data, that is, community data where observations at different sites are classified in distinct species. Our aim is to study the impact of additional covariates, for instance, environmental variables, on the data structure, and in particular on the community diversity. To this end, we introduce dependence a priori across the covariates and show that it improves posterior inference. We use a dependent version of the Griffiths–Engen–McCloskey distribution defined via the stickbreaking construction. This distribution is obtained by transforming a Gaussian process whose covariance function controls the desired dependence. The resulting posterior distribution is sampled by Markov chain Monte Carlo. We illustrate the application of our model to a soil microbial data set acquired across a hydrocarbon contamination gradient at the site of a fuel spill in Antarctica. This method allows for inference on a number of quantities of interest in ecotoxicology, such as diversity or effective concentrations, and is broadly applicable to the general problem of community response to environmental variables. Article in Journal/Newspaper Antarc* Antarctica ORA - Oxford University Research Archive The Annals of Applied Statistics 10 3
institution Open Polar
collection ORA - Oxford University Research Archive
op_collection_id ftuloxford
language unknown
description We introduce a dependent Bayesian nonparametric model for the probabilistic modeling of membership of subgroups in a community based on partially replicated data. The focus here is on species-by-site data, that is, community data where observations at different sites are classified in distinct species. Our aim is to study the impact of additional covariates, for instance, environmental variables, on the data structure, and in particular on the community diversity. To this end, we introduce dependence a priori across the covariates and show that it improves posterior inference. We use a dependent version of the Griffiths–Engen–McCloskey distribution defined via the stickbreaking construction. This distribution is obtained by transforming a Gaussian process whose covariance function controls the desired dependence. The resulting posterior distribution is sampled by Markov chain Monte Carlo. We illustrate the application of our model to a soil microbial data set acquired across a hydrocarbon contamination gradient at the site of a fuel spill in Antarctica. This method allows for inference on a number of quantities of interest in ecotoxicology, such as diversity or effective concentrations, and is broadly applicable to the general problem of community response to environmental variables.
format Article in Journal/Newspaper
author Arbel, J
Mengersen, K
Rousseau, J
spellingShingle Arbel, J
Mengersen, K
Rousseau, J
Bayesian nonparametric dependent model for partially replicated data: The influence of fuel spills on species diversity
author_facet Arbel, J
Mengersen, K
Rousseau, J
author_sort Arbel, J
title Bayesian nonparametric dependent model for partially replicated data: The influence of fuel spills on species diversity
title_short Bayesian nonparametric dependent model for partially replicated data: The influence of fuel spills on species diversity
title_full Bayesian nonparametric dependent model for partially replicated data: The influence of fuel spills on species diversity
title_fullStr Bayesian nonparametric dependent model for partially replicated data: The influence of fuel spills on species diversity
title_full_unstemmed Bayesian nonparametric dependent model for partially replicated data: The influence of fuel spills on species diversity
title_sort bayesian nonparametric dependent model for partially replicated data: the influence of fuel spills on species diversity
publisher Institute of Mathematical Statistics
publishDate 2017
url https://doi.org/10.1214/16-AOAS944
https://ora.ox.ac.uk/objects/uuid:bbe1b213-0c10-425b-b4c4-f9dcbedfad37
genre Antarc*
Antarctica
genre_facet Antarc*
Antarctica
op_relation doi:10.1214/16-AOAS944
https://ora.ox.ac.uk/objects/uuid:bbe1b213-0c10-425b-b4c4-f9dcbedfad37
https://doi.org/10.1214/16-AOAS944
op_rights info:eu-repo/semantics/openAccess
op_doi https://doi.org/10.1214/16-AOAS944
container_title The Annals of Applied Statistics
container_volume 10
container_issue 3
_version_ 1766146068166213632