Data from: Demographic inferences using short-read genomic data in an Approximate Bayesian Computation framework: in silico evaluation of power, biases, and proof of concept in Atlantic walrus

Approximate Bayesian Computation (ABC) is a powerful tool for model-based inference of demographic population histories from large genetic data sets. For most organisms its implementation has been hampered by the lack of sufficient genetic data. Genotyping-by-sequencing (GBS) provides cheap genome-s...

Full description

Bibliographic Details
Main Authors: Shafer, Aaron B. A., Gattepaille, Lucie M., Stewart, Robert E. A., Wolf, Jochen B. W.
Format: Article in Journal/Newspaper
Language:unknown
Published: 2014
Subjects:
Online Access:http://hdl.handle.net/10255/dryad.74844
https://doi.org/10.5061/dryad.78k38
id ftdryad:oai:v1.datadryad.org:10255/dryad.74844
record_format openpolar
spelling ftdryad:oai:v1.datadryad.org:10255/dryad.74844 2023-05-15T17:52:25+02:00 Data from: Demographic inferences using short-read genomic data in an Approximate Bayesian Computation framework: in silico evaluation of power, biases, and proof of concept in Atlantic walrus Shafer, Aaron B. A. Gattepaille, Lucie M. Stewart, Robert E. A. Wolf, Jochen B. W. 2014-12-09T20:02:49Z http://hdl.handle.net/10255/dryad.74844 https://doi.org/10.5061/dryad.78k38 unknown doi:10.5061/dryad.78k38/1 doi:10.5061/dryad.78k38/2 doi:10.1111/mec.13034 PMID:25482153 doi:10.5061/dryad.78k38 Shafer ABA, Gattepaille LM, Stewart REA, Wolf JBW (2015) Demographic inferences using short-read genomic data in an Approximate Bayesian Computation framework: in silico evaluation of power, biases, and proof of concept in Atlantic walrus. Molecular Ecology 24(2): 328-345. http://hdl.handle.net/10255/dryad.74844 Article 2014 ftdryad https://doi.org/10.5061/dryad.78k38 https://doi.org/10.5061/dryad.78k38/1 https://doi.org/10.5061/dryad.78k38/2 https://doi.org/10.1111/mec.13034 2020-01-01T15:13:45Z Approximate Bayesian Computation (ABC) is a powerful tool for model-based inference of demographic population histories from large genetic data sets. For most organisms its implementation has been hampered by the lack of sufficient genetic data. Genotyping-by-sequencing (GBS) provides cheap genome-scale data to fill this gap, but its potential has not fully been exploited. Here, we explored power, precision and biases of a coalescent-based ABC approach where GBS data were modeled with either a population mutation parameter (θ) or with a fixed sites (FS) approach, allowing single or several segregating sites per locus. With simulated data ranging from 500 to 50,000 loci a variety of demographic models could be reliably inferred across a range of timescales and migration scenarios. Posterior estimates were informative with 1,000 loci for migration and split time in simple population divergence models. In more complex models posterior distributions were wide and almost reverted to the uninformative prior even with 50,000 loci. ABC parameter estimates, however, were generally more accurate than an alternative composite-likelihood method. Bottleneck scenarios proved particularly difficult and only recent bottlenecks without recovery could be reliably detected and dated. Notably, minor allele frequency filters – usual practice for GBS data – negatively affected nearly all estimates. With this in mind, we used a combination of FS and θ approaches on empirical GBS data generated from the Atlantic walrus (Odobenus rosmarus rosmarus), collectively providing support for a population split before the last glacial maximum followed by asymmetrical migration and a range-wide bottleneck. Overall, this study evaluates the potential and limitations of GBS data in an ABC-coalescence framework and proposes a best-practice approach. Article in Journal/Newspaper Odobenus rosmarus walrus* Dryad Digital Repository (Duke University)
institution Open Polar
collection Dryad Digital Repository (Duke University)
op_collection_id ftdryad
language unknown
description Approximate Bayesian Computation (ABC) is a powerful tool for model-based inference of demographic population histories from large genetic data sets. For most organisms its implementation has been hampered by the lack of sufficient genetic data. Genotyping-by-sequencing (GBS) provides cheap genome-scale data to fill this gap, but its potential has not fully been exploited. Here, we explored power, precision and biases of a coalescent-based ABC approach where GBS data were modeled with either a population mutation parameter (θ) or with a fixed sites (FS) approach, allowing single or several segregating sites per locus. With simulated data ranging from 500 to 50,000 loci a variety of demographic models could be reliably inferred across a range of timescales and migration scenarios. Posterior estimates were informative with 1,000 loci for migration and split time in simple population divergence models. In more complex models posterior distributions were wide and almost reverted to the uninformative prior even with 50,000 loci. ABC parameter estimates, however, were generally more accurate than an alternative composite-likelihood method. Bottleneck scenarios proved particularly difficult and only recent bottlenecks without recovery could be reliably detected and dated. Notably, minor allele frequency filters – usual practice for GBS data – negatively affected nearly all estimates. With this in mind, we used a combination of FS and θ approaches on empirical GBS data generated from the Atlantic walrus (Odobenus rosmarus rosmarus), collectively providing support for a population split before the last glacial maximum followed by asymmetrical migration and a range-wide bottleneck. Overall, this study evaluates the potential and limitations of GBS data in an ABC-coalescence framework and proposes a best-practice approach.
format Article in Journal/Newspaper
author Shafer, Aaron B. A.
Gattepaille, Lucie M.
Stewart, Robert E. A.
Wolf, Jochen B. W.
spellingShingle Shafer, Aaron B. A.
Gattepaille, Lucie M.
Stewart, Robert E. A.
Wolf, Jochen B. W.
Data from: Demographic inferences using short-read genomic data in an Approximate Bayesian Computation framework: in silico evaluation of power, biases, and proof of concept in Atlantic walrus
author_facet Shafer, Aaron B. A.
Gattepaille, Lucie M.
Stewart, Robert E. A.
Wolf, Jochen B. W.
author_sort Shafer, Aaron B. A.
title Data from: Demographic inferences using short-read genomic data in an Approximate Bayesian Computation framework: in silico evaluation of power, biases, and proof of concept in Atlantic walrus
title_short Data from: Demographic inferences using short-read genomic data in an Approximate Bayesian Computation framework: in silico evaluation of power, biases, and proof of concept in Atlantic walrus
title_full Data from: Demographic inferences using short-read genomic data in an Approximate Bayesian Computation framework: in silico evaluation of power, biases, and proof of concept in Atlantic walrus
title_fullStr Data from: Demographic inferences using short-read genomic data in an Approximate Bayesian Computation framework: in silico evaluation of power, biases, and proof of concept in Atlantic walrus
title_full_unstemmed Data from: Demographic inferences using short-read genomic data in an Approximate Bayesian Computation framework: in silico evaluation of power, biases, and proof of concept in Atlantic walrus
title_sort data from: demographic inferences using short-read genomic data in an approximate bayesian computation framework: in silico evaluation of power, biases, and proof of concept in atlantic walrus
publishDate 2014
url http://hdl.handle.net/10255/dryad.74844
https://doi.org/10.5061/dryad.78k38
genre Odobenus rosmarus
walrus*
genre_facet Odobenus rosmarus
walrus*
op_relation doi:10.5061/dryad.78k38/1
doi:10.5061/dryad.78k38/2
doi:10.1111/mec.13034
PMID:25482153
doi:10.5061/dryad.78k38
Shafer ABA, Gattepaille LM, Stewart REA, Wolf JBW (2015) Demographic inferences using short-read genomic data in an Approximate Bayesian Computation framework: in silico evaluation of power, biases, and proof of concept in Atlantic walrus. Molecular Ecology 24(2): 328-345.
http://hdl.handle.net/10255/dryad.74844
op_doi https://doi.org/10.5061/dryad.78k38
https://doi.org/10.5061/dryad.78k38/1
https://doi.org/10.5061/dryad.78k38/2
https://doi.org/10.1111/mec.13034
_version_ 1766159835636695040