Data from: Demographic inferences using short-read genomic data in an Approximate Bayesian Computation framework: in silico evaluation of power, biases, and proof of concept in Atlantic walrus

Approximate Bayesian Computation (ABC) is a powerful tool for model-based inference of demographic population histories from large genetic data sets. For most organisms its implementation has been hampered by the lack of sufficient genetic data. Genotyping-by-sequencing (GBS) provides cheap genome-s...

Full description

Bibliographic Details
Main Authors: Shafer, Aaron B. A., Gattepaille, Lucie M., Stewart, Robert E. A., Wolf, Jochen B. W.
Format: Other/Unknown Material
Language:unknown
Published: Zenodo 2014
Subjects:
Online Access:https://doi.org/10.5061/dryad.78k38
id ftzenodo:oai:zenodo.org:5008782
record_format openpolar
spelling ftzenodo:oai:zenodo.org:5008782 2024-09-15T18:28:33+00:00 Data from: Demographic inferences using short-read genomic data in an Approximate Bayesian Computation framework: in silico evaluation of power, biases, and proof of concept in Atlantic walrus Shafer, Aaron B. A. Gattepaille, Lucie M. Stewart, Robert E. A. Wolf, Jochen B. W. 2014-12-09 https://doi.org/10.5061/dryad.78k38 unknown Zenodo https://doi.org/10.1111/mec.13034 https://zenodo.org/communities/dryad https://doi.org/10.5061/dryad.78k38 oai:zenodo.org:5008782 info:eu-repo/semantics/openAccess Creative Commons Zero v1.0 Universal https://creativecommons.org/publicdomain/zero/1.0/legalcode info:eu-repo/semantics/other 2014 ftzenodo https://doi.org/10.5061/dryad.78k3810.1111/mec.13034 2024-07-26T10:06:21Z Approximate Bayesian Computation (ABC) is a powerful tool for model-based inference of demographic population histories from large genetic data sets. For most organisms its implementation has been hampered by the lack of sufficient genetic data. Genotyping-by-sequencing (GBS) provides cheap genome-scale data to fill this gap, but its potential has not fully been exploited. Here, we explored power, precision and biases of a coalescent-based ABC approach where GBS data were modeled with either a population mutation parameter (θ) or with a fixed sites (FS) approach, allowing single or several segregating sites per locus. With simulated data ranging from 500 to 50,000 loci a variety of demographic models could be reliably inferred across a range of timescales and migration scenarios. Posterior estimates were informative with 1,000 loci for migration and split time in simple population divergence models. In more complex models posterior distributions were wide and almost reverted to the uninformative prior even with 50,000 loci. ABC parameter estimates, however, were generally more accurate than an alternative composite-likelihood method. Bottleneck scenarios proved particularly difficult and only recent bottlenecks without recovery could be reliably detected and dated. Notably, minor allele frequency filters – usual practice for GBS data – negatively affected nearly all estimates. With this in mind, we used a combination of FS and θ approaches on empirical GBS data generated from the Atlantic walrus (Odobenus rosmarus rosmarus), collectively providing support for a population split before the last glacial maximum followed by asymmetrical migration and a range-wide bottleneck. Overall, this study evaluates the potential and limitations of GBS data in an ABC-coalescence framework and proposes a best-practice approach. Walrus ms data Walrus GBS data in ms format Walrus.ms Walrus VCF Walrus GBS data in vcf format Walrus.vcf Other/Unknown Material Odobenus rosmarus walrus* Zenodo
institution Open Polar
collection Zenodo
op_collection_id ftzenodo
language unknown
description Approximate Bayesian Computation (ABC) is a powerful tool for model-based inference of demographic population histories from large genetic data sets. For most organisms its implementation has been hampered by the lack of sufficient genetic data. Genotyping-by-sequencing (GBS) provides cheap genome-scale data to fill this gap, but its potential has not fully been exploited. Here, we explored power, precision and biases of a coalescent-based ABC approach where GBS data were modeled with either a population mutation parameter (θ) or with a fixed sites (FS) approach, allowing single or several segregating sites per locus. With simulated data ranging from 500 to 50,000 loci a variety of demographic models could be reliably inferred across a range of timescales and migration scenarios. Posterior estimates were informative with 1,000 loci for migration and split time in simple population divergence models. In more complex models posterior distributions were wide and almost reverted to the uninformative prior even with 50,000 loci. ABC parameter estimates, however, were generally more accurate than an alternative composite-likelihood method. Bottleneck scenarios proved particularly difficult and only recent bottlenecks without recovery could be reliably detected and dated. Notably, minor allele frequency filters – usual practice for GBS data – negatively affected nearly all estimates. With this in mind, we used a combination of FS and θ approaches on empirical GBS data generated from the Atlantic walrus (Odobenus rosmarus rosmarus), collectively providing support for a population split before the last glacial maximum followed by asymmetrical migration and a range-wide bottleneck. Overall, this study evaluates the potential and limitations of GBS data in an ABC-coalescence framework and proposes a best-practice approach. Walrus ms data Walrus GBS data in ms format Walrus.ms Walrus VCF Walrus GBS data in vcf format Walrus.vcf
format Other/Unknown Material
author Shafer, Aaron B. A.
Gattepaille, Lucie M.
Stewart, Robert E. A.
Wolf, Jochen B. W.
spellingShingle Shafer, Aaron B. A.
Gattepaille, Lucie M.
Stewart, Robert E. A.
Wolf, Jochen B. W.
Data from: Demographic inferences using short-read genomic data in an Approximate Bayesian Computation framework: in silico evaluation of power, biases, and proof of concept in Atlantic walrus
author_facet Shafer, Aaron B. A.
Gattepaille, Lucie M.
Stewart, Robert E. A.
Wolf, Jochen B. W.
author_sort Shafer, Aaron B. A.
title Data from: Demographic inferences using short-read genomic data in an Approximate Bayesian Computation framework: in silico evaluation of power, biases, and proof of concept in Atlantic walrus
title_short Data from: Demographic inferences using short-read genomic data in an Approximate Bayesian Computation framework: in silico evaluation of power, biases, and proof of concept in Atlantic walrus
title_full Data from: Demographic inferences using short-read genomic data in an Approximate Bayesian Computation framework: in silico evaluation of power, biases, and proof of concept in Atlantic walrus
title_fullStr Data from: Demographic inferences using short-read genomic data in an Approximate Bayesian Computation framework: in silico evaluation of power, biases, and proof of concept in Atlantic walrus
title_full_unstemmed Data from: Demographic inferences using short-read genomic data in an Approximate Bayesian Computation framework: in silico evaluation of power, biases, and proof of concept in Atlantic walrus
title_sort data from: demographic inferences using short-read genomic data in an approximate bayesian computation framework: in silico evaluation of power, biases, and proof of concept in atlantic walrus
publisher Zenodo
publishDate 2014
url https://doi.org/10.5061/dryad.78k38
genre Odobenus rosmarus
walrus*
genre_facet Odobenus rosmarus
walrus*
op_relation https://doi.org/10.1111/mec.13034
https://zenodo.org/communities/dryad
https://doi.org/10.5061/dryad.78k38
oai:zenodo.org:5008782
op_rights info:eu-repo/semantics/openAccess
Creative Commons Zero v1.0 Universal
https://creativecommons.org/publicdomain/zero/1.0/legalcode
op_doi https://doi.org/10.5061/dryad.78k3810.1111/mec.13034
_version_ 1810469921841217536