Detecting genetic signals of selection in heavily bottlenecked reindeer populations by comparing parallel founder events ...

Founder populations are of special interest to both evolutionary and conservation biologists, but the detection of genetic signals of selection in these populations is challenging due to their demographic history. Geographically separated founder populations likely subjected to similar selection pre...

Full description

Bibliographic Details
Main Authors: De Jong, Menno, Lovatt, Fiona, Hoelzel, Rus
Format: Dataset
Language:English
Published: Dryad 2021
Subjects:
Online Access:https://dx.doi.org/10.5061/dryad.rv15dv47h
https://datadryad.org/stash/dataset/doi:10.5061/dryad.rv15dv47h
Description
Summary:Founder populations are of special interest to both evolutionary and conservation biologists, but the detection of genetic signals of selection in these populations is challenging due to their demographic history. Geographically separated founder populations likely subjected to similar selection pressures provide an ideal but rare opportunity to overcome these challenges. Here we take advantage of such a situation generated when small, isolated founder populations of reindeer were established on the island of South Georgia, and using this system we look for empirical evidence of selection overcoming strong genetic drift. We generated a 70K ddRADseq SNP database for the two parallel reindeer founder populations and screened for signatures of soft sweeps. We find evidence for a genomic region under selection shared among the two populations, and support our findings with Wright-Fisher model simulations to assess the power and specificity of interpopulation selection scans – i.e. Bayescan, OutFLANK, PCadapt and ... : SNP datasets containing genotype information for samples taken from the two introduced populations of reindeer (Rangifer tarandus) which occurred until recently on the south-Atlantic island South Georgia. The dataset also includes samples from their Norwegian source population. Sequencing libraries were generated with the ddRADseq protocol, and SNPs were subsequently called using the STACKS2.2 refmap pipeline, using the reindeer genome as reference. The data contains multiple SNPs per read pair. The data is stored in PED and MAP format (plink format). Each population is represented by ~30 individuals, as specified in the popfile. 'Busen' and 'Barff' are the names of the two peninsulas on which the two South Georgia reindeer populations occured. Also included in the download are read depth depth information files generated with vcftools. ...