Data from: Trans-oceanic genomic divergence of Atlantic cod ecotypes is associated with large inversions

Chromosomal rearrangements such as inversions can play a crucial role in maintaining polymorphism underlying complex traits and contribute to the process of speciation. In Atlantic cod (Gadus morhua), inversions of several megabases have been identified that dominate genomic differentiation between...

Full description

Bibliographic Details
Main Authors: Berg, Paul R., Star, Bastiaan, Pampoulie, Christophe, Bradbury, Ian R., Bentzen, Paul, Hutchings, Jeffrey A., Jentoft, Sissel, Jakobsen, Kjetill S.
Format: Dataset
Language:unknown
Published: Dryad Digital Repository 2017
Subjects:
Online Access:https://doi.org/10.5061/dryad.b20ps
Description
Summary:Chromosomal rearrangements such as inversions can play a crucial role in maintaining polymorphism underlying complex traits and contribute to the process of speciation. In Atlantic cod (Gadus morhua), inversions of several megabases have been identified that dominate genomic differentiation between migratory and non-migratory ecotypes in the Northeast Atlantic. Here, we show that the same genomic regions display elevated divergence and contribute to ecotype divergence in the Northwest Atlantic as well. The occurrence of these inversions on both sides of the Atlantic Ocean reveals a common evolutionary origin, predating the more than 100,000 years old trans-Atlantic separation of Atlantic cod. The long-term persistence of these inversions indicates that they are maintained by selection, possibly facilitated by co-evolution of genes underlying complex traits. Our data suggest that migratory behaviour is derived from more stationary, ancestral ecotypes. Overall, we identify several large genomic regions - each containing hundreds of genes – likely involved in the maintenance of genomic divergence in Atlantic cod on both sides of the Atlantic Ocean. BergEtAl2017_AtlanticCod_TransatlanticDataset_PLINK-formatSNP array data for 316 individuals of Atlantic cod (Gadus morhua), genotyped at 8,165 loci in standard PLINK file format. The PLINK file format consists of two files, a map file and a ped file. In the map file, the first column defines the linkage group, the second column contains the loci names (dbSNP accession numbers), the third line describes the position within the linkage groups (here all are set to 0) and the fourth column defines the order of the SNPs within each linkage group. In the ped file, column one is used to separate the populations (pop-1 to pop-9), column 2 defines the individuals within each population (denoted as the population abbreviation followed by the individual number) where the populations are abbreviated as follows: Can-N_PB = Placentia Bay, Can-N_SG = Southern Gulf of St. Lawrence, ...