Genotype calling and mapping of multisite variants using an Atlantic salmon iSelect SNP-array

Motivation: Due to a genome duplication event in the recent history of salmonids, modern Atlantic salmon ( Salmo salar ) have a mosaic genome with roughly 1/3rd being tetraploid. This is a complicating factor in genotyping and genetic mapping since polymorphisms within duplicated regions (multisite...

Full description

Bibliographic Details
Published in:Bioinformatics
Main Authors: Gidskehaug, Lars, Kent, Matthew, Hayes, Ben J., Lien, Sigbjørn
Format: Text
Language:English
Published: Oxford University Press 2010
Subjects:
Online Access:http://bioinformatics.oxfordjournals.org/cgi/content/short/btq673v1
https://doi.org/10.1093/bioinformatics/btq673
Description
Summary:Motivation: Due to a genome duplication event in the recent history of salmonids, modern Atlantic salmon ( Salmo salar ) have a mosaic genome with roughly 1/3rd being tetraploid. This is a complicating factor in genotyping and genetic mapping since polymorphisms within duplicated regions (multisite variants; MSVs) are challenging to call and to assign to the correct paralogue. Standard genotyping software offered by Illumina has not been written to interpret MSVs and will either fail or mis-call these polymorphisms. For the purpose of mapping, linkage, or association studies in non-diploid species, there is a pressing need for software that includes analysis of MSVs in addition to regular single nucleotide polymorphism (SNP) markers. Results: A software package is presented for the analysis of partially tetraploid genomes genotyped using Illumina Infinium BeadArrays (Illumina Inc.) that includes pre-processing, clustering, plotting, and validation routines. More than 3000 salmon from an aquacultural strain in Norway, distributed among 266 full-sib families, were genotyped on a 15K BeadArray including both SNP- and MSV-markers. A total of 4268 SNPs and 1471 MSVs were identified, with average call accuracies of 0.97 and 0.86, respectively. A total of 150 MSVs polymorphic in both paralogs were dissected and mapped to their respective chromosomes, yielding insights about the salmon genome reversion to diploidy and improving marker genome coverage. Several retained homeologies were found and are reported. Availability and Implementation: R-package beadarrayMSV freely available on the web at http://cran.r-project.org/ . Contact: lars.gidskehaug@umb.no Supplementary information: Supplementary data are available at Bioinformatics online.