The generation of the first chromosome-level de novo genome assembly and the development and validation of a 50K SNP array for the St. John River aquaculture strain of North American Atlantic salmon

Atlantic salmon (Salmo salar) in Northeastern US and Eastern Canada has high economic value for the sport fishing and aquaculture industries. Large differences exist between the genomes of Atlantic salmon of European origin and North American (N.A.) origin. Given the genetic and genomic differences...

Full description

Bibliographic Details
Published in:G3: Genes, Genomes, Genetics
Main Authors: Gao, Guangtu, Waldbieser, Geoffrey C, Youngblood, Ramey C, Zhao, Dongyan, Pietrak, Michael R, Allen, Melissa S, Stannard, Jason A, Buchanan, John T, Long, Roseanna L, Milligan, Melissa, Burr, Gary, Mejía-Guerra, Katherine, Sheehan, Moira J, Scheffler, Brian E, Rexroad, Caird E, Peterson, Brian C, Palti, Yniv
Format: Text
Language:English
Published: Oxford University Press 2023
Subjects:
Online Access:http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10468304/
http://www.ncbi.nlm.nih.gov/pubmed/37335943
https://doi.org/10.1093/g3journal/jkad138
Description
Summary:Atlantic salmon (Salmo salar) in Northeastern US and Eastern Canada has high economic value for the sport fishing and aquaculture industries. Large differences exist between the genomes of Atlantic salmon of European origin and North American (N.A.) origin. Given the genetic and genomic differences between the 2 lineages, it is crucial to develop unique genomic resources for N.A. Atlantic salmon. Here, we describe the resources that we recently developed for genomic and genetic research in N.A. Atlantic salmon aquaculture. Firstly, a new single nucleotide polymorphism (SNP) database for N.A. Atlantic salmon consisting of 3.1 million putative SNPs was generated using data from whole-genome resequencing of 80 N.A. Atlantic salmon individuals. Secondly, a high-density 50K SNP array enriched for the genic regions of the genome and containing 3 sex determination and 61 putative continent of origin markers was developed and validated. Thirdly, a genetic map composed of 27 linkage groups with 36K SNP markers was generated from 2,512 individuals in 141 full-sib families. Finally, a chromosome-level de novo genome assembly from a male N.A. Atlantic salmon from the St. John River aquaculture strain was generated using PacBio long reads. Information from Hi-C proximity ligation sequences and Bionano optical mapping was used to concatenate the contigs into scaffolds. The assembly contains 1,755 scaffolds and only 1,253 gaps, with a total length of 2.83 Gb and N50 of 17.2 Mb. A BUSCO analysis detected 96.2% of the conserved Actinopterygii genes in the assembly, and the genetic linkage information was used to guide the formation of 27 chromosome sequences. Comparative analysis with the reference genome assembly of the European Atlantic salmon confirmed that the karyotype differences between the 2 lineages are caused by a fission in chromosome Ssa01 and 3 chromosome fusions including the p arm of chromosome Ssa01 with Ssa23, Ssa08 with Ssa29, and Ssa26 with Ssa28. The genomic resources we have generated for Atlantic salmon ...