Assessing the Effects of the New Atlantic Salmon (Salmo salar) Genome Assembly on Imputation Accuracy

The Atlantic salmon is one of the most economically important species in modern-day aquaculture. For this reason, a lot of effort has been put into implementation and improvement of breeding programs for this species, achieving vast genetic process in a considerably short period of time. Improvement...

Full description

Bibliographic Details
Main Author: Manousi, Domniki
Format: Master Thesis
Language:English
Published: Norwegian University of Life Sciences, Ås 2021
Subjects:
Online Access:https://hdl.handle.net/11250/2772899
id ftunivmob:oai:nmbu.brage.unit.no:11250/2772899
record_format openpolar
spelling ftunivmob:oai:nmbu.brage.unit.no:11250/2772899 2023-05-15T15:30:26+02:00 Assessing the Effects of the New Atlantic Salmon (Salmo salar) Genome Assembly on Imputation Accuracy Manousi, Domniki 2021-06-01 application/pdf https://hdl.handle.net/11250/2772899 eng eng Norwegian University of Life Sciences, Ås https://hdl.handle.net/11250/2772899 Attribution-NonCommercial-NoDerivatives 4.0 Internasjonal http://creativecommons.org/licenses/by-nc-nd/4.0/deed.no CC-BY-NC-ND 44 Atlantic salmon Salmo salar Genotype imputation Genome assembly Long read sequencing VDP::Landbruks- og Fiskerifag: 900 Master thesis 2021 ftunivmob 2021-09-23T20:16:17Z The Atlantic salmon is one of the most economically important species in modern-day aquaculture. For this reason, a lot of effort has been put into implementation and improvement of breeding programs for this species, achieving vast genetic process in a considerably short period of time. Improvements in sequencing technologies have facilitated the use of genomic selection, integrating molecular genetic information and increasing selection response for key production traits of polygenic architecture. However, implementation of genomic selection requires large, densely genotyped populations, which can prove challenging, especially considering aquatic populations. Genotype imputation therefore, constitutes a cost-efficient method that amplifies the genotyping density of large populations, allowing them to be analyzed in low-density and cost genotyping platforms. Although at the time of the first Atlantic salmon genome assembly leading sequencing and bioinformatic methods were applied to assemble the genome reference, the high genomic complexity of the species severely impacted the quality of the produced assembly. Assembly errors are expected to primarily affect genotyping quality and consequently all downstream analyses. The recent release of a new genome assembly for Atlantic salmon (NCBI GeneBank reference: GCA_905237065.2), constructed using long-read sequencing technologies, is expected to improve our understanding of salmon genetics and genomics as well as contribute to the application of higher-quality genomic data in salmon breeding. In this study we explored the improvements achieved in the new genome assembly, as these were realized through a genotype imputation analysis using a small sample of immediate relatives. We report large structural changes occurring in the new genome assembly and discuss their impact on imputation accuracy as well as on currently available genotyping platforms. We also provide potential considerations regarding local heterogeneity of imputation accuracy in relationship to salmon’s high genomic complexity as well as occurrence of structural variation elements. Finally, we discuss possible strengths and weaknesses of different imputation approaches relative to our experimental sample limitations. M-ABG Master Thesis Atlantic salmon Salmo salar Open archive Norwegian University of Life Sciences: Brage NMBU
institution Open Polar
collection Open archive Norwegian University of Life Sciences: Brage NMBU
op_collection_id ftunivmob
language English
topic Atlantic salmon
Salmo salar
Genotype imputation
Genome assembly
Long read sequencing
VDP::Landbruks- og Fiskerifag: 900
spellingShingle Atlantic salmon
Salmo salar
Genotype imputation
Genome assembly
Long read sequencing
VDP::Landbruks- og Fiskerifag: 900
Manousi, Domniki
Assessing the Effects of the New Atlantic Salmon (Salmo salar) Genome Assembly on Imputation Accuracy
topic_facet Atlantic salmon
Salmo salar
Genotype imputation
Genome assembly
Long read sequencing
VDP::Landbruks- og Fiskerifag: 900
description The Atlantic salmon is one of the most economically important species in modern-day aquaculture. For this reason, a lot of effort has been put into implementation and improvement of breeding programs for this species, achieving vast genetic process in a considerably short period of time. Improvements in sequencing technologies have facilitated the use of genomic selection, integrating molecular genetic information and increasing selection response for key production traits of polygenic architecture. However, implementation of genomic selection requires large, densely genotyped populations, which can prove challenging, especially considering aquatic populations. Genotype imputation therefore, constitutes a cost-efficient method that amplifies the genotyping density of large populations, allowing them to be analyzed in low-density and cost genotyping platforms. Although at the time of the first Atlantic salmon genome assembly leading sequencing and bioinformatic methods were applied to assemble the genome reference, the high genomic complexity of the species severely impacted the quality of the produced assembly. Assembly errors are expected to primarily affect genotyping quality and consequently all downstream analyses. The recent release of a new genome assembly for Atlantic salmon (NCBI GeneBank reference: GCA_905237065.2), constructed using long-read sequencing technologies, is expected to improve our understanding of salmon genetics and genomics as well as contribute to the application of higher-quality genomic data in salmon breeding. In this study we explored the improvements achieved in the new genome assembly, as these were realized through a genotype imputation analysis using a small sample of immediate relatives. We report large structural changes occurring in the new genome assembly and discuss their impact on imputation accuracy as well as on currently available genotyping platforms. We also provide potential considerations regarding local heterogeneity of imputation accuracy in relationship to salmon’s high genomic complexity as well as occurrence of structural variation elements. Finally, we discuss possible strengths and weaknesses of different imputation approaches relative to our experimental sample limitations. M-ABG
format Master Thesis
author Manousi, Domniki
author_facet Manousi, Domniki
author_sort Manousi, Domniki
title Assessing the Effects of the New Atlantic Salmon (Salmo salar) Genome Assembly on Imputation Accuracy
title_short Assessing the Effects of the New Atlantic Salmon (Salmo salar) Genome Assembly on Imputation Accuracy
title_full Assessing the Effects of the New Atlantic Salmon (Salmo salar) Genome Assembly on Imputation Accuracy
title_fullStr Assessing the Effects of the New Atlantic Salmon (Salmo salar) Genome Assembly on Imputation Accuracy
title_full_unstemmed Assessing the Effects of the New Atlantic Salmon (Salmo salar) Genome Assembly on Imputation Accuracy
title_sort assessing the effects of the new atlantic salmon (salmo salar) genome assembly on imputation accuracy
publisher Norwegian University of Life Sciences, Ås
publishDate 2021
url https://hdl.handle.net/11250/2772899
genre Atlantic salmon
Salmo salar
genre_facet Atlantic salmon
Salmo salar
op_source 44
op_relation https://hdl.handle.net/11250/2772899
op_rights Attribution-NonCommercial-NoDerivatives 4.0 Internasjonal
http://creativecommons.org/licenses/by-nc-nd/4.0/deed.no
op_rightsnorm CC-BY-NC-ND
_version_ 1766360876244271104