An improved genome assembly uncovers prolific tandem repeats in Atlantic cod

Abstract Background The first Atlantic cod (Gadus morhua) genome assembly published in 2011 was one of the early genome assemblies exclusively based on high-throughput 454 pyrosequencing. Since then, rapid advances in sequencing technologies have led to a multitude of assemblies generated for comple...

Full description

Bibliographic Details
Main Authors: TøRresen, Ole, Star, Bastiaan, Jentoft, Sissel, Reinar, William, Grove, Harald, Miller, Jason, Walenz, Brian, Knight, James, Ekholm, Jenny, Peluso, Paul, Edvardsen, Rolf, Tooming-Klunderud, Ave, Skage, Morten, SigbjøRn Lien, Kjetill Jakobsen, Nederbragt, Alexander
Format: Article in Journal/Newspaper
Language:unknown
Published: Figshare 2017
Subjects:
Online Access:https://dx.doi.org/10.6084/m9.figshare.c.3668749.v1
https://figshare.com/collections/An_improved_genome_assembly_uncovers_prolific_tandem_repeats_in_Atlantic_cod/3668749/1
id ftdatacite:10.6084/m9.figshare.c.3668749.v1
record_format openpolar
spelling ftdatacite:10.6084/m9.figshare.c.3668749.v1 2023-05-15T15:26:50+02:00 An improved genome assembly uncovers prolific tandem repeats in Atlantic cod TøRresen, Ole Star, Bastiaan Jentoft, Sissel Reinar, William Grove, Harald Miller, Jason Walenz, Brian Knight, James Ekholm, Jenny Peluso, Paul Edvardsen, Rolf Tooming-Klunderud, Ave Skage, Morten SigbjøRn Lien Kjetill Jakobsen Nederbragt, Alexander 2017 https://dx.doi.org/10.6084/m9.figshare.c.3668749.v1 https://figshare.com/collections/An_improved_genome_assembly_uncovers_prolific_tandem_repeats_in_Atlantic_cod/3668749/1 unknown Figshare https://dx.doi.org/10.1186/s12864-016-3448-x https://dx.doi.org/10.6084/m9.figshare.c.3668749 CC BY 4.0 https://creativecommons.org/licenses/by/4.0 CC-BY Genetics FOS Biological sciences Evolutionary Biology Ecology 69999 Biological Sciences not elsewhere classified Marine Biology Collection article 2017 ftdatacite https://doi.org/10.6084/m9.figshare.c.3668749.v1 https://doi.org/10.1186/s12864-016-3448-x https://doi.org/10.6084/m9.figshare.c.3668749 2021-11-05T12:55:41Z Abstract Background The first Atlantic cod (Gadus morhua) genome assembly published in 2011 was one of the early genome assemblies exclusively based on high-throughput 454 pyrosequencing. Since then, rapid advances in sequencing technologies have led to a multitude of assemblies generated for complex genomes, although many of these are of a fragmented nature with a significant fraction of bases in gaps. The development of long-read sequencing and improved software now enable the generation of more contiguous genome assemblies. Results By combining data from Illumina, 454 and the longer PacBio sequencing technologies, as well as integrating the results of multiple assembly programs, we have created a substantially improved version of the Atlantic cod genome assembly. The sequence contiguity of this assembly is increased fifty-fold and the proportion of gap-bases has been reduced fifteen-fold. Compared to other vertebrates, the assembly contains an unusual high density of tandem repeats (TRs). Indeed, retrospective analyses reveal that gaps in the first genome assembly were largely associated with these TRs. We show that 21% of the TRs across the assembly, 19% in the promoter regions and 12% in the coding sequences are heterozygous in the sequenced individual. Conclusions The inclusion of PacBio reads combined with the use of multiple assembly programs drastically improved the Atlantic cod genome assembly by successfully resolving long TRs. The high frequency of heterozygous TRs within or in the vicinity of genes in the genome indicate a considerable standing genomic variation in Atlantic cod populations, which is likely of evolutionary importance. Article in Journal/Newspaper atlantic cod Gadus morhua DataCite Metadata Store (German National Library of Science and Technology)
institution Open Polar
collection DataCite Metadata Store (German National Library of Science and Technology)
op_collection_id ftdatacite
language unknown
topic Genetics
FOS Biological sciences
Evolutionary Biology
Ecology
69999 Biological Sciences not elsewhere classified
Marine Biology
spellingShingle Genetics
FOS Biological sciences
Evolutionary Biology
Ecology
69999 Biological Sciences not elsewhere classified
Marine Biology
TøRresen, Ole
Star, Bastiaan
Jentoft, Sissel
Reinar, William
Grove, Harald
Miller, Jason
Walenz, Brian
Knight, James
Ekholm, Jenny
Peluso, Paul
Edvardsen, Rolf
Tooming-Klunderud, Ave
Skage, Morten
SigbjøRn Lien
Kjetill Jakobsen
Nederbragt, Alexander
An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
topic_facet Genetics
FOS Biological sciences
Evolutionary Biology
Ecology
69999 Biological Sciences not elsewhere classified
Marine Biology
description Abstract Background The first Atlantic cod (Gadus morhua) genome assembly published in 2011 was one of the early genome assemblies exclusively based on high-throughput 454 pyrosequencing. Since then, rapid advances in sequencing technologies have led to a multitude of assemblies generated for complex genomes, although many of these are of a fragmented nature with a significant fraction of bases in gaps. The development of long-read sequencing and improved software now enable the generation of more contiguous genome assemblies. Results By combining data from Illumina, 454 and the longer PacBio sequencing technologies, as well as integrating the results of multiple assembly programs, we have created a substantially improved version of the Atlantic cod genome assembly. The sequence contiguity of this assembly is increased fifty-fold and the proportion of gap-bases has been reduced fifteen-fold. Compared to other vertebrates, the assembly contains an unusual high density of tandem repeats (TRs). Indeed, retrospective analyses reveal that gaps in the first genome assembly were largely associated with these TRs. We show that 21% of the TRs across the assembly, 19% in the promoter regions and 12% in the coding sequences are heterozygous in the sequenced individual. Conclusions The inclusion of PacBio reads combined with the use of multiple assembly programs drastically improved the Atlantic cod genome assembly by successfully resolving long TRs. The high frequency of heterozygous TRs within or in the vicinity of genes in the genome indicate a considerable standing genomic variation in Atlantic cod populations, which is likely of evolutionary importance.
format Article in Journal/Newspaper
author TøRresen, Ole
Star, Bastiaan
Jentoft, Sissel
Reinar, William
Grove, Harald
Miller, Jason
Walenz, Brian
Knight, James
Ekholm, Jenny
Peluso, Paul
Edvardsen, Rolf
Tooming-Klunderud, Ave
Skage, Morten
SigbjøRn Lien
Kjetill Jakobsen
Nederbragt, Alexander
author_facet TøRresen, Ole
Star, Bastiaan
Jentoft, Sissel
Reinar, William
Grove, Harald
Miller, Jason
Walenz, Brian
Knight, James
Ekholm, Jenny
Peluso, Paul
Edvardsen, Rolf
Tooming-Klunderud, Ave
Skage, Morten
SigbjøRn Lien
Kjetill Jakobsen
Nederbragt, Alexander
author_sort TøRresen, Ole
title An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
title_short An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
title_full An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
title_fullStr An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
title_full_unstemmed An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
title_sort improved genome assembly uncovers prolific tandem repeats in atlantic cod
publisher Figshare
publishDate 2017
url https://dx.doi.org/10.6084/m9.figshare.c.3668749.v1
https://figshare.com/collections/An_improved_genome_assembly_uncovers_prolific_tandem_repeats_in_Atlantic_cod/3668749/1
genre atlantic cod
Gadus morhua
genre_facet atlantic cod
Gadus morhua
op_relation https://dx.doi.org/10.1186/s12864-016-3448-x
https://dx.doi.org/10.6084/m9.figshare.c.3668749
op_rights CC BY 4.0
https://creativecommons.org/licenses/by/4.0
op_rightsnorm CC-BY
op_doi https://doi.org/10.6084/m9.figshare.c.3668749.v1
https://doi.org/10.1186/s12864-016-3448-x
https://doi.org/10.6084/m9.figshare.c.3668749
_version_ 1766357321263349760