An improved genome assembly uncovers prolific tandem repeats in Atlantic cod

Background The first Atlantic cod (Gadus morhua) genome assembly published in 2011 was one of the early genome assemblies exclusively based on high-throughput 454 pyrosequencing. Since then, rapid advances in sequencing technologies have led to a multitude of assemblies generated for complex genomes...

Full description

Bibliographic Details
Published in:BMC Genomics
Main Authors: Tørresen, Ole K, Star, Bastiaan, Jentoft, Sissel, Reinar, William B, Grove, Harald, Miller, Jason R, Walenz, Brian P, Knight, James, Ekholm, Jenny M, Peluso, Paul, Edvardsen, Rolf B, Tooming-Klungerud, Ave, Skage, Morten, Lien, Sigbjørn, Jakobsen, Kjetill S, Nederbragt, Alexander J
Format: Article in Journal/Newspaper
Language:English
Published: Biomed Central 2017
Subjects:
Online Access:http://hdl.handle.net/11250/2490248
https://doi.org/10.1186/s12864-016-3448-x
id ftagderuniv:oai:uia.brage.unit.no:11250/2490248
record_format openpolar
spelling ftagderuniv:oai:uia.brage.unit.no:11250/2490248 2023-05-15T15:26:47+02:00 An improved genome assembly uncovers prolific tandem repeats in Atlantic cod Tørresen, Ole K Star, Bastiaan Jentoft, Sissel Reinar, William B Grove, Harald Miller, Jason R Walenz, Brian P Knight, James Ekholm, Jenny M Peluso, Paul Edvardsen, Rolf B Tooming-Klungerud, Ave Skage, Morten Lien, Sigbjørn Jakobsen, Kjetill S Nederbragt, Alexander J 2017 application/pdf http://hdl.handle.net/11250/2490248 https://doi.org/10.1186/s12864-016-3448-x eng eng Biomed Central BMC Genomics. 2017, 18:95 1-23 http://hdl.handle.net/11250/2490248 https://doi.org/10.1186/s12864-016-3448-x Navngivelse 4.0 Internasjonal http://creativecommons.org/licenses/by/4.0/deed.no CC-BY 1-23 18 BMC Genomics 1 Journal article Peer reviewed 2017 ftagderuniv https://doi.org/10.1186/s12864-016-3448-x 2022-12-11T06:51:00Z Background The first Atlantic cod (Gadus morhua) genome assembly published in 2011 was one of the early genome assemblies exclusively based on high-throughput 454 pyrosequencing. Since then, rapid advances in sequencing technologies have led to a multitude of assemblies generated for complex genomes, although many of these are of a fragmented nature with a significant fraction of bases in gaps. The development of long-read sequencing and improved software now enable the generation of more contiguous genome assemblies. Results By combining data from Illumina, 454 and the longer PacBio sequencing technologies, as well as integrating the results of multiple assembly programs, we have created a substantially improved version of the Atlantic cod genome assembly. The sequence contiguity of this assembly is increased fifty-fold and the proportion of gap-bases has been reduced fifteen-fold. Compared to other vertebrates, the assembly contains an unusual high density of tandem repeats (TRs). Indeed, retrospective analyses reveal that gaps in the first genome assembly were largely associated with these TRs. We show that 21% of the TRs across the assembly, 19% in the promoter regions and 12% in the coding sequences are heterozygous in the sequenced individual. Conclusions The inclusion of PacBio reads combined with the use of multiple assembly programs drastically improved the Atlantic cod genome assembly by successfully resolving long TRs. The high frequency of heterozygous TRs within or in the vicinity of genes in the genome indicate a considerable standing genomic variation in Atlantic cod populations, which is likely of evolutionary importance. publishedVersion nivå1 Article in Journal/Newspaper atlantic cod Gadus morhua Unvieristy of Agder: AURA (Brage) BMC Genomics 18 1
institution Open Polar
collection Unvieristy of Agder: AURA (Brage)
op_collection_id ftagderuniv
language English
description Background The first Atlantic cod (Gadus morhua) genome assembly published in 2011 was one of the early genome assemblies exclusively based on high-throughput 454 pyrosequencing. Since then, rapid advances in sequencing technologies have led to a multitude of assemblies generated for complex genomes, although many of these are of a fragmented nature with a significant fraction of bases in gaps. The development of long-read sequencing and improved software now enable the generation of more contiguous genome assemblies. Results By combining data from Illumina, 454 and the longer PacBio sequencing technologies, as well as integrating the results of multiple assembly programs, we have created a substantially improved version of the Atlantic cod genome assembly. The sequence contiguity of this assembly is increased fifty-fold and the proportion of gap-bases has been reduced fifteen-fold. Compared to other vertebrates, the assembly contains an unusual high density of tandem repeats (TRs). Indeed, retrospective analyses reveal that gaps in the first genome assembly were largely associated with these TRs. We show that 21% of the TRs across the assembly, 19% in the promoter regions and 12% in the coding sequences are heterozygous in the sequenced individual. Conclusions The inclusion of PacBio reads combined with the use of multiple assembly programs drastically improved the Atlantic cod genome assembly by successfully resolving long TRs. The high frequency of heterozygous TRs within or in the vicinity of genes in the genome indicate a considerable standing genomic variation in Atlantic cod populations, which is likely of evolutionary importance. publishedVersion nivå1
format Article in Journal/Newspaper
author Tørresen, Ole K
Star, Bastiaan
Jentoft, Sissel
Reinar, William B
Grove, Harald
Miller, Jason R
Walenz, Brian P
Knight, James
Ekholm, Jenny M
Peluso, Paul
Edvardsen, Rolf B
Tooming-Klungerud, Ave
Skage, Morten
Lien, Sigbjørn
Jakobsen, Kjetill S
Nederbragt, Alexander J
spellingShingle Tørresen, Ole K
Star, Bastiaan
Jentoft, Sissel
Reinar, William B
Grove, Harald
Miller, Jason R
Walenz, Brian P
Knight, James
Ekholm, Jenny M
Peluso, Paul
Edvardsen, Rolf B
Tooming-Klungerud, Ave
Skage, Morten
Lien, Sigbjørn
Jakobsen, Kjetill S
Nederbragt, Alexander J
An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
author_facet Tørresen, Ole K
Star, Bastiaan
Jentoft, Sissel
Reinar, William B
Grove, Harald
Miller, Jason R
Walenz, Brian P
Knight, James
Ekholm, Jenny M
Peluso, Paul
Edvardsen, Rolf B
Tooming-Klungerud, Ave
Skage, Morten
Lien, Sigbjørn
Jakobsen, Kjetill S
Nederbragt, Alexander J
author_sort Tørresen, Ole K
title An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
title_short An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
title_full An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
title_fullStr An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
title_full_unstemmed An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
title_sort improved genome assembly uncovers prolific tandem repeats in atlantic cod
publisher Biomed Central
publishDate 2017
url http://hdl.handle.net/11250/2490248
https://doi.org/10.1186/s12864-016-3448-x
genre atlantic cod
Gadus morhua
genre_facet atlantic cod
Gadus morhua
op_source 1-23
18
BMC Genomics
1
op_relation BMC Genomics. 2017, 18:95 1-23
http://hdl.handle.net/11250/2490248
https://doi.org/10.1186/s12864-016-3448-x
op_rights Navngivelse 4.0 Internasjonal
http://creativecommons.org/licenses/by/4.0/deed.no
op_rightsnorm CC-BY
op_doi https://doi.org/10.1186/s12864-016-3448-x
container_title BMC Genomics
container_volume 18
container_issue 1
_version_ 1766357264247029760