Whole-genome sequencing of 13 Arctic plants and draft genomes of Oxyria digyna and Cochlearia groenlandica.

To understand the genomic characteristics of Arctic plants, we generated 28-44 Gb of short-read sequencing data from 13 Arctic plants collected from the High Arctic Svalbard. We successfully estimated the genome sizes of eight species by using the k-mer-based method (180-894 Mb). Among these plants,...

Full description

Bibliographic Details
Published in:Scientific Data
Main Authors: Kim, Jun, Lim, Jiseon, Kim, Moonkyo, Lee, Yoo Kyung
Format: Dataset
Language:English
Published: Nature Publishing Group 2024
Subjects:
Online Access:https://doi.org/10.1038/s41597-024-03569-6
https://pubmed.ncbi.nlm.nih.gov/39025921
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11258133/
Description
Summary:To understand the genomic characteristics of Arctic plants, we generated 28-44 Gb of short-read sequencing data from 13 Arctic plants collected from the High Arctic Svalbard. We successfully estimated the genome sizes of eight species by using the k-mer-based method (180-894 Mb). Among these plants, the mountain sorrel (Oxyria digyna) and Greenland scurvy grass (Cochlearia groenlandica) had relatively small genome sizes and chromosome numbers. We obtained 45 × and 121 × high-fidelity long-read sequencing data. We assembled their reads into high-quality draft genomes (genome size: 561 and 250 Mb; contig N50 length: 36.9 and 14.8 Mb, respectively), and correspondingly annotated 43,105 and 29,675 genes using ~46 and ~85 million RNA sequencing reads. We identified 765,012 and 88,959 single-nucleotide variants, and 18,082 and 7,698 structural variants (variant size ≥ 50 bp). This study provided high-quality genome assemblies of O. digyna and C. groenlandica, which are valuable resources for the population and molecular genetic studies of these plants.