Supporting Data for: The genome of the pygmy right whale illuminates the evolution of rorquals ...

Background Baleen whales are a clade of gigantic and highly specialized marine mammals. Their genomes have been used to investigate their complex evolutionary history and to decipher the molecular mechanisms that allowed them to reach these dimensions. However, many unanswered questions remain, espe...

Full description

Bibliographic Details
Main Authors: Wolf, Magnus, Zapf, Konstantin, Gupta, Deepak Kumar, Hiller, Michael, Árnason, Ulfur, Janke, Axel
Format: Dataset
Language:English
Published: Dryad 2022
Subjects:
Online Access:https://dx.doi.org/10.5061/dryad.9zw3r22j0
https://datadryad.org/stash/dataset/doi:10.5061/dryad.9zw3r22j0
Description
Summary:Background Baleen whales are a clade of gigantic and highly specialized marine mammals. Their genomes have been used to investigate their complex evolutionary history and to decipher the molecular mechanisms that allowed them to reach these dimensions. However, many unanswered questions remain, especially about the early radiation of rorquals and how cancer resistance interplays with their huge number of cells. The pygmy right whale is the smallest and most elusive among the baleen whales. It reaches only a fraction of the body length compared to its relatives and it is the only living member of an otherwise extinct family. This placement makes the pygmy right whale genome an interesting target to update the complex phylogenetic past of baleen whales, because it splits up an otherwise long branch that leads to the radiation of rorquals. Apart from that, genomic data of this species might help to investigate cancer resistance in large whales, since these mechanisms are not as important for the pygmy right ... : Author for correspondence: Magnus Wolf (Magnus.Wolf@senckenberg.de) The here deposited data is the result of a whole genome sequencing project of the pygmy right whale (Caperea marginata, Gray 1846). Apart of the genome construction, this project includes a phylogenomic revision of the rorqual clade and a positive selection analysis to find genes related to body size and hence cancer resistance in baleen whales. This deposition is composed of: Code to create phylogenomic trees: 1.) A zip file including the main script written in UNIX bash as well as the necessary subscripts and an extensive README file containing necessary instructions. (filename: GEMOMA-to-Phylogeny.zip) Genome data (Cmar): 1.) A raw whole genome assembly without changes made by NCBI in fasta format. (filename: Cmar_C18_SBIK-F_TBG_v1.fasta.gz) 2.) A homology-based genome annotation of the newly constructed genome, including a gff table and an amino acid fasta file. (filename gff: Cmar_C18_SBIK-F_TBG_v1_annotation.gff.gz; filename fasta: ...