Gene and repeat annotation for common eider (Somateria mollissima) ...

Here we provide the gene and repeat annotation for common eider (Somateria mollissima). It is unfortunately currently not possible to upload repeat annotation tracks to an international nucleotide sequence database such as ENA. While uploading the gene annotation is possible, some of the cross refer...

Full description

Bibliographic Details
Main Author: Tørresen, Ole K.
Format: Dataset
Language:unknown
Published: Zenodo 2024
Subjects:
Online Access:https://dx.doi.org/10.5281/zenodo.11159637
https://zenodo.org/doi/10.5281/zenodo.11159637
Description
Summary:Here we provide the gene and repeat annotation for common eider (Somateria mollissima). It is unfortunately currently not possible to upload repeat annotation tracks to an international nucleotide sequence database such as ENA. While uploading the gene annotation is possible, some of the cross references to different databases in the functional annotation is removed. Further, the names of the entries in the publicly available genome assemblies on ENA have different names that what is found in the annotation tracks here, so we also provide the FASTA files for the assemblies. Ideally, all this should have been available via ENA. We annotated the genome assemblies using a pre-release version of the EBP-Nor genome annotation pipeline (https://github.com/ebp-nor/GenomeAnnotation). First, AGAT (https://zenodo.org/record/7255559) agat_sp_keep_longest_isoform.pl and agat_sp_extract_sequences.pl were used on the GRCg7b (GCA_016699485.1) chicken genome assembly and annotation to generate one protein (the longest ...