Autopolyploidy Genome Duplication Preserves Other Ancient Genome Duplications in Atlantic Salmon (Salmo salar) Supplementary Datasets

For various species, alignments were found between a protein database (produced from Zebrafish) and the sequenced genome of that species. Using Perl scripts and the alignments, gene models were identified in the various species based on the protein sequences. The gene models, for the various species...

Full description

Bibliographic Details
Main Authors: Christensen, Kris, Davidson, William
Format: Dataset
Language:unknown
Published: 2017
Subjects:
Online Access:https://zenodo.org/record/270028
https://doi.org/10.5281/zenodo.270028
id ftzenodo:oai:zenodo.org:270028
record_format openpolar
spelling ftzenodo:oai:zenodo.org:270028 2023-05-15T15:31:23+02:00 Autopolyploidy Genome Duplication Preserves Other Ancient Genome Duplications in Atlantic Salmon (Salmo salar) Supplementary Datasets Christensen, Kris Davidson, William 2017-02-07 https://zenodo.org/record/270028 https://doi.org/10.5281/zenodo.270028 unknown https://zenodo.org/record/270028 https://doi.org/10.5281/zenodo.270028 oai:zenodo.org:270028 info:eu-repo/semantics/openAccess https://creativecommons.org/licenses/by/4.0/legalcode Atlantic Salmon Genome Duplication info:eu-repo/semantics/other dataset 2017 ftzenodo https://doi.org/10.5281/zenodo.270028 2023-03-11T02:31:07Z For various species, alignments were found between a protein database (produced from Zebrafish) and the sequenced genome of that species. Using Perl scripts and the alignments, gene models were identified in the various species based on the protein sequences. The gene models, for the various species, can be found in the .gff3 files. Some of the .gff3 files have had ribosomal proteins removed. Homeologous regions were then identified using Perl scripts and can be found in .gff3 files as well. They have Homeologous_Regions.gff3 in their title. Homeologous genes in these regions were counted (named XX_XX_Homeologous_Regions.txt), and compared to all of the genes (not just homeologous genes) in these regions (named Gene_Count_Homeolgous_XX_XX_XX.txt) to find the density. Homeologous gene sequences were compared to each other to identify the Ps values between them using a program called SNAP (Files with _Homeologous_region_analysis_version_1.2.txt at the end). The analyses of these files are summarized in "Pn_Ps_Values_Vertebrate_Homeologous_Regions.ods." The synteny between species can be found in the files with .seg extensions (These can be opened in IGV). A comparison between the gene density and Ps value for each homeologous region can be found in the file, "Gene_Density_Compared_to_Ps_Values.ods." Included is an extended readme file and Perl scripts (.pl extension) in a compressed file (Final_Scripts.tar.gz). Dataset Atlantic salmon Salmo salar Zenodo
institution Open Polar
collection Zenodo
op_collection_id ftzenodo
language unknown
topic Atlantic Salmon
Genome Duplication
spellingShingle Atlantic Salmon
Genome Duplication
Christensen, Kris
Davidson, William
Autopolyploidy Genome Duplication Preserves Other Ancient Genome Duplications in Atlantic Salmon (Salmo salar) Supplementary Datasets
topic_facet Atlantic Salmon
Genome Duplication
description For various species, alignments were found between a protein database (produced from Zebrafish) and the sequenced genome of that species. Using Perl scripts and the alignments, gene models were identified in the various species based on the protein sequences. The gene models, for the various species, can be found in the .gff3 files. Some of the .gff3 files have had ribosomal proteins removed. Homeologous regions were then identified using Perl scripts and can be found in .gff3 files as well. They have Homeologous_Regions.gff3 in their title. Homeologous genes in these regions were counted (named XX_XX_Homeologous_Regions.txt), and compared to all of the genes (not just homeologous genes) in these regions (named Gene_Count_Homeolgous_XX_XX_XX.txt) to find the density. Homeologous gene sequences were compared to each other to identify the Ps values between them using a program called SNAP (Files with _Homeologous_region_analysis_version_1.2.txt at the end). The analyses of these files are summarized in "Pn_Ps_Values_Vertebrate_Homeologous_Regions.ods." The synteny between species can be found in the files with .seg extensions (These can be opened in IGV). A comparison between the gene density and Ps value for each homeologous region can be found in the file, "Gene_Density_Compared_to_Ps_Values.ods." Included is an extended readme file and Perl scripts (.pl extension) in a compressed file (Final_Scripts.tar.gz).
format Dataset
author Christensen, Kris
Davidson, William
author_facet Christensen, Kris
Davidson, William
author_sort Christensen, Kris
title Autopolyploidy Genome Duplication Preserves Other Ancient Genome Duplications in Atlantic Salmon (Salmo salar) Supplementary Datasets
title_short Autopolyploidy Genome Duplication Preserves Other Ancient Genome Duplications in Atlantic Salmon (Salmo salar) Supplementary Datasets
title_full Autopolyploidy Genome Duplication Preserves Other Ancient Genome Duplications in Atlantic Salmon (Salmo salar) Supplementary Datasets
title_fullStr Autopolyploidy Genome Duplication Preserves Other Ancient Genome Duplications in Atlantic Salmon (Salmo salar) Supplementary Datasets
title_full_unstemmed Autopolyploidy Genome Duplication Preserves Other Ancient Genome Duplications in Atlantic Salmon (Salmo salar) Supplementary Datasets
title_sort autopolyploidy genome duplication preserves other ancient genome duplications in atlantic salmon (salmo salar) supplementary datasets
publishDate 2017
url https://zenodo.org/record/270028
https://doi.org/10.5281/zenodo.270028
genre Atlantic salmon
Salmo salar
genre_facet Atlantic salmon
Salmo salar
op_relation https://zenodo.org/record/270028
https://doi.org/10.5281/zenodo.270028
oai:zenodo.org:270028
op_rights info:eu-repo/semantics/openAccess
https://creativecommons.org/licenses/by/4.0/legalcode
op_doi https://doi.org/10.5281/zenodo.270028
_version_ 1766361891562586112