Comparative genomic data of the Avian Phylogenomics Project

Background: The evolutionary relationships of modern birds are among the most challenging to understand in systematic biology and have been debated for centuries. To address this challenge, we assembled or collected the genomes of 48 avian species spanning most orders of birds, including all Neognat...

Full description

Bibliographic Details
Published in:GigaScience
Main Authors: Zhang, Guojie, Li, Bo, Li, Cai, Gilbert, M. Thomas P., Jarvis, Erich D., Wang, Jun, The Avian Genome Consortium, Burt, David W.
Format: Article in Journal/Newspaper
Language:English
Published: BioMed Central 2014
Subjects:
Online Access:https://espace.library.uq.edu.au/view/UQ:688047
id ftunivqespace:oai:espace.library.uq.edu.au:UQ:688047
record_format openpolar
spelling ftunivqespace:oai:espace.library.uq.edu.au:UQ:688047 2023-05-15T13:04:57+02:00 Comparative genomic data of the Avian Phylogenomics Project Zhang, Guojie Li, Bo Li, Cai Gilbert, M. Thomas P. Jarvis, Erich D. Wang, Jun The Avian Genome Consortium Burt, David W. 2014-12-01 https://espace.library.uq.edu.au/view/UQ:688047 eng eng BioMed Central doi:10.1186/2047-217X-3-26 issn:2047-217X orcid:0000-0002-9991-1028 Not set 300837 DNRF94 R52-A5062 DP1OD000448 10-081390 Avian genomes Phylogenomics Whole genome sequencing Journal Article 2014 ftunivqespace https://doi.org/10.1186/2047-217X-3-26 2020-12-08T02:15:23Z Background: The evolutionary relationships of modern birds are among the most challenging to understand in systematic biology and have been debated for centuries. To address this challenge, we assembled or collected the genomes of 48 avian species spanning most orders of birds, including all Neognathae and two of the five Palaeognathae orders, and used the genomes to construct a genome-scale avian phylogenetic tree and perform comparative genomics analyses (Jarvis et al. in press; Zhang et al. in press). Here we release assemblies and datasets associated with the comparative genome analyses, which include 38 newly sequenced avian genomes plus previously released or simultaneously released genomes of Chicken, Zebra finch, Turkey, Pigeon, Peregrine falcon, Duck, Budgerigar, Adelie penguin, Emperor penguin and the Medium Ground Finch. We hope that this resource will serve future efforts in phylogenomics and comparative genomics.Findings: The 38 bird genomes were sequenced using the Illumina HiSeq 2000 platform and assembled using a whole genome shotgun strategy. The 48 genomes were categorized into two groups according to the N50 scaffold size of the assemblies: a high depth group comprising 23 species sequenced at high coverage (>50X) with multiple insert size libraries resulting in N50 scaffold sizes greater than 1 Mb (except the White-throated Tinamou and Bald Eagle); and a low depth group comprising 25 species sequenced at a low coverage (similar to 30X) with two insert size libraries resulting in an average N50 scaffold size of about 50 kb. Repetitive elements comprised 4%-22% of the bird genomes. The assembled scaffolds allowed the homology-based annotation of 13,000 similar to 17000 protein coding genes in each avian genome relative to chicken, zebra finch and human, as well as comparative and sequence conservation analyses.Conclusions: Here we release full genome assemblies of 38 newly sequenced avian species, link genome assembly downloads for the 7 of the remaining 10 species, and provide a guideline of genomic data that has been generated and used in our Avian Phylogenomics Project. To the best of our knowledge, the Avian Phylogenomics Project is the biggest vertebrate comparative genomics project to date. The genomic data presented here is expected to accelerate further analyses in many fields, including phylogenetics, comparative genomics, evolution, neurobiology, development biology, and other related areas. Article in Journal/Newspaper Adelie penguin peregrine falcon The University of Queensland: UQ eSpace Finch ENVELOPE(167.383,167.383,-72.567,-72.567) GigaScience 3 1
institution Open Polar
collection The University of Queensland: UQ eSpace
op_collection_id ftunivqespace
language English
topic Avian genomes
Phylogenomics
Whole genome sequencing
spellingShingle Avian genomes
Phylogenomics
Whole genome sequencing
Zhang, Guojie
Li, Bo
Li, Cai
Gilbert, M. Thomas P.
Jarvis, Erich D.
Wang, Jun
The Avian Genome Consortium
Burt, David W.
Comparative genomic data of the Avian Phylogenomics Project
topic_facet Avian genomes
Phylogenomics
Whole genome sequencing
description Background: The evolutionary relationships of modern birds are among the most challenging to understand in systematic biology and have been debated for centuries. To address this challenge, we assembled or collected the genomes of 48 avian species spanning most orders of birds, including all Neognathae and two of the five Palaeognathae orders, and used the genomes to construct a genome-scale avian phylogenetic tree and perform comparative genomics analyses (Jarvis et al. in press; Zhang et al. in press). Here we release assemblies and datasets associated with the comparative genome analyses, which include 38 newly sequenced avian genomes plus previously released or simultaneously released genomes of Chicken, Zebra finch, Turkey, Pigeon, Peregrine falcon, Duck, Budgerigar, Adelie penguin, Emperor penguin and the Medium Ground Finch. We hope that this resource will serve future efforts in phylogenomics and comparative genomics.Findings: The 38 bird genomes were sequenced using the Illumina HiSeq 2000 platform and assembled using a whole genome shotgun strategy. The 48 genomes were categorized into two groups according to the N50 scaffold size of the assemblies: a high depth group comprising 23 species sequenced at high coverage (>50X) with multiple insert size libraries resulting in N50 scaffold sizes greater than 1 Mb (except the White-throated Tinamou and Bald Eagle); and a low depth group comprising 25 species sequenced at a low coverage (similar to 30X) with two insert size libraries resulting in an average N50 scaffold size of about 50 kb. Repetitive elements comprised 4%-22% of the bird genomes. The assembled scaffolds allowed the homology-based annotation of 13,000 similar to 17000 protein coding genes in each avian genome relative to chicken, zebra finch and human, as well as comparative and sequence conservation analyses.Conclusions: Here we release full genome assemblies of 38 newly sequenced avian species, link genome assembly downloads for the 7 of the remaining 10 species, and provide a guideline of genomic data that has been generated and used in our Avian Phylogenomics Project. To the best of our knowledge, the Avian Phylogenomics Project is the biggest vertebrate comparative genomics project to date. The genomic data presented here is expected to accelerate further analyses in many fields, including phylogenetics, comparative genomics, evolution, neurobiology, development biology, and other related areas.
format Article in Journal/Newspaper
author Zhang, Guojie
Li, Bo
Li, Cai
Gilbert, M. Thomas P.
Jarvis, Erich D.
Wang, Jun
The Avian Genome Consortium
Burt, David W.
author_facet Zhang, Guojie
Li, Bo
Li, Cai
Gilbert, M. Thomas P.
Jarvis, Erich D.
Wang, Jun
The Avian Genome Consortium
Burt, David W.
author_sort Zhang, Guojie
title Comparative genomic data of the Avian Phylogenomics Project
title_short Comparative genomic data of the Avian Phylogenomics Project
title_full Comparative genomic data of the Avian Phylogenomics Project
title_fullStr Comparative genomic data of the Avian Phylogenomics Project
title_full_unstemmed Comparative genomic data of the Avian Phylogenomics Project
title_sort comparative genomic data of the avian phylogenomics project
publisher BioMed Central
publishDate 2014
url https://espace.library.uq.edu.au/view/UQ:688047
long_lat ENVELOPE(167.383,167.383,-72.567,-72.567)
geographic Finch
geographic_facet Finch
genre Adelie penguin
peregrine falcon
genre_facet Adelie penguin
peregrine falcon
op_relation doi:10.1186/2047-217X-3-26
issn:2047-217X
orcid:0000-0002-9991-1028
Not set
300837
DNRF94
R52-A5062
DP1OD000448
10-081390
op_doi https://doi.org/10.1186/2047-217X-3-26
container_title GigaScience
container_volume 3
container_issue 1
_version_ 1766375955774832640