The gyrfalcon (Falcon rusticolus) genome

High-quality genome assemblies are characterized by high sequence contiguity, completeness, and a low error rate, thus providing the basis for a wide array of studies focusing on natural species ecology, conservation, evolutionary, and population genomics. To provide this valuable resource for conse...

Full description

Bibliographic Details
Published in:G3 Genes|Genomes|Genetics
Main Authors: Zuccolo, Andrea, Mfarrej, Sara, Celii, Mirko, Mussurova, Saule, Rivera, Luis, Llaca, Victor, Mohammed, Nahed Abdullah, Pain, Arnab, Alrefaei, Abdulmajeed Fahad, Alrefaei, Abdulwahed Fahad, Wing, Rod Anthony
Other Authors: Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST) , Thuwal, 23955-6900 , Saudi Arabia, King Abdullah University of Science and Technology (KAUST), Pathogen Genomics Laboratory, Biological and Environmental Science and Engineering (BESE) , Thuwal-Jeddah, 23955-6900 , Saudi Arabia, Biological and Environmental Science and Engineering (BESE) Division, Center for Desert Agriculture, Bioscience Program, Plant Science, Environmental Science and Engineering Program, Crop Science Research Center, Sant’Anna School of Advanced Studies , Piazza Martiri della Libertà 33, 56127 Pisa , Italy, Research and Development, Corteva Agriscience , Johnston, Iowa 50131 , USA, Jamoum University College, Department of Biology, Umm Al-Qura University , Mecca 24382 , Saudi Arabia, Department of Zoology, College of Science, King Saud University , P.O. Box 2455, Riyadh 11451 , Saudi Arabia
Format: Article in Journal/Newspaper
Language:unknown
Published: Oxford University Press (OUP) 2023
Subjects:
Online Access:http://hdl.handle.net/10754/686860
https://doi.org/10.1093/g3journal/jkad001
_version_ 1821507927418077184
author Zuccolo, Andrea
Mfarrej, Sara
Celii, Mirko
Mussurova, Saule
Rivera, Luis
Llaca, Victor
Mohammed, Nahed Abdullah
Pain, Arnab
Alrefaei, Abdulmajeed Fahad
Alrefaei, Abdulwahed Fahad
Wing, Rod Anthony
author2 Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST) , Thuwal, 23955-6900 , Saudi Arabia
King Abdullah University of Science and Technology (KAUST), Pathogen Genomics Laboratory, Biological and Environmental Science and Engineering (BESE) , Thuwal-Jeddah, 23955-6900 , Saudi Arabia
Biological and Environmental Science and Engineering (BESE) Division
Center for Desert Agriculture
Bioscience Program
Plant Science
Environmental Science and Engineering Program
Crop Science Research Center, Sant’Anna School of Advanced Studies , Piazza Martiri della Libertà 33, 56127 Pisa , Italy
Research and Development, Corteva Agriscience , Johnston, Iowa 50131 , USA
Jamoum University College, Department of Biology, Umm Al-Qura University , Mecca 24382 , Saudi Arabia
Department of Zoology, College of Science, King Saud University , P.O. Box 2455, Riyadh 11451 , Saudi Arabia
author_facet Zuccolo, Andrea
Mfarrej, Sara
Celii, Mirko
Mussurova, Saule
Rivera, Luis
Llaca, Victor
Mohammed, Nahed Abdullah
Pain, Arnab
Alrefaei, Abdulmajeed Fahad
Alrefaei, Abdulwahed Fahad
Wing, Rod Anthony
author_sort Zuccolo, Andrea
collection King Abdullah University of Science and Technology: KAUST Repository
container_issue 3
container_title G3 Genes|Genomes|Genetics
container_volume 13
description High-quality genome assemblies are characterized by high sequence contiguity, completeness, and a low error rate, thus providing the basis for a wide array of studies focusing on natural species ecology, conservation, evolutionary, and population genomics. To provide this valuable resource for conservation projects and comparative genomics studies on gyrfalcon (Falco rusticolus), we sequenced and assembled the genome of this species using third-generation sequencing strategies and optical maps. Here, we describe a highly contiguous and complete genome assembly comprising 20 scaffolds and 13 contigs with a total size of 1.193 Gbp, including 8,064 complete Benchmarking Universal Single-Copy Orthologs (BUSCOs) of the total 8,338 BUSCO groups present in the library aves_odb10. Of these BUSCO genes, 96.7% were complete, 96.1% were present as a single copy, and 0.6% were duplicated. Furthermore, 0.8% BUSCO genes were fragmented and 2.5% (210) were missing. A de novo search for transposable elements (TEs) identified 5,716 TEs that masked 7.61% of the F. rusticolus genome assembly when combined with publicly available TE collections. Long interspersed nuclear elements, in particular, the element Chicken-repeat 1 (CR1), were the most abundant TEs in the F. rusticolus genome. A de novo first-pass gene annotation was performed using 293,349 PacBio Iso-Seq transcripts and 496,195 transcripts derived from the assembly of 42,429,525 Illumina PE RNA-seq reads. In all, 19,602 putative genes, of which 59.31% were functionally characterized and associated with Gene Ontology terms, were annotated. A comparison of the gyrfalcon genome assembly with the publicly available assemblies of the domestic chicken (Gallus gallus), zebra finch (Taeniopygia guttata), and hummingbird (Calypte anna) revealed several genome rearrangements. In particular, nine putative chromosome fusions were identified in the gyrfalcon genome assembly compared with those in the G. gallus genome assembly. This genome assembly, its annotation for TEs and genes and ...
format Article in Journal/Newspaper
genre Falco rusticolus
gyrfalcon
genre_facet Falco rusticolus
gyrfalcon
geographic Finch
geographic_facet Finch
id ftkingabdullahun:oai:repository.kaust.edu.sa:10754/686860
institution Open Polar
language unknown
long_lat ENVELOPE(167.383,167.383,-72.567,-72.567)
op_collection_id ftkingabdullahun
op_doi https://doi.org/10.1093/g3journal/jkad00110.25387/g3.21769628
op_relation DOI:10.25387/g3.21769628
https://academic.oup.com/g3journal/advance-article/doi/10.1093/g3journal/jkad001/6972330
Zuccolo, A., Mfarrej, S., Celii, M., Mussurova, S., Rivera, L. F., Llaca, V., Mohammed, N., Pain, A., Alrefaei, A. F., Alrefaei, A. F., & Wing, R. A. (2023). The gyrfalcon (Falcon rusticolus) genome. G3 Genes%7CGenomes%7CGenetics. https://doi.org/10.1093/g3journal/jkad001
doi:10.1093/g3journal/jkad001
2160-1836
G3 Genes%7CGenomes%7CGenetics
http://hdl.handle.net/10754/686860
op_rights Archived with thanks to G3 Genes|Genomes|Genetics under a Creative Commons license, details at: https://creativecommons.org/licenses/by/4.0/
https://creativecommons.org/licenses/by/4.0/
publishDate 2023
publisher Oxford University Press (OUP)
record_format openpolar
spelling ftkingabdullahun:oai:repository.kaust.edu.sa:10754/686860 2025-01-16T21:48:33+00:00 The gyrfalcon (Falcon rusticolus) genome Zuccolo, Andrea Mfarrej, Sara Celii, Mirko Mussurova, Saule Rivera, Luis Llaca, Victor Mohammed, Nahed Abdullah Pain, Arnab Alrefaei, Abdulmajeed Fahad Alrefaei, Abdulwahed Fahad Wing, Rod Anthony Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST) , Thuwal, 23955-6900 , Saudi Arabia King Abdullah University of Science and Technology (KAUST), Pathogen Genomics Laboratory, Biological and Environmental Science and Engineering (BESE) , Thuwal-Jeddah, 23955-6900 , Saudi Arabia Biological and Environmental Science and Engineering (BESE) Division Center for Desert Agriculture Bioscience Program Plant Science Environmental Science and Engineering Program Crop Science Research Center, Sant’Anna School of Advanced Studies , Piazza Martiri della Libertà 33, 56127 Pisa , Italy Research and Development, Corteva Agriscience , Johnston, Iowa 50131 , USA Jamoum University College, Department of Biology, Umm Al-Qura University , Mecca 24382 , Saudi Arabia Department of Zoology, College of Science, King Saud University , P.O. Box 2455, Riyadh 11451 , Saudi Arabia 2023-01-05 application/pdf application/zip http://hdl.handle.net/10754/686860 https://doi.org/10.1093/g3journal/jkad001 unknown Oxford University Press (OUP) DOI:10.25387/g3.21769628 https://academic.oup.com/g3journal/advance-article/doi/10.1093/g3journal/jkad001/6972330 Zuccolo, A., Mfarrej, S., Celii, M., Mussurova, S., Rivera, L. F., Llaca, V., Mohammed, N., Pain, A., Alrefaei, A. F., Alrefaei, A. F., & Wing, R. A. (2023). The gyrfalcon (Falcon rusticolus) genome. G3 Genes%7CGenomes%7CGenetics. https://doi.org/10.1093/g3journal/jkad001 doi:10.1093/g3journal/jkad001 2160-1836 G3 Genes%7CGenomes%7CGenetics http://hdl.handle.net/10754/686860 Archived with thanks to G3 Genes|Genomes|Genetics under a Creative Commons license, details at: https://creativecommons.org/licenses/by/4.0/ https://creativecommons.org/licenses/by/4.0/ Article 2023 ftkingabdullahun https://doi.org/10.1093/g3journal/jkad00110.25387/g3.21769628 2023-12-09T20:21:05Z High-quality genome assemblies are characterized by high sequence contiguity, completeness, and a low error rate, thus providing the basis for a wide array of studies focusing on natural species ecology, conservation, evolutionary, and population genomics. To provide this valuable resource for conservation projects and comparative genomics studies on gyrfalcon (Falco rusticolus), we sequenced and assembled the genome of this species using third-generation sequencing strategies and optical maps. Here, we describe a highly contiguous and complete genome assembly comprising 20 scaffolds and 13 contigs with a total size of 1.193 Gbp, including 8,064 complete Benchmarking Universal Single-Copy Orthologs (BUSCOs) of the total 8,338 BUSCO groups present in the library aves_odb10. Of these BUSCO genes, 96.7% were complete, 96.1% were present as a single copy, and 0.6% were duplicated. Furthermore, 0.8% BUSCO genes were fragmented and 2.5% (210) were missing. A de novo search for transposable elements (TEs) identified 5,716 TEs that masked 7.61% of the F. rusticolus genome assembly when combined with publicly available TE collections. Long interspersed nuclear elements, in particular, the element Chicken-repeat 1 (CR1), were the most abundant TEs in the F. rusticolus genome. A de novo first-pass gene annotation was performed using 293,349 PacBio Iso-Seq transcripts and 496,195 transcripts derived from the assembly of 42,429,525 Illumina PE RNA-seq reads. In all, 19,602 putative genes, of which 59.31% were functionally characterized and associated with Gene Ontology terms, were annotated. A comparison of the gyrfalcon genome assembly with the publicly available assemblies of the domestic chicken (Gallus gallus), zebra finch (Taeniopygia guttata), and hummingbird (Calypte anna) revealed several genome rearrangements. In particular, nine putative chromosome fusions were identified in the gyrfalcon genome assembly compared with those in the G. gallus genome assembly. This genome assembly, its annotation for TEs and genes and ... Article in Journal/Newspaper Falco rusticolus gyrfalcon King Abdullah University of Science and Technology: KAUST Repository Finch ENVELOPE(167.383,167.383,-72.567,-72.567) G3 Genes|Genomes|Genetics 13 3
spellingShingle Zuccolo, Andrea
Mfarrej, Sara
Celii, Mirko
Mussurova, Saule
Rivera, Luis
Llaca, Victor
Mohammed, Nahed Abdullah
Pain, Arnab
Alrefaei, Abdulmajeed Fahad
Alrefaei, Abdulwahed Fahad
Wing, Rod Anthony
The gyrfalcon (Falcon rusticolus) genome
title The gyrfalcon (Falcon rusticolus) genome
title_full The gyrfalcon (Falcon rusticolus) genome
title_fullStr The gyrfalcon (Falcon rusticolus) genome
title_full_unstemmed The gyrfalcon (Falcon rusticolus) genome
title_short The gyrfalcon (Falcon rusticolus) genome
title_sort gyrfalcon (falcon rusticolus) genome
url http://hdl.handle.net/10754/686860
https://doi.org/10.1093/g3journal/jkad001