Canfam-GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C

Background: The German Shepherd Dog (GSD) is one of the most common breeds on earth and has been bred for its utility and intelligence. It is often first choice for police and military work, as well as protection, disability assistance, and search-and-rescue. Yet, GSDs are well known to be susceptib...

Full description

Bibliographic Details
Published in:GigaScience
Main Authors: Field, Matt A., Rosen, Benjamin D., Dudchenko, Olga, Chan, Eva K. F., Minoche, Andre E., Edwards, Richard J., Barton, Kirston, Lyons, Ruth J., Tuipulotu, Daniel Enosi, Hayes, Vanessa M., Omer, Arina D., Colaric, Zane, Keilwagen, Jens, Skvortsova, Ksenia, Bogdanovic, Ozren, Smith, Martin A., Aiden, Erez Lieberman, Smith, Timothy P. L., Zammit, Robert A., Ballard, J. William O.
Format: Article in Journal/Newspaper
Language:English
Published: 2020
Subjects:
Online Access:https://doi.org/10.1093/gigascience/giaa027
https://www.openagrar.de/receive/openagrar_mods_00063573
https://www.openagrar.de/servlets/MCRFileNodeServlet/openagrar_derivate_00032675/2020-076.pdf
id ftopenagrar:oai:www.openagrar.de:openagrar_mods_00063573
record_format openpolar
spelling ftopenagrar:oai:www.openagrar.de:openagrar_mods_00063573 2024-09-15T18:01:29+00:00 Canfam-GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C Field, Matt A. Rosen, Benjamin D. Dudchenko, Olga Chan, Eva K. F. Minoche, Andre E. Edwards, Richard J. Barton, Kirston Lyons, Ruth J. Tuipulotu, Daniel Enosi Hayes, Vanessa M. Omer, Arina D. Colaric, Zane Keilwagen, Jens Skvortsova, Ksenia Bogdanovic, Ozren Smith, Martin A. Aiden, Erez Lieberman Smith, Timothy P. L. Zammit, Robert A. Ballard, J. William O. 2020 https://doi.org/10.1093/gigascience/giaa027 https://www.openagrar.de/receive/openagrar_mods_00063573 https://www.openagrar.de/servlets/MCRFileNodeServlet/openagrar_derivate_00032675/2020-076.pdf eng eng GigaScience -- GigaScience -- http://www.bibliothek.uni-regensburg.de/ezeit/?2708999 -- https://academic.oup.com/gigascience -- https://www.ncbi.nlm.nih.gov/pmc/journals/2056/ -- http://link.springer.com/journal/13742 -- 2047-217X https://doi.org/10.1093/gigascience/giaa027 https://www.openagrar.de/receive/openagrar_mods_00063573 https://www.openagrar.de/servlets/MCRFileNodeServlet/openagrar_derivate_00032675/2020-076.pdf https://creativecommons.org/licenses/by/4.0/ public info:eu-repo/semantics/openAccess article Text ddc:570 canine hip dysplasia de novo genome assembly DNA Zoo Hi-C long-read sequencing optical mapping article Text doc-type:article 2020 ftopenagrar https://doi.org/10.1093/gigascience/giaa027 2024-07-08T23:56:25Z Background: The German Shepherd Dog (GSD) is one of the most common breeds on earth and has been bred for its utility and intelligence. It is often first choice for police and military work, as well as protection, disability assistance, and search-and-rescue. Yet, GSDs are well known to be susceptible to a range of genetic diseases that can interfere with their training. Such diseases are of particular concern when they occur later in life, and fully trained animals are not able to continue their duties. Findings: Here, we provide the draft genome sequence of a healthy German Shepherd female as a reference for future disease and evolutionary studies. We generated this improved canid reference genome (CanFam-GSD) utilizing a combination of Pacific Bioscience, Oxford Nanopore, 10X Genomics, Bionano, and Hi-C technologies. The GSD assembly is ∼80 times as contiguous as the current canid reference genome (20.9 vs 0.267 Mb contig N50), containing far fewer gaps (306 vs 23,876) and fewer scaffolds (429 vs 3,310) than the current canid reference genome CanFamv3.1. Two chromosomes (4 and 35) are assembled into single scaffolds with no gaps. BUSCO analyses of the genome assembly results show that 93.0% of the conserved single-copy genes are complete in the GSD assembly compared with 92.2% for CanFam v3.1. Homology-based gene annotation increases this value to ∼99%. Detailed examination of the evolutionarily important pancreatic amylase region reveals that there are most likely 7 copies of the gene, indicative of a duplication of 4 ancestral copies and the disruption of 1 copy. Conclusions: GSD genome assembly and annotation were produced with major improvement in completeness, continuity, and quality over the existing canid reference. This resource will enable further research related to canine diseases, the evolutionary relationships of canids, and other aspects of canid biology. Article in Journal/Newspaper Canis lupus OpenAgrar (OA) GigaScience 9 4
institution Open Polar
collection OpenAgrar (OA)
op_collection_id ftopenagrar
language English
topic article
Text
ddc:570
canine hip dysplasia
de novo genome assembly
DNA Zoo
Hi-C
long-read sequencing
optical mapping
spellingShingle article
Text
ddc:570
canine hip dysplasia
de novo genome assembly
DNA Zoo
Hi-C
long-read sequencing
optical mapping
Field, Matt A.
Rosen, Benjamin D.
Dudchenko, Olga
Chan, Eva K. F.
Minoche, Andre E.
Edwards, Richard J.
Barton, Kirston
Lyons, Ruth J.
Tuipulotu, Daniel Enosi
Hayes, Vanessa M.
Omer, Arina D.
Colaric, Zane
Keilwagen, Jens
Skvortsova, Ksenia
Bogdanovic, Ozren
Smith, Martin A.
Aiden, Erez Lieberman
Smith, Timothy P. L.
Zammit, Robert A.
Ballard, J. William O.
Canfam-GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C
topic_facet article
Text
ddc:570
canine hip dysplasia
de novo genome assembly
DNA Zoo
Hi-C
long-read sequencing
optical mapping
description Background: The German Shepherd Dog (GSD) is one of the most common breeds on earth and has been bred for its utility and intelligence. It is often first choice for police and military work, as well as protection, disability assistance, and search-and-rescue. Yet, GSDs are well known to be susceptible to a range of genetic diseases that can interfere with their training. Such diseases are of particular concern when they occur later in life, and fully trained animals are not able to continue their duties. Findings: Here, we provide the draft genome sequence of a healthy German Shepherd female as a reference for future disease and evolutionary studies. We generated this improved canid reference genome (CanFam-GSD) utilizing a combination of Pacific Bioscience, Oxford Nanopore, 10X Genomics, Bionano, and Hi-C technologies. The GSD assembly is ∼80 times as contiguous as the current canid reference genome (20.9 vs 0.267 Mb contig N50), containing far fewer gaps (306 vs 23,876) and fewer scaffolds (429 vs 3,310) than the current canid reference genome CanFamv3.1. Two chromosomes (4 and 35) are assembled into single scaffolds with no gaps. BUSCO analyses of the genome assembly results show that 93.0% of the conserved single-copy genes are complete in the GSD assembly compared with 92.2% for CanFam v3.1. Homology-based gene annotation increases this value to ∼99%. Detailed examination of the evolutionarily important pancreatic amylase region reveals that there are most likely 7 copies of the gene, indicative of a duplication of 4 ancestral copies and the disruption of 1 copy. Conclusions: GSD genome assembly and annotation were produced with major improvement in completeness, continuity, and quality over the existing canid reference. This resource will enable further research related to canine diseases, the evolutionary relationships of canids, and other aspects of canid biology.
format Article in Journal/Newspaper
author Field, Matt A.
Rosen, Benjamin D.
Dudchenko, Olga
Chan, Eva K. F.
Minoche, Andre E.
Edwards, Richard J.
Barton, Kirston
Lyons, Ruth J.
Tuipulotu, Daniel Enosi
Hayes, Vanessa M.
Omer, Arina D.
Colaric, Zane
Keilwagen, Jens
Skvortsova, Ksenia
Bogdanovic, Ozren
Smith, Martin A.
Aiden, Erez Lieberman
Smith, Timothy P. L.
Zammit, Robert A.
Ballard, J. William O.
author_facet Field, Matt A.
Rosen, Benjamin D.
Dudchenko, Olga
Chan, Eva K. F.
Minoche, Andre E.
Edwards, Richard J.
Barton, Kirston
Lyons, Ruth J.
Tuipulotu, Daniel Enosi
Hayes, Vanessa M.
Omer, Arina D.
Colaric, Zane
Keilwagen, Jens
Skvortsova, Ksenia
Bogdanovic, Ozren
Smith, Martin A.
Aiden, Erez Lieberman
Smith, Timothy P. L.
Zammit, Robert A.
Ballard, J. William O.
author_sort Field, Matt A.
title Canfam-GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C
title_short Canfam-GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C
title_full Canfam-GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C
title_fullStr Canfam-GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C
title_full_unstemmed Canfam-GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C
title_sort canfam-gsd: de novo chromosome-length genome assembly of the german shepherd dog (canis lupus familiaris) using a combination of long reads, optical mapping, and hi-c
publishDate 2020
url https://doi.org/10.1093/gigascience/giaa027
https://www.openagrar.de/receive/openagrar_mods_00063573
https://www.openagrar.de/servlets/MCRFileNodeServlet/openagrar_derivate_00032675/2020-076.pdf
genre Canis lupus
genre_facet Canis lupus
op_relation GigaScience -- GigaScience -- http://www.bibliothek.uni-regensburg.de/ezeit/?2708999 -- https://academic.oup.com/gigascience -- https://www.ncbi.nlm.nih.gov/pmc/journals/2056/ -- http://link.springer.com/journal/13742 -- 2047-217X
https://doi.org/10.1093/gigascience/giaa027
https://www.openagrar.de/receive/openagrar_mods_00063573
https://www.openagrar.de/servlets/MCRFileNodeServlet/openagrar_derivate_00032675/2020-076.pdf
op_rights https://creativecommons.org/licenses/by/4.0/
public
info:eu-repo/semantics/openAccess
op_doi https://doi.org/10.1093/gigascience/giaa027
container_title GigaScience
container_volume 9
container_issue 4
_version_ 1810438625485127680