Canfam_GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C
Abstract Background The German Shepherd Dog (GSD) is one of the most common breeds on earth and has been bred for its utility and intelligence. It is often first choice for police and military work, as well as protection, disability assistance, and search-and-rescue. Yet, GSDs are well known to be s...
Published in: | GigaScience |
---|---|
Main Authors: | , , , , , , , , , , , , , , , , , , , |
Other Authors: | , , , , |
Format: | Article in Journal/Newspaper |
Language: | English |
Published: |
Oxford University Press (OUP)
2020
|
Subjects: | |
Online Access: | http://dx.doi.org/10.1093/gigascience/giaa027 http://academic.oup.com/gigascience/article-pdf/9/4/giaa027/32989705/giaa027.pdf |
id |
croxfordunivpr:10.1093/gigascience/giaa027 |
---|---|
record_format |
openpolar |
spelling |
croxfordunivpr:10.1093/gigascience/giaa027 2024-10-20T14:08:06+00:00 Canfam_GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C Field, Matt A Rosen, Benjamin D Dudchenko, Olga Chan, Eva K F Minoche, Andre E Edwards, Richard J Barton, Kirston Lyons, Ruth J Tuipulotu, Daniel Enosi Hayes, Vanessa M D. Omer, Arina Colaric, Zane Keilwagen, Jens Skvortsova, Ksenia Bogdanovic, Ozren Smith, Martin A Aiden, Erez Lieberman Smith, Timothy P L Zammit, Robert A Ballard, J William O National Science Foundation Welch Foundation U.S. Department of Agriculture National Institutes of Health Australian Research Council 2020 http://dx.doi.org/10.1093/gigascience/giaa027 http://academic.oup.com/gigascience/article-pdf/9/4/giaa027/32989705/giaa027.pdf en eng Oxford University Press (OUP) http://creativecommons.org/licenses/by/4.0/ GigaScience volume 9, issue 4 ISSN 2047-217X journal-article 2020 croxfordunivpr https://doi.org/10.1093/gigascience/giaa027 2024-09-24T04:07:21Z Abstract Background The German Shepherd Dog (GSD) is one of the most common breeds on earth and has been bred for its utility and intelligence. It is often first choice for police and military work, as well as protection, disability assistance, and search-and-rescue. Yet, GSDs are well known to be susceptible to a range of genetic diseases that can interfere with their training. Such diseases are of particular concern when they occur later in life, and fully trained animals are not able to continue their duties. Findings Here, we provide the draft genome sequence of a healthy German Shepherd female as a reference for future disease and evolutionary studies. We generated this improved canid reference genome (CanFam_GSD) utilizing a combination of Pacific Bioscience, Oxford Nanopore, 10X Genomics, Bionano, and Hi-C technologies. The GSD assembly is ∼80 times as contiguous as the current canid reference genome (20.9 vs 0.267 Mb contig N50), containing far fewer gaps (306 vs 23,876) and fewer scaffolds (429 vs 3,310) than the current canid reference genome CanFamv3.1. Two chromosomes (4 and 35) are assembled into single scaffolds with no gaps. BUSCO analyses of the genome assembly results show that 93.0% of the conserved single-copy genes are complete in the GSD assembly compared with 92.2% for CanFam v3.1. Homology-based gene annotation increases this value to ∼99%. Detailed examination of the evolutionarily important pancreatic amylase region reveals that there are most likely 7 copies of the gene, indicative of a duplication of 4 ancestral copies and the disruption of 1 copy. Conclusions GSD genome assembly and annotation were produced with major improvement in completeness, continuity, and quality over the existing canid reference. This resource will enable further research related to canine diseases, the evolutionary relationships of canids, and other aspects of canid biology. Article in Journal/Newspaper Canis lupus Oxford University Press Pacific GigaScience 9 4 |
institution |
Open Polar |
collection |
Oxford University Press |
op_collection_id |
croxfordunivpr |
language |
English |
description |
Abstract Background The German Shepherd Dog (GSD) is one of the most common breeds on earth and has been bred for its utility and intelligence. It is often first choice for police and military work, as well as protection, disability assistance, and search-and-rescue. Yet, GSDs are well known to be susceptible to a range of genetic diseases that can interfere with their training. Such diseases are of particular concern when they occur later in life, and fully trained animals are not able to continue their duties. Findings Here, we provide the draft genome sequence of a healthy German Shepherd female as a reference for future disease and evolutionary studies. We generated this improved canid reference genome (CanFam_GSD) utilizing a combination of Pacific Bioscience, Oxford Nanopore, 10X Genomics, Bionano, and Hi-C technologies. The GSD assembly is ∼80 times as contiguous as the current canid reference genome (20.9 vs 0.267 Mb contig N50), containing far fewer gaps (306 vs 23,876) and fewer scaffolds (429 vs 3,310) than the current canid reference genome CanFamv3.1. Two chromosomes (4 and 35) are assembled into single scaffolds with no gaps. BUSCO analyses of the genome assembly results show that 93.0% of the conserved single-copy genes are complete in the GSD assembly compared with 92.2% for CanFam v3.1. Homology-based gene annotation increases this value to ∼99%. Detailed examination of the evolutionarily important pancreatic amylase region reveals that there are most likely 7 copies of the gene, indicative of a duplication of 4 ancestral copies and the disruption of 1 copy. Conclusions GSD genome assembly and annotation were produced with major improvement in completeness, continuity, and quality over the existing canid reference. This resource will enable further research related to canine diseases, the evolutionary relationships of canids, and other aspects of canid biology. |
author2 |
National Science Foundation Welch Foundation U.S. Department of Agriculture National Institutes of Health Australian Research Council |
format |
Article in Journal/Newspaper |
author |
Field, Matt A Rosen, Benjamin D Dudchenko, Olga Chan, Eva K F Minoche, Andre E Edwards, Richard J Barton, Kirston Lyons, Ruth J Tuipulotu, Daniel Enosi Hayes, Vanessa M D. Omer, Arina Colaric, Zane Keilwagen, Jens Skvortsova, Ksenia Bogdanovic, Ozren Smith, Martin A Aiden, Erez Lieberman Smith, Timothy P L Zammit, Robert A Ballard, J William O |
spellingShingle |
Field, Matt A Rosen, Benjamin D Dudchenko, Olga Chan, Eva K F Minoche, Andre E Edwards, Richard J Barton, Kirston Lyons, Ruth J Tuipulotu, Daniel Enosi Hayes, Vanessa M D. Omer, Arina Colaric, Zane Keilwagen, Jens Skvortsova, Ksenia Bogdanovic, Ozren Smith, Martin A Aiden, Erez Lieberman Smith, Timothy P L Zammit, Robert A Ballard, J William O Canfam_GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C |
author_facet |
Field, Matt A Rosen, Benjamin D Dudchenko, Olga Chan, Eva K F Minoche, Andre E Edwards, Richard J Barton, Kirston Lyons, Ruth J Tuipulotu, Daniel Enosi Hayes, Vanessa M D. Omer, Arina Colaric, Zane Keilwagen, Jens Skvortsova, Ksenia Bogdanovic, Ozren Smith, Martin A Aiden, Erez Lieberman Smith, Timothy P L Zammit, Robert A Ballard, J William O |
author_sort |
Field, Matt A |
title |
Canfam_GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C |
title_short |
Canfam_GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C |
title_full |
Canfam_GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C |
title_fullStr |
Canfam_GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C |
title_full_unstemmed |
Canfam_GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C |
title_sort |
canfam_gsd: de novo chromosome-length genome assembly of the german shepherd dog (canis lupus familiaris) using a combination of long reads, optical mapping, and hi-c |
publisher |
Oxford University Press (OUP) |
publishDate |
2020 |
url |
http://dx.doi.org/10.1093/gigascience/giaa027 http://academic.oup.com/gigascience/article-pdf/9/4/giaa027/32989705/giaa027.pdf |
geographic |
Pacific |
geographic_facet |
Pacific |
genre |
Canis lupus |
genre_facet |
Canis lupus |
op_source |
GigaScience volume 9, issue 4 ISSN 2047-217X |
op_rights |
http://creativecommons.org/licenses/by/4.0/ |
op_doi |
https://doi.org/10.1093/gigascience/giaa027 |
container_title |
GigaScience |
container_volume |
9 |
container_issue |
4 |
_version_ |
1813447203155345408 |