From reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species

BACKGROUND: Recent advances in genomics have greatly increased research opportunities for non-model species. For wildlife, a growing availability of reference genomes means that population genetics is no longer restricted to a small set of anonymous loci. When used in conjunction with a reference ge...

Full description

Bibliographic Details
Published in:BMC Genomics
Main Authors: Wright, Belinda, Farquharson, Katherine A., McLennan, Elspeth A., Belov, Katherine, Hogg, Carolyn J., Grueber, Catherine E.
Format: Text
Language:English
Published: BioMed Central 2019
Subjects:
Online Access:http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6547446/
http://www.ncbi.nlm.nih.gov/pubmed/31159724
https://doi.org/10.1186/s12864-019-5806-y
id ftpubmed:oai:pubmedcentral.nih.gov:6547446
record_format openpolar
spelling ftpubmed:oai:pubmedcentral.nih.gov:6547446 2023-05-15T13:29:59+02:00 From reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species Wright, Belinda Farquharson, Katherine A. McLennan, Elspeth A. Belov, Katherine Hogg, Carolyn J. Grueber, Catherine E. 2019-06-03 http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6547446/ http://www.ncbi.nlm.nih.gov/pubmed/31159724 https://doi.org/10.1186/s12864-019-5806-y en eng BioMed Central http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6547446/ http://www.ncbi.nlm.nih.gov/pubmed/31159724 http://dx.doi.org/10.1186/s12864-019-5806-y © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. CC0 PDM CC-BY Research Article Text 2019 ftpubmed https://doi.org/10.1186/s12864-019-5806-y 2019-06-09T00:28:52Z BACKGROUND: Recent advances in genomics have greatly increased research opportunities for non-model species. For wildlife, a growing availability of reference genomes means that population genetics is no longer restricted to a small set of anonymous loci. When used in conjunction with a reference genome, reduced-representation sequencing (RRS) provides a cost-effective method for obtaining reliable diversity information for population genetics. Many software tools have been developed to process RRS data, though few studies of non-model species incorporate genome alignment in calling loci. A commonly-used RRS analysis pipeline, Stacks, has this capacity and so it is timely to compare its utility with existing software originally designed for alignment and analysis of whole genome sequencing data. Here we examine population genetic inferences from two species for which reference-aligned reduced-representation data have been collected. Our two study species are a threatened Australian marsupial (Tasmanian devil Sarcophilus harrisii; declining population) and an Arctic-circle migrant bird (pink-footed goose Anser brachyrhynchus; expanding population). Analyses of these data are compared using Stacks versus two widely-used genomics packages, SAMtools and GATK. We also introduce a custom R script to improve the reliability of single nucleotide polymorphism (SNP) calls in all pipelines and conduct population genetic inferences for non-model species with reference genomes. RESULTS: Although we identified orders of magnitude fewer SNPs in our devil dataset than for goose, we found remarkable symmetry between the two species in our assessment of software performance. For both datasets, all three methods were able to delineate population structure, even with varying numbers of loci. For both species, population structure inferences were influenced by the percent of missing data. CONCLUSIONS: For studies of non-model species with a reference genome, we recommend combining Stacks output with further filtering (as included in ... Text Anser brachyrhynchus Arctic Pink-footed Goose PubMed Central (PMC) Arctic BMC Genomics 20 1
institution Open Polar
collection PubMed Central (PMC)
op_collection_id ftpubmed
language English
topic Research Article
spellingShingle Research Article
Wright, Belinda
Farquharson, Katherine A.
McLennan, Elspeth A.
Belov, Katherine
Hogg, Carolyn J.
Grueber, Catherine E.
From reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species
topic_facet Research Article
description BACKGROUND: Recent advances in genomics have greatly increased research opportunities for non-model species. For wildlife, a growing availability of reference genomes means that population genetics is no longer restricted to a small set of anonymous loci. When used in conjunction with a reference genome, reduced-representation sequencing (RRS) provides a cost-effective method for obtaining reliable diversity information for population genetics. Many software tools have been developed to process RRS data, though few studies of non-model species incorporate genome alignment in calling loci. A commonly-used RRS analysis pipeline, Stacks, has this capacity and so it is timely to compare its utility with existing software originally designed for alignment and analysis of whole genome sequencing data. Here we examine population genetic inferences from two species for which reference-aligned reduced-representation data have been collected. Our two study species are a threatened Australian marsupial (Tasmanian devil Sarcophilus harrisii; declining population) and an Arctic-circle migrant bird (pink-footed goose Anser brachyrhynchus; expanding population). Analyses of these data are compared using Stacks versus two widely-used genomics packages, SAMtools and GATK. We also introduce a custom R script to improve the reliability of single nucleotide polymorphism (SNP) calls in all pipelines and conduct population genetic inferences for non-model species with reference genomes. RESULTS: Although we identified orders of magnitude fewer SNPs in our devil dataset than for goose, we found remarkable symmetry between the two species in our assessment of software performance. For both datasets, all three methods were able to delineate population structure, even with varying numbers of loci. For both species, population structure inferences were influenced by the percent of missing data. CONCLUSIONS: For studies of non-model species with a reference genome, we recommend combining Stacks output with further filtering (as included in ...
format Text
author Wright, Belinda
Farquharson, Katherine A.
McLennan, Elspeth A.
Belov, Katherine
Hogg, Carolyn J.
Grueber, Catherine E.
author_facet Wright, Belinda
Farquharson, Katherine A.
McLennan, Elspeth A.
Belov, Katherine
Hogg, Carolyn J.
Grueber, Catherine E.
author_sort Wright, Belinda
title From reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species
title_short From reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species
title_full From reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species
title_fullStr From reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species
title_full_unstemmed From reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species
title_sort from reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species
publisher BioMed Central
publishDate 2019
url http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6547446/
http://www.ncbi.nlm.nih.gov/pubmed/31159724
https://doi.org/10.1186/s12864-019-5806-y
geographic Arctic
geographic_facet Arctic
genre Anser brachyrhynchus
Arctic
Pink-footed Goose
genre_facet Anser brachyrhynchus
Arctic
Pink-footed Goose
op_relation http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6547446/
http://www.ncbi.nlm.nih.gov/pubmed/31159724
http://dx.doi.org/10.1186/s12864-019-5806-y
op_rights © The Author(s). 2019
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
op_rightsnorm CC0
PDM
CC-BY
op_doi https://doi.org/10.1186/s12864-019-5806-y
container_title BMC Genomics
container_volume 20
container_issue 1
_version_ 1766004708610146304