From reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species

Abstract Background Recent advances in genomics have greatly increased research opportunities for non-model species. For wildlife, a growing availability of reference genomes means that population genetics is no longer restricted to a small set of anonymous loci. When used in conjunction with a refe...

Full description

Bibliographic Details
Published in:BMC Genomics
Main Authors: Belinda Wright, Katherine A. Farquharson, Elspeth A. McLennan, Katherine Belov, Carolyn J. Hogg, Catherine E. Grueber
Format: Article in Journal/Newspaper
Language:English
Published: BMC 2019
Subjects:
Online Access:https://doi.org/10.1186/s12864-019-5806-y
https://doaj.org/article/39de240fbe2245eb8be14ef5be81eb78
id ftdoajarticles:oai:doaj.org/article:39de240fbe2245eb8be14ef5be81eb78
record_format openpolar
spelling ftdoajarticles:oai:doaj.org/article:39de240fbe2245eb8be14ef5be81eb78 2023-05-15T13:29:59+02:00 From reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species Belinda Wright Katherine A. Farquharson Elspeth A. McLennan Katherine Belov Carolyn J. Hogg Catherine E. Grueber 2019-06-01T00:00:00Z https://doi.org/10.1186/s12864-019-5806-y https://doaj.org/article/39de240fbe2245eb8be14ef5be81eb78 EN eng BMC http://link.springer.com/article/10.1186/s12864-019-5806-y https://doaj.org/toc/1471-2164 doi:10.1186/s12864-019-5806-y 1471-2164 https://doaj.org/article/39de240fbe2245eb8be14ef5be81eb78 BMC Genomics, Vol 20, Iss 1, Pp 1-10 (2019) Population genomics DArTseq Reference genome Tasmanian devil Pink-footed goose Population differentiation Biotechnology TP248.13-248.65 Genetics QH426-470 article 2019 ftdoajarticles https://doi.org/10.1186/s12864-019-5806-y 2022-12-31T16:35:24Z Abstract Background Recent advances in genomics have greatly increased research opportunities for non-model species. For wildlife, a growing availability of reference genomes means that population genetics is no longer restricted to a small set of anonymous loci. When used in conjunction with a reference genome, reduced-representation sequencing (RRS) provides a cost-effective method for obtaining reliable diversity information for population genetics. Many software tools have been developed to process RRS data, though few studies of non-model species incorporate genome alignment in calling loci. A commonly-used RRS analysis pipeline, Stacks, has this capacity and so it is timely to compare its utility with existing software originally designed for alignment and analysis of whole genome sequencing data. Here we examine population genetic inferences from two species for which reference-aligned reduced-representation data have been collected. Our two study species are a threatened Australian marsupial (Tasmanian devil Sarcophilus harrisii; declining population) and an Arctic-circle migrant bird (pink-footed goose Anser brachyrhynchus; expanding population). Analyses of these data are compared using Stacks versus two widely-used genomics packages, SAMtools and GATK. We also introduce a custom R script to improve the reliability of single nucleotide polymorphism (SNP) calls in all pipelines and conduct population genetic inferences for non-model species with reference genomes. Results Although we identified orders of magnitude fewer SNPs in our devil dataset than for goose, we found remarkable symmetry between the two species in our assessment of software performance. For both datasets, all three methods were able to delineate population structure, even with varying numbers of loci. For both species, population structure inferences were influenced by the percent of missing data. Conclusions For studies of non-model species with a reference genome, we recommend combining Stacks output with further filtering (as ... Article in Journal/Newspaper Anser brachyrhynchus Arctic Pink-footed Goose Directory of Open Access Journals: DOAJ Articles Arctic BMC Genomics 20 1
institution Open Polar
collection Directory of Open Access Journals: DOAJ Articles
op_collection_id ftdoajarticles
language English
topic Population genomics
DArTseq
Reference genome
Tasmanian devil
Pink-footed goose
Population differentiation
Biotechnology
TP248.13-248.65
Genetics
QH426-470
spellingShingle Population genomics
DArTseq
Reference genome
Tasmanian devil
Pink-footed goose
Population differentiation
Biotechnology
TP248.13-248.65
Genetics
QH426-470
Belinda Wright
Katherine A. Farquharson
Elspeth A. McLennan
Katherine Belov
Carolyn J. Hogg
Catherine E. Grueber
From reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species
topic_facet Population genomics
DArTseq
Reference genome
Tasmanian devil
Pink-footed goose
Population differentiation
Biotechnology
TP248.13-248.65
Genetics
QH426-470
description Abstract Background Recent advances in genomics have greatly increased research opportunities for non-model species. For wildlife, a growing availability of reference genomes means that population genetics is no longer restricted to a small set of anonymous loci. When used in conjunction with a reference genome, reduced-representation sequencing (RRS) provides a cost-effective method for obtaining reliable diversity information for population genetics. Many software tools have been developed to process RRS data, though few studies of non-model species incorporate genome alignment in calling loci. A commonly-used RRS analysis pipeline, Stacks, has this capacity and so it is timely to compare its utility with existing software originally designed for alignment and analysis of whole genome sequencing data. Here we examine population genetic inferences from two species for which reference-aligned reduced-representation data have been collected. Our two study species are a threatened Australian marsupial (Tasmanian devil Sarcophilus harrisii; declining population) and an Arctic-circle migrant bird (pink-footed goose Anser brachyrhynchus; expanding population). Analyses of these data are compared using Stacks versus two widely-used genomics packages, SAMtools and GATK. We also introduce a custom R script to improve the reliability of single nucleotide polymorphism (SNP) calls in all pipelines and conduct population genetic inferences for non-model species with reference genomes. Results Although we identified orders of magnitude fewer SNPs in our devil dataset than for goose, we found remarkable symmetry between the two species in our assessment of software performance. For both datasets, all three methods were able to delineate population structure, even with varying numbers of loci. For both species, population structure inferences were influenced by the percent of missing data. Conclusions For studies of non-model species with a reference genome, we recommend combining Stacks output with further filtering (as ...
format Article in Journal/Newspaper
author Belinda Wright
Katherine A. Farquharson
Elspeth A. McLennan
Katherine Belov
Carolyn J. Hogg
Catherine E. Grueber
author_facet Belinda Wright
Katherine A. Farquharson
Elspeth A. McLennan
Katherine Belov
Carolyn J. Hogg
Catherine E. Grueber
author_sort Belinda Wright
title From reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species
title_short From reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species
title_full From reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species
title_fullStr From reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species
title_full_unstemmed From reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species
title_sort from reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species
publisher BMC
publishDate 2019
url https://doi.org/10.1186/s12864-019-5806-y
https://doaj.org/article/39de240fbe2245eb8be14ef5be81eb78
geographic Arctic
geographic_facet Arctic
genre Anser brachyrhynchus
Arctic
Pink-footed Goose
genre_facet Anser brachyrhynchus
Arctic
Pink-footed Goose
op_source BMC Genomics, Vol 20, Iss 1, Pp 1-10 (2019)
op_relation http://link.springer.com/article/10.1186/s12864-019-5806-y
https://doaj.org/toc/1471-2164
doi:10.1186/s12864-019-5806-y
1471-2164
https://doaj.org/article/39de240fbe2245eb8be14ef5be81eb78
op_doi https://doi.org/10.1186/s12864-019-5806-y
container_title BMC Genomics
container_volume 20
container_issue 1
_version_ 1766004706421768192