Data from: Number of alleles as a predictor of the relative assignment accuracy of STR and SNP baselines for chum salmon

Short tandem repeat (STR) markers, which exhibit many alleles per locus, are commonly used to assign fish to their populations of origin. Single nucleotide polymorphisms (SNPs), which have many technical advantages over STRs, typically exhibit only two alleles per locus. Simulation studies have indi...

Full description

Bibliographic Details
Main Authors: Smith, Christian T., Seeb, Lisa W.
Format: Dataset
Language:English
Published: Dryad 2020
Subjects:
STR
SNP
geo
Online Access:https://doi.org/10.5061/dryad.db8hd
id fttriple:oai:gotriple.eu:50|dedup_wf_001::11532f027df1f61125ed377f487b3c7e
record_format openpolar
spelling fttriple:oai:gotriple.eu:50|dedup_wf_001::11532f027df1f61125ed377f487b3c7e 2023-05-15T15:43:55+02:00 Data from: Number of alleles as a predictor of the relative assignment accuracy of STR and SNP baselines for chum salmon Smith, Christian T. Seeb, Lisa W. 2020-07-14 https://doi.org/10.5061/dryad.db8hd en eng Dryad http://dx.doi.org/10.5061/dryad.db8hd https://dx.doi.org/10.5061/dryad.db8hd lic_creative-commons 10.5061/dryad.db8hd oai:easy.dans.knaw.nl:easy-dataset:80560 oai:services.nod.dans.knaw.nl:Products/dans:oai:easy.dans.knaw.nl:easy-dataset:80560 10|openaire____::9e3be59865b2c1c335d32dae2fe7b254 10|re3data_____::94816e6421eeb072e7742ce6a9decc5f re3data_____::r3d100000044 10|re3data_____::84e123776089ce3c7a33db98d9cd15a8 10|eurocrisdris::fe4903425d9040f680d8610d9079ea14 10|openaire____::081b82f96300b6a6e3d282bad31cb6e2 STR SNP Oncorhynchus keta number of alleles Genetic assignment North Pacific Ocean Bering Sea Life sciences medicine and health care envir geo Dataset https://vocabularies.coar-repositories.org/resource_types/c_ddb1/ 2020 fttriple https://doi.org/10.5061/dryad.db8hd 2023-01-22T17:23:34Z Short tandem repeat (STR) markers, which exhibit many alleles per locus, are commonly used to assign fish to their populations of origin. Single nucleotide polymorphisms (SNPs), which have many technical advantages over STRs, typically exhibit only two alleles per locus. Simulation studies have indicated that number of independent alleles is a good predictor of accuracy of genetic markers for fishery applications. Extant STR baselines for salmon contain hundreds of alleles, and it has been extrapolated that hundreds of SNP markers need to be developed before SNP baselines will compare to these STR baselines. We compared 15 STRs exhibiting 349 independent alleles to 61 SNP assays exhibiting 66 independent alleles for accuracy in assigning to closely related populations of chum salmon. The SNP baseline yielded slightly higher mean accuracies for proportional assignment and comparable accuracies for individual assignment. Overall the SNP baseline performed considerably better, relative to the microsatellite baseline, than predicted based on the number of independent alleles in each baseline. We suggest that this discrepancy is due to the fact that the simulation studies do not capture the impacts of the different strategies commonly employed for discovering and selecting STR and SNP markers. convert input3MSATShort tandem repeat (STR) genotype data in CONVERT formatconvert input3SNPSingle nucleotide polymorphism (SNP) genotype data in CONVERT format. Includes all loci in raw format.snp4b convert inputsingle nucleotide polymorphism (SNP) genotype data in CONVERT format. Linked SNPs converted to haplotypes and uninformative SNPs removed (as described in paper). Dataset Bering Sea Unknown Bering Sea Pacific Keta ENVELOPE(-19.455,-19.455,65.656,65.656)
institution Open Polar
collection Unknown
op_collection_id fttriple
language English
topic STR
SNP
Oncorhynchus keta
number of alleles
Genetic assignment
North Pacific Ocean
Bering Sea
Life sciences
medicine and health care
envir
geo
spellingShingle STR
SNP
Oncorhynchus keta
number of alleles
Genetic assignment
North Pacific Ocean
Bering Sea
Life sciences
medicine and health care
envir
geo
Smith, Christian T.
Seeb, Lisa W.
Data from: Number of alleles as a predictor of the relative assignment accuracy of STR and SNP baselines for chum salmon
topic_facet STR
SNP
Oncorhynchus keta
number of alleles
Genetic assignment
North Pacific Ocean
Bering Sea
Life sciences
medicine and health care
envir
geo
description Short tandem repeat (STR) markers, which exhibit many alleles per locus, are commonly used to assign fish to their populations of origin. Single nucleotide polymorphisms (SNPs), which have many technical advantages over STRs, typically exhibit only two alleles per locus. Simulation studies have indicated that number of independent alleles is a good predictor of accuracy of genetic markers for fishery applications. Extant STR baselines for salmon contain hundreds of alleles, and it has been extrapolated that hundreds of SNP markers need to be developed before SNP baselines will compare to these STR baselines. We compared 15 STRs exhibiting 349 independent alleles to 61 SNP assays exhibiting 66 independent alleles for accuracy in assigning to closely related populations of chum salmon. The SNP baseline yielded slightly higher mean accuracies for proportional assignment and comparable accuracies for individual assignment. Overall the SNP baseline performed considerably better, relative to the microsatellite baseline, than predicted based on the number of independent alleles in each baseline. We suggest that this discrepancy is due to the fact that the simulation studies do not capture the impacts of the different strategies commonly employed for discovering and selecting STR and SNP markers. convert input3MSATShort tandem repeat (STR) genotype data in CONVERT formatconvert input3SNPSingle nucleotide polymorphism (SNP) genotype data in CONVERT format. Includes all loci in raw format.snp4b convert inputsingle nucleotide polymorphism (SNP) genotype data in CONVERT format. Linked SNPs converted to haplotypes and uninformative SNPs removed (as described in paper).
format Dataset
author Smith, Christian T.
Seeb, Lisa W.
author_facet Smith, Christian T.
Seeb, Lisa W.
author_sort Smith, Christian T.
title Data from: Number of alleles as a predictor of the relative assignment accuracy of STR and SNP baselines for chum salmon
title_short Data from: Number of alleles as a predictor of the relative assignment accuracy of STR and SNP baselines for chum salmon
title_full Data from: Number of alleles as a predictor of the relative assignment accuracy of STR and SNP baselines for chum salmon
title_fullStr Data from: Number of alleles as a predictor of the relative assignment accuracy of STR and SNP baselines for chum salmon
title_full_unstemmed Data from: Number of alleles as a predictor of the relative assignment accuracy of STR and SNP baselines for chum salmon
title_sort data from: number of alleles as a predictor of the relative assignment accuracy of str and snp baselines for chum salmon
publisher Dryad
publishDate 2020
url https://doi.org/10.5061/dryad.db8hd
long_lat ENVELOPE(-19.455,-19.455,65.656,65.656)
geographic Bering Sea
Pacific
Keta
geographic_facet Bering Sea
Pacific
Keta
genre Bering Sea
genre_facet Bering Sea
op_source 10.5061/dryad.db8hd
oai:easy.dans.knaw.nl:easy-dataset:80560
oai:services.nod.dans.knaw.nl:Products/dans:oai:easy.dans.knaw.nl:easy-dataset:80560
10|openaire____::9e3be59865b2c1c335d32dae2fe7b254
10|re3data_____::94816e6421eeb072e7742ce6a9decc5f
re3data_____::r3d100000044
10|re3data_____::84e123776089ce3c7a33db98d9cd15a8
10|eurocrisdris::fe4903425d9040f680d8610d9079ea14
10|openaire____::081b82f96300b6a6e3d282bad31cb6e2
op_relation http://dx.doi.org/10.5061/dryad.db8hd
https://dx.doi.org/10.5061/dryad.db8hd
op_rights lic_creative-commons
op_doi https://doi.org/10.5061/dryad.db8hd
_version_ 1766378137657016320