Supplementary Materials from Accurate recapture identification for genetic mark–recapture studies with error-tolerant likelihood-based match calling and sample clustering

Supplementary Materials are combined into a single .pdf document, with the following contents: Supplement 1: Detail of the error-tolerant likelihood-based match calling and sample clustering approach Supplement 2: R script to implement the error-tolerant likelihood-based match calling model and samp...

Full description

Bibliographic Details
Main Authors: Sethi, Suresh Andrew, Linden, Daniel, Wenburg, John, Lewis, Cara, Lemons, Patrick, Fuller, Angela, Hare, Matthew
Format: Text
Language:unknown
Published: The Royal Society 2016
Subjects:
Online Access:https://dx.doi.org/10.6084/m9.figshare.4309295.v1
https://rs.figshare.com/articles/journal_contribution/Supplementary_Materials_from_Accurate_recapture_identification_for_genetic_mark_recapture_studies_with_error-tolerant_likelihood-based_match_calling_and_sample_clustering/4309295/1
Description
Summary:Supplementary Materials are combined into a single .pdf document, with the following contents: Supplement 1: Detail of the error-tolerant likelihood-based match calling and sample clustering approach Supplement 2: R script to implement the error-tolerant likelihood-based match calling model and sample clustering algorithms: MSATs Supplement 3: R script to implement the error-tolerant likelihood-based match calling model and sample clustering algorithms : SNPs Supplement 4: Case study genetic marker characteristics Supplement 5: Detailed base case simulation results. Error-tolerant likelihood-based match calling presents a promising technique to accurately identify recapture events in genetic mark–recapture studies by combining probabilities of latent genotypes and probabilities of observed genotypes, which may contain genotyping errors. Combined with clustering algorithms to group samples into sets of recaptures based upon pairwise match calls, these tools can be used to reconstruct accurate capture histories for mark–recapture modelling. Here, we assess the performance of a recently introduced error-tolerant likelihood-based match-calling model and sample clustering algorithm for genetic mark–recapture studies. We assessed both biallelic (i.e. single nucleotide polymorphisms; SNP) and multiallelic (i.e. microsatellite; MSAT) markers using a combination of simulation analyses and case study data on Pacific walrus ( Odobenus rosmarus divergens ) and fishers ( Pekania pennanti ). A novel two-stage clustering approach is demonstrated for genetic mark–recapture applications. First, repeat captures within a sampling occasion are identified. Subsequently, recaptures across sampling occasions are identified. The likelihood-based matching protocol performed well in simulation trials, demonstrating utility for use in a wide range of genetic mark–recapture studies. Moderately sized SNP (more than or equal to 64) and MSAT (10–15) panels produced accurate match calls for recaptures and accurate non-match calls for samples from closely related individuals in the face of low to moderate genotyping error. Furthermore, matching performance remained stable or increased as the number of genetic markers increased, genotyping error notwithstanding.