Novel Pipeline for Large-Scale Comparative Population Genetics

This study determined population genetic structure measures, compared these measures across species with different biological traits; and created efficient, reproducible, reusable programming modules that are publicly available for future research. Cytochrome C Oxidase subunit I gene sequences from...

Full description

Bibliographic Details
Main Author: Majoros, Samantha
Other Authors: Adamowicz, Sarah
Format: Thesis
Language:English
Published: University of Guelph 2022
Subjects:
Online Access:https://hdl.handle.net/10214/27408
Description
Summary:This study determined population genetic structure measures, compared these measures across species with different biological traits; and created efficient, reproducible, reusable programming modules that are publicly available for future research. Cytochrome C Oxidase subunit I gene sequences from Diptera (true fly) species from Greenland and Canada were used as a case study and proof of concept. I hypothesized that population genetic structure measures will be influenced by the biological traits of organisms. Data were pulled from public databases, as well as taxon-specific literature. The R pipeline includes fifteen modules that can be adapted and applied to a diverse set of animal groups, geographic regions, genes, and traits. Habitat, larval diet, geographical distance, latitude, and longitude were all significantly related to population genetic structure in Diptera. Overall, this study has created efficient, reusable bioinformatics modules, as well as provided insight into the factors affecting population genetic structure in Northern fly communities. Natural Sciences and Engineering Research Council of Canada Genome Canada