Understanding population structure in an evolutionary context: population-specific F(ST) and pairwise F(ST)

Populations are shaped by their history. It is crucial to interpret population structure in an evolutionary context. Pairwise F(ST) measures population structure, whereas population-specific F(ST) measures deviation from the ancestral population. To understand the current population structure and a...

Full description

Bibliographic Details
Published in:G3 Genes|Genomes|Genetics
Main Authors: Kitada, Shuichi, Nakamichi, Reiichiro, Kishino, Hirohisa
Format: Text
Language:English
Published: Oxford University Press 2021
Subjects:
Online Access:http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8527463/
http://www.ncbi.nlm.nih.gov/pubmed/34549777
https://doi.org/10.1093/g3journal/jkab316
Description
Summary:Populations are shaped by their history. It is crucial to interpret population structure in an evolutionary context. Pairwise F(ST) measures population structure, whereas population-specific F(ST) measures deviation from the ancestral population. To understand the current population structure and a population’s history of range expansion, we propose a representation method that overlays population-specific F(ST) estimates on a sampling location map, and on an unrooted neighbor-joining tree and a multi-dimensional scaling plot inferred from a pairwise F(ST) distance matrix. We examined the usefulness of our procedure using simulations that mimicked population colonization from an ancestral population and by analyzing published human, Atlantic cod, and wild poplar data. Our results demonstrated that population-specific F(ST) values identify the source population and trace the evolutionary history of its derived populations. Conversely, pairwise F(ST) values represent the current population structure. By integrating the results of both estimators, we obtained a new picture of the population structure that incorporates evolutionary history. The generalized least squares estimate of genome-wide population-specific F(ST) indicated that the wild poplar population expanded its distribution to the north, where daylight hours are long in summer, to coastal areas with abundant rainfall, and to the south where summers are dry. Genomic data highlight the power of the bias-corrected moment estimators of F(ST), whether global, pairwise, or population-specific, that provide unbiased estimates of F(ST). All F(ST) moment estimators described in this paper have reasonable processing times and are useful in population genomics studies.