Genomic Analyses of >3,100 Nasopharyngeal Pneumococci Revealed Significant Differences Between Pneumococci Recovered in Four Different Geographical Regions

Understanding the structure of a bacterial population is essential in order to understand bacterial evolution. Estimating the core genome (those genes common to all, or nearly all, strains of a species) is a key component of such analyses. The size and composition of the core genome varies by datase...

Full description

Bibliographic Details
Published in:Frontiers in Microbiology
Main Authors: Andries J. van Tonder, James E. Bray, Keith A. Jolley, Melissa Jansen van Rensburg, Sigríður J. Quirk, Gunnsteinn Haraldsson, Martin C. J. Maiden, Stephen D. Bentley, Ásgeir Haraldsson, Helga Erlendsdóttir, Karl G. Kristinsson, Angela B. Brueggemann
Format: Article in Journal/Newspaper
Language:English
Published: Frontiers Media S.A. 2019
Subjects:
Online Access:https://doi.org/10.3389/fmicb.2019.00317
https://doaj.org/article/ec42f552c9e84c319a33a29cd9138f93
Description
Summary:Understanding the structure of a bacterial population is essential in order to understand bacterial evolution. Estimating the core genome (those genes common to all, or nearly all, strains of a species) is a key component of such analyses. The size and composition of the core genome varies by dataset, but we hypothesized that the variation between different collections of the same bacterial species would be minimal. To investigate this, we analyzed the genome sequences of 3,118 pneumococci recovered from healthy individuals in Reykjavik (Iceland), Southampton (United Kingdom), Boston (United States), and Maela (Thailand). The analyses revealed a “supercore” genome (genes shared by all 3,118 pneumococci) of 558 genes, although an additional 354 core genes were shared by pneumococci from Reykjavik, Southampton, and Boston. Overall, the size and composition of the core and pan-genomes among pneumococci recovered in Reykjavik, Southampton, and Boston were similar. Maela pneumococci were distinctly different in that they had a smaller core genome and larger pan-genome. The pan-genome of Maela pneumococci contained several >25 Kb sequence regions (flanked by pneumococcal genes) that were homologous to genomic regions found in other bacterial species. Overall, our work revealed that some subsets of the global pneumococcal population are highly heterogeneous, and our hypothesis was rejected. This is an important finding in terms of understanding genetic variation among pneumococci and is also an essential point of consideration before generalizing the findings from a single dataset to the wider pneumococcal population.