Minimizing Taxonomic and Natural Product Redundancy in Microbial Libraries using MALDI-TOF MS and the bioinformatics pipeline IDBac.

Libraries of microorganisms have been a cornerstone of drug discovery efforts since the mid-1950s, but strain duplication in some libraries has resulted in unwanted natural product redundancy. In the current study, we implemented a workflow that minimizes both the natural product overlap and the tot...

Full description

Bibliographic Details
Published in:Journal of Natural Products
Main Authors: Costa, Maria S., Clark, Chase M., Ómarsdóttir, Sesselja, Sanchez, Laura M., Murphy, Brian T.
Format: Text
Language:English
Published: 2019
Subjects:
Online Access:http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7197193/
http://www.ncbi.nlm.nih.gov/pubmed/31335140
https://doi.org/10.1021/acs.jnatprod.9b00168
Description
Summary:Libraries of microorganisms have been a cornerstone of drug discovery efforts since the mid-1950s, but strain duplication in some libraries has resulted in unwanted natural product redundancy. In the current study, we implemented a workflow that minimizes both the natural product overlap and the total number of bacterial isolates in a library. Using a collection expedition to Iceland as an example, we purified every distinct bacterial colony (1,616 total) off isolation plates derived from 86 environmental samples. We employed our mass spectrometry (MS) based IDBac workflow on these isolates to form groups of taxa based on protein MS fingerprints (3-15 kDa), and further distinguished taxa subgroups based on their degree of overlap within corresponding natural product spectra (0.2-2 kDa). This informed the decision to create a library of 301 isolates spanning 54 genera. This process required only 25 hours of data acquisition and 2 hours of analysis. In a separate experiment, we reduced the size of an existing library based on the degree of metabolic overlap observed in natural product MS spectra of bacterial colonies (from 833 to 233 isolates, a 72.0% reduction). Overall, our pipeline allows for the reduction of library size and costs associated with library generation, and minimizes natural product redundancy entering into downstream biological screening efforts.