Optimization of whole-genome sequencing of Plasmodium falciparum from low-density dried blood spot samples

Abstract Background Whole-genome sequencing (WGS) is becoming increasingly useful to study the biology, epidemiology, and ecology of malaria parasites. Despite ease of sampling, DNA extracted from dried blood spots (DBS) has a high ratio of human DNA compared to parasite DNA, which poses a challenge...

Full description

Bibliographic Details
Published in:Malaria Journal
Main Authors: Noam B. Teyssier, Anna Chen, Elias M. Duarte, Rene Sit, Bryan Greenhouse, Sofonias K. Tessema
Format: Article in Journal/Newspaper
Language:English
Published: BMC 2021
Subjects:
Online Access:https://doi.org/10.1186/s12936-021-03630-4
https://doaj.org/article/1e560d116f5d4f30b8398e2112f7e961
Description
Summary:Abstract Background Whole-genome sequencing (WGS) is becoming increasingly useful to study the biology, epidemiology, and ecology of malaria parasites. Despite ease of sampling, DNA extracted from dried blood spots (DBS) has a high ratio of human DNA compared to parasite DNA, which poses a challenge for downstream genetic analyses. The effects of multiple methods for DNA extraction, digestion of methylated DNA, and amplification were evaluated on the quality and fidelity of WGS data recovered from DBS. Methods Low parasite density mock DBS samples were created, extracted either with Tween-Chelex or QIAamp, treated with or without McrBC, and amplified with one of three different amplification techniques (two sWGA primer sets and one rWGA). Extraction conditions were evaluated on performance of sequencing depth, percentiles of coverage, and expected SNP concordance. Results At 100 parasites/μL, Chelex-Tween-McrBC samples had higher coverage (5 × depth = 93% genome) than QIAamp extracted samples (5 × depth = 76% genome). The two evaluated sWGA primer sets showed minor differences in overall genome coverage and SNP concordance, with a newly proposed combination of 20 primers showing a modest improvement in coverage over those previously published. Conclusions Overall, Tween-Chelex extracted samples that were treated with McrBC digestion and are amplified using 6A10AD sWGA conditions had minimal dropout rate, higher percentages of coverage at higher depth, and more accurate SNP concordance than QiaAMP extracted samples. These findings extend the results of previously reported methods, making whole genome sequencing accessible to a larger number of low density samples that are commonly encountered in cross-sectional surveys.