Transforming Clinical Data into Actionable Prognosis Models: Machine-Learning Framework and Field-Deployable App to Predict Outcome of Ebola Patients.

BACKGROUND:Assessment of the response to the 2014-15 Ebola outbreak indicates the need for innovations in data collection, sharing, and use to improve case detection and treatment. Here we introduce a Machine Learning pipeline for Ebola Virus Disease (EVD) prognosis prediction, which packages the be...

Full description

Bibliographic Details
Published in:PLOS Neglected Tropical Diseases
Main Authors: Andres Colubri, Tom Silver, Terrence Fradet, Kalliroi Retzepi, Ben Fry, Pardis Sabeti
Format: Article in Journal/Newspaper
Language:English
Published: Public Library of Science (PLoS) 2016
Subjects:
Online Access:https://doi.org/10.1371/journal.pntd.0004549
https://doaj.org/article/4c0a41f4e8194662a9e0aa83507c7dfc
id ftdoajarticles:oai:doaj.org/article:4c0a41f4e8194662a9e0aa83507c7dfc
record_format openpolar
spelling ftdoajarticles:oai:doaj.org/article:4c0a41f4e8194662a9e0aa83507c7dfc 2023-05-15T15:16:48+02:00 Transforming Clinical Data into Actionable Prognosis Models: Machine-Learning Framework and Field-Deployable App to Predict Outcome of Ebola Patients. Andres Colubri Tom Silver Terrence Fradet Kalliroi Retzepi Ben Fry Pardis Sabeti 2016-03-01T00:00:00Z https://doi.org/10.1371/journal.pntd.0004549 https://doaj.org/article/4c0a41f4e8194662a9e0aa83507c7dfc EN eng Public Library of Science (PLoS) http://europepmc.org/articles/PMC4798608?pdf=render https://doaj.org/toc/1935-2727 https://doaj.org/toc/1935-2735 1935-2727 1935-2735 doi:10.1371/journal.pntd.0004549 https://doaj.org/article/4c0a41f4e8194662a9e0aa83507c7dfc PLoS Neglected Tropical Diseases, Vol 10, Iss 3, p e0004549 (2016) Arctic medicine. Tropical medicine RC955-962 Public aspects of medicine RA1-1270 article 2016 ftdoajarticles https://doi.org/10.1371/journal.pntd.0004549 2022-12-31T02:27:06Z BACKGROUND:Assessment of the response to the 2014-15 Ebola outbreak indicates the need for innovations in data collection, sharing, and use to improve case detection and treatment. Here we introduce a Machine Learning pipeline for Ebola Virus Disease (EVD) prognosis prediction, which packages the best models into a mobile app to be available in clinical care settings. The pipeline was trained on a public EVD clinical dataset, from 106 patients in Sierra Leone. METHODS/PRINCIPAL FINDINGS:We used a new tool for exploratory analysis, Mirador, to identify the most informative clinical factors that correlate with EVD outcome. The small sample size and high prevalence of missing records were significant challenges. We applied multiple imputation and bootstrap sampling to address missing data and quantify overfitting. We trained several predictors over all combinations of covariates, which resulted in an ensemble of predictors, with and without viral load information, with an area under the receiver operator characteristic curve of 0.8 or more, after correcting for optimistic bias. We ranked the predictors by their F1-score, and those above a set threshold were compiled into a mobile app, Ebola CARE (Computational Assignment of Risk Estimates). CONCLUSIONS/SIGNIFICANCE:This method demonstrates how to address small sample sizes and missing data, while creating predictive models that can be readily deployed to assist treatment in future outbreaks of EVD and other infectious diseases. By generating an ensemble of predictors instead of relying on a single model, we are able to handle situations where patient data is partially available. The prognosis app can be updated as new data become available, and we made all the computational protocols fully documented and open-sourced to encourage timely data sharing, independent validation, and development of better prediction models in outbreak response. Article in Journal/Newspaper Arctic Directory of Open Access Journals: DOAJ Articles Arctic PLOS Neglected Tropical Diseases 10 3 e0004549
institution Open Polar
collection Directory of Open Access Journals: DOAJ Articles
op_collection_id ftdoajarticles
language English
topic Arctic medicine. Tropical medicine
RC955-962
Public aspects of medicine
RA1-1270
spellingShingle Arctic medicine. Tropical medicine
RC955-962
Public aspects of medicine
RA1-1270
Andres Colubri
Tom Silver
Terrence Fradet
Kalliroi Retzepi
Ben Fry
Pardis Sabeti
Transforming Clinical Data into Actionable Prognosis Models: Machine-Learning Framework and Field-Deployable App to Predict Outcome of Ebola Patients.
topic_facet Arctic medicine. Tropical medicine
RC955-962
Public aspects of medicine
RA1-1270
description BACKGROUND:Assessment of the response to the 2014-15 Ebola outbreak indicates the need for innovations in data collection, sharing, and use to improve case detection and treatment. Here we introduce a Machine Learning pipeline for Ebola Virus Disease (EVD) prognosis prediction, which packages the best models into a mobile app to be available in clinical care settings. The pipeline was trained on a public EVD clinical dataset, from 106 patients in Sierra Leone. METHODS/PRINCIPAL FINDINGS:We used a new tool for exploratory analysis, Mirador, to identify the most informative clinical factors that correlate with EVD outcome. The small sample size and high prevalence of missing records were significant challenges. We applied multiple imputation and bootstrap sampling to address missing data and quantify overfitting. We trained several predictors over all combinations of covariates, which resulted in an ensemble of predictors, with and without viral load information, with an area under the receiver operator characteristic curve of 0.8 or more, after correcting for optimistic bias. We ranked the predictors by their F1-score, and those above a set threshold were compiled into a mobile app, Ebola CARE (Computational Assignment of Risk Estimates). CONCLUSIONS/SIGNIFICANCE:This method demonstrates how to address small sample sizes and missing data, while creating predictive models that can be readily deployed to assist treatment in future outbreaks of EVD and other infectious diseases. By generating an ensemble of predictors instead of relying on a single model, we are able to handle situations where patient data is partially available. The prognosis app can be updated as new data become available, and we made all the computational protocols fully documented and open-sourced to encourage timely data sharing, independent validation, and development of better prediction models in outbreak response.
format Article in Journal/Newspaper
author Andres Colubri
Tom Silver
Terrence Fradet
Kalliroi Retzepi
Ben Fry
Pardis Sabeti
author_facet Andres Colubri
Tom Silver
Terrence Fradet
Kalliroi Retzepi
Ben Fry
Pardis Sabeti
author_sort Andres Colubri
title Transforming Clinical Data into Actionable Prognosis Models: Machine-Learning Framework and Field-Deployable App to Predict Outcome of Ebola Patients.
title_short Transforming Clinical Data into Actionable Prognosis Models: Machine-Learning Framework and Field-Deployable App to Predict Outcome of Ebola Patients.
title_full Transforming Clinical Data into Actionable Prognosis Models: Machine-Learning Framework and Field-Deployable App to Predict Outcome of Ebola Patients.
title_fullStr Transforming Clinical Data into Actionable Prognosis Models: Machine-Learning Framework and Field-Deployable App to Predict Outcome of Ebola Patients.
title_full_unstemmed Transforming Clinical Data into Actionable Prognosis Models: Machine-Learning Framework and Field-Deployable App to Predict Outcome of Ebola Patients.
title_sort transforming clinical data into actionable prognosis models: machine-learning framework and field-deployable app to predict outcome of ebola patients.
publisher Public Library of Science (PLoS)
publishDate 2016
url https://doi.org/10.1371/journal.pntd.0004549
https://doaj.org/article/4c0a41f4e8194662a9e0aa83507c7dfc
geographic Arctic
geographic_facet Arctic
genre Arctic
genre_facet Arctic
op_source PLoS Neglected Tropical Diseases, Vol 10, Iss 3, p e0004549 (2016)
op_relation http://europepmc.org/articles/PMC4798608?pdf=render
https://doaj.org/toc/1935-2727
https://doaj.org/toc/1935-2735
1935-2727
1935-2735
doi:10.1371/journal.pntd.0004549
https://doaj.org/article/4c0a41f4e8194662a9e0aa83507c7dfc
op_doi https://doi.org/10.1371/journal.pntd.0004549
container_title PLOS Neglected Tropical Diseases
container_volume 10
container_issue 3
container_start_page e0004549
_version_ 1766347093312536576