A semantic problem solving environment for integrative parasite research: identification of intervention targets for Trypanosoma cruzi.

Background Research on the biology of parasites requires a sophisticated and integrated computational platform to query and analyze large volumes of data, representing both unpublished (internal) and public (external) data sources. Effective analysis of an integrated data resource using knowledge di...

Full description

Bibliographic Details
Published in:PLoS Neglected Tropical Diseases
Main Authors: Priti P Parikh, Todd A Minning, Vinh Nguyen, Sarasi Lalithsena, Amir H Asiaee, Satya S Sahoo, Prashant Doshi, Rick Tarleton, Amit P Sheth
Format: Article in Journal/Newspaper
Language:English
Published: Public Library of Science (PLoS) 2012
Subjects:
Online Access:https://doi.org/10.1371/journal.pntd.0001458
https://doaj.org/article/edbacf58232f44f398c51787722e6cb0
Description
Summary:Background Research on the biology of parasites requires a sophisticated and integrated computational platform to query and analyze large volumes of data, representing both unpublished (internal) and public (external) data sources. Effective analysis of an integrated data resource using knowledge discovery tools would significantly aid biologists in conducting their research, for example, through identifying various intervention targets in parasites and in deciding the future direction of ongoing as well as planned projects. A key challenge in achieving this objective is the heterogeneity between the internal lab data, usually stored as flat files, Excel spreadsheets or custom-built databases, and the external databases. Reconciling the different forms of heterogeneity and effectively integrating data from disparate sources is a nontrivial task for biologists and requires a dedicated informatics infrastructure. Thus, we developed an integrated environment using Semantic Web technologies that may provide biologists the tools for managing and analyzing their data, without the need for acquiring in-depth computer science knowledge. Methodology/principal findings We developed a semantic problem-solving environment (SPSE) that uses ontologies to integrate internal lab data with external resources in a Parasite Knowledge Base (PKB), which has the ability to query across these resources in a unified manner. The SPSE includes Web Ontology Language (OWL)-based ontologies, experimental data with its provenance information represented using the Resource Description Format (RDF), and a visual querying tool, Cuebee, that features integrated use of Web services. We demonstrate the use and benefit of SPSE using example queries for identifying gene knockout targets of Trypanosoma cruzi for vaccine development. Answers to these queries involve looking up multiple sources of data, linking them together and presenting the results. Conclusion/significance The SPSE facilitates parasitologists in leveraging the growing, but ...