causalizeR: a text mining algorithm to identify causal relationships in scientific literature

Complex interactions among multiple abiotic and biotic drivers result in rapid changes in ecosystems worldwide. Predicting how specific interactions can cause ripple effects potentially resulting in abrupt shifts in ecosystems is of high relevance to policymakers, but difficult to quantify using dat...

Full description

Bibliographic Details
Published in:PeerJ
Main Authors: Ancin-Murguzur, Francisco J., Hausner, Vera H.
Format: Text
Language:English
Published: PeerJ Inc. 2021
Subjects:
Online Access:http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8300496/
http://www.ncbi.nlm.nih.gov/pubmed/34322328
https://doi.org/10.7717/peerj.11850
id ftpubmed:oai:pubmedcentral.nih.gov:8300496
record_format openpolar
spelling ftpubmed:oai:pubmedcentral.nih.gov:8300496 2023-05-15T18:40:23+02:00 causalizeR: a text mining algorithm to identify causal relationships in scientific literature Ancin-Murguzur, Francisco J. Hausner, Vera H. 2021-07-20 http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8300496/ http://www.ncbi.nlm.nih.gov/pubmed/34322328 https://doi.org/10.7717/peerj.11850 en eng PeerJ Inc. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8300496/ http://www.ncbi.nlm.nih.gov/pubmed/34322328 http://dx.doi.org/10.7717/peerj.11850 ©2021 Ancin-Murguzur and Hausner https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. CC-BY PeerJ Bioinformatics Text 2021 ftpubmed https://doi.org/10.7717/peerj.11850 2021-08-01T00:28:55Z Complex interactions among multiple abiotic and biotic drivers result in rapid changes in ecosystems worldwide. Predicting how specific interactions can cause ripple effects potentially resulting in abrupt shifts in ecosystems is of high relevance to policymakers, but difficult to quantify using data from singular cases. We present causalizeR (https://github.com/fjmurguzur/causalizeR), a text-processing algorithm that extracts causal relations from literature based on simple grammatical rules that can be used to synthesize evidence in unstructured texts in a structured manner. The algorithm extracts causal links using the relative position of nouns relative to the keyword of choice to extract the cause and effects of interest. The resulting database can be combined with network analysis tools to estimate the direct and indirect effects of multiple drivers at the network level, which is useful for synthesizing available knowledge and for hypothesis creation and testing. We illustrate the use of the algorithm by detecting causal relationships in scientific literature relating to the tundra ecosystem. Text Tundra PubMed Central (PMC) PeerJ 9 e11850
institution Open Polar
collection PubMed Central (PMC)
op_collection_id ftpubmed
language English
topic Bioinformatics
spellingShingle Bioinformatics
Ancin-Murguzur, Francisco J.
Hausner, Vera H.
causalizeR: a text mining algorithm to identify causal relationships in scientific literature
topic_facet Bioinformatics
description Complex interactions among multiple abiotic and biotic drivers result in rapid changes in ecosystems worldwide. Predicting how specific interactions can cause ripple effects potentially resulting in abrupt shifts in ecosystems is of high relevance to policymakers, but difficult to quantify using data from singular cases. We present causalizeR (https://github.com/fjmurguzur/causalizeR), a text-processing algorithm that extracts causal relations from literature based on simple grammatical rules that can be used to synthesize evidence in unstructured texts in a structured manner. The algorithm extracts causal links using the relative position of nouns relative to the keyword of choice to extract the cause and effects of interest. The resulting database can be combined with network analysis tools to estimate the direct and indirect effects of multiple drivers at the network level, which is useful for synthesizing available knowledge and for hypothesis creation and testing. We illustrate the use of the algorithm by detecting causal relationships in scientific literature relating to the tundra ecosystem.
format Text
author Ancin-Murguzur, Francisco J.
Hausner, Vera H.
author_facet Ancin-Murguzur, Francisco J.
Hausner, Vera H.
author_sort Ancin-Murguzur, Francisco J.
title causalizeR: a text mining algorithm to identify causal relationships in scientific literature
title_short causalizeR: a text mining algorithm to identify causal relationships in scientific literature
title_full causalizeR: a text mining algorithm to identify causal relationships in scientific literature
title_fullStr causalizeR: a text mining algorithm to identify causal relationships in scientific literature
title_full_unstemmed causalizeR: a text mining algorithm to identify causal relationships in scientific literature
title_sort causalizer: a text mining algorithm to identify causal relationships in scientific literature
publisher PeerJ Inc.
publishDate 2021
url http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8300496/
http://www.ncbi.nlm.nih.gov/pubmed/34322328
https://doi.org/10.7717/peerj.11850
genre Tundra
genre_facet Tundra
op_source PeerJ
op_relation http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8300496/
http://www.ncbi.nlm.nih.gov/pubmed/34322328
http://dx.doi.org/10.7717/peerj.11850
op_rights ©2021 Ancin-Murguzur and Hausner
https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
op_rightsnorm CC-BY
op_doi https://doi.org/10.7717/peerj.11850
container_title PeerJ
container_volume 9
container_start_page e11850
_version_ 1766229732517478400