Replication Data for: causalizeR: A text mining algorithm to identify causal relationships in scientific literature
Complex interactions among multiple abiotic and biotic drivers result in rapid changes in ecosystems worldwide. Predicting how specific interactions can cause ripple effects potentially resulting in abrupt shifts in ecosystems is of high relevance to policymakers, but difficult to quantify using dat...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Other/Unknown Material |
Language: | English |
Published: |
DataverseNO
2021
|
Subjects: | |
Online Access: | https://doi.org/10.18710/PTQ8X7 |
id |
ftdataverseno:doi:10.18710/PTQ8X7 |
---|---|
record_format |
openpolar |
spelling |
ftdataverseno:doi:10.18710/PTQ8X7 2023-10-29T02:40:44+01:00 Replication Data for: causalizeR: A text mining algorithm to identify causal relationships in scientific literature Ancin-Murguzur, Francisco Javier Hausner, Vera Helene Ancin Murguzur, Francisco Javier 2021-05-18 https://doi.org/10.18710/PTQ8X7 English eng DataverseNO https://github.com/fjmurguzur/causalizeR https://doi.org/10.5281/zenodo.4817639 https://doi.org/10.18710/PTQ8X7 Computer and Information Science Earth and Environmental Sciences text mining causal link algorithm literature review Algorithm in R language to perform bibliographic analyses 2021 ftdataverseno https://doi.org/10.18710/PTQ8X710.5281/zenodo.4817639 2023-10-04T22:55:37Z Complex interactions among multiple abiotic and biotic drivers result in rapid changes in ecosystems worldwide. Predicting how specific interactions can cause ripple effects potentially resulting in abrupt shifts in ecosystems is of high relevance to policymakers, but difficult to quantify using data from singular cases. We present causalizeR (https://github.com/fjmurguzur/causalizeR), a text-processing algorithm that extracts causal relations from literature based on simple grammatical rules that can be used to synthesize evidence in unstructured texts in a structured manner. The algorithm extracts causal links using the relative position of nouns relative to the keyword of choice to extract the cause and effects of interest. The resulting database can be combined with network analysis tools to estimate the direct and indirect effects of multiple drivers at the network level, which is useful for synthesizing available knowledge and for hypothesis creation and testing. We illustrate the use of the algorithm by detecting causal relationships in scientific literature relating to the tundra ecosystem. Other/Unknown Material Tundra DataverseNO |
institution |
Open Polar |
collection |
DataverseNO |
op_collection_id |
ftdataverseno |
language |
English |
topic |
Computer and Information Science Earth and Environmental Sciences text mining causal link algorithm literature review |
spellingShingle |
Computer and Information Science Earth and Environmental Sciences text mining causal link algorithm literature review Ancin-Murguzur, Francisco Javier Hausner, Vera Helene Replication Data for: causalizeR: A text mining algorithm to identify causal relationships in scientific literature |
topic_facet |
Computer and Information Science Earth and Environmental Sciences text mining causal link algorithm literature review |
description |
Complex interactions among multiple abiotic and biotic drivers result in rapid changes in ecosystems worldwide. Predicting how specific interactions can cause ripple effects potentially resulting in abrupt shifts in ecosystems is of high relevance to policymakers, but difficult to quantify using data from singular cases. We present causalizeR (https://github.com/fjmurguzur/causalizeR), a text-processing algorithm that extracts causal relations from literature based on simple grammatical rules that can be used to synthesize evidence in unstructured texts in a structured manner. The algorithm extracts causal links using the relative position of nouns relative to the keyword of choice to extract the cause and effects of interest. The resulting database can be combined with network analysis tools to estimate the direct and indirect effects of multiple drivers at the network level, which is useful for synthesizing available knowledge and for hypothesis creation and testing. We illustrate the use of the algorithm by detecting causal relationships in scientific literature relating to the tundra ecosystem. |
author2 |
Ancin Murguzur, Francisco Javier |
format |
Other/Unknown Material |
author |
Ancin-Murguzur, Francisco Javier Hausner, Vera Helene |
author_facet |
Ancin-Murguzur, Francisco Javier Hausner, Vera Helene |
author_sort |
Ancin-Murguzur, Francisco Javier |
title |
Replication Data for: causalizeR: A text mining algorithm to identify causal relationships in scientific literature |
title_short |
Replication Data for: causalizeR: A text mining algorithm to identify causal relationships in scientific literature |
title_full |
Replication Data for: causalizeR: A text mining algorithm to identify causal relationships in scientific literature |
title_fullStr |
Replication Data for: causalizeR: A text mining algorithm to identify causal relationships in scientific literature |
title_full_unstemmed |
Replication Data for: causalizeR: A text mining algorithm to identify causal relationships in scientific literature |
title_sort |
replication data for: causalizer: a text mining algorithm to identify causal relationships in scientific literature |
publisher |
DataverseNO |
publishDate |
2021 |
url |
https://doi.org/10.18710/PTQ8X7 |
genre |
Tundra |
genre_facet |
Tundra |
op_relation |
https://github.com/fjmurguzur/causalizeR https://doi.org/10.5281/zenodo.4817639 https://doi.org/10.18710/PTQ8X7 |
op_doi |
https://doi.org/10.18710/PTQ8X710.5281/zenodo.4817639 |
_version_ |
1781069254374719488 |