Data for "SuperSim: a test set for word similarity and relatedness in Swedish"

This repository contains the data described in "SuperSim: a test set for word similarity and relatedness in Swedish" (Hengchen and Tahmasebi, 2021), available at https://aclanthology.org/2021.nodalida-main.27/. If you use part or all of this resource, please cite the following work or alternatively u...


Bibliographic Details
Main Authors: Simon Hengchen, Nina Tahmasebi
Format: Other/Unknown Material
Language: Swedish
Published: Zenodo 2021
Subjects:
Online Access: https://doi.org/10.5281/zenodo.4660084
id ftzenodo:oai:zenodo.org:4660084
record_format openpolar
institution Open Polar
collection Zenodo
op_collection_id ftzenodo
language Swedish
description This repository contains the data described in "SuperSim: a test set for word similarity and relatedness in Swedish" (Hengchen and Tahmasebi, 2021), available at https://aclanthology.org/2021.nodalida-main.27/. If you use part or all of this resource, please cite the following work, or alternatively use the BibTeX entry:
Hengchen, Simon and Tahmasebi, Nina, 2021. SuperSim: a test set for word similarity and relatedness in Swedish. In The 23rd Nordic Conference on Computational Linguistics (NoDaLiDa’21).
@inproceedings{hengchen-tahmasebi-2021-supersim,
  title = "{SuperSim:} a test set for word similarity and relatedness in {Swedish}",
  author = "Hengchen, Simon and Tahmasebi, Nina",
  booktitle = "Proceedings of the 23rd Nordic Conference on Computational Linguistics",
  month = may # "{--}" # jun,
  year = "2021",
  address = "Reykjavik, Iceland, and Online",
  publisher = {Link{\"o}ping University Electronic Press},
}
The data contained in this repository is as follows:
The code folder contains: main.py, utils.py, train_base_models.py, perl-clean.pl, requirements.txt
The data folder contains:
gold_relatedness.tsv: all relatedness judgments from all annotators, as well as the mean
gold_similarity.tsv: all similarity judgments from all annotators, as well as the mean
models contains baseline models: Trained on the Swedish Gigaword: ...
format Other/Unknown Material
author Simon Hengchen
Nina Tahmasebi
title Data for "SuperSim: a test set for word similarity and relatedness in Swedish"
publisher Zenodo
publishDate 2021
url https://doi.org/10.5281/zenodo.4660084
genre Iceland
op_source NoDaLiDa, 23rd Nordic Conference on Computational Linguistics, Reykjavik
op_relation https://doi.org/10.5281/zenodo.4660083
https://doi.org/10.5281/zenodo.4660084
oai:zenodo.org:4660084
op_rights info:eu-repo/semantics/openAccess
Creative Commons Attribution 4.0 International
https://creativecommons.org/licenses/by/4.0/legalcode
op_doi https://doi.org/10.5281/zenodo.4660084
10.5281/zenodo.4660083
_version_ 1810451954857410560
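The description above lists two gold files, gold_relatedness.tsv and gold_similarity.tsv, each holding per-annotator judgments plus their mean. Test sets of this kind are commonly used by correlating a model's scores for the word pairs with the mean human judgment, typically with Spearman's rank correlation. The following is a minimal sketch of that idea, not the authors' main.py. The column names (word_1, word_2, mean) are assumptions, since this record does not document the TSV layout, and the toy rows and model scores are invented purely so the example runs on its own.

```python
import csv
import io
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def spearman(xs, ys):
    """Spearman's rho: Pearson correlation computed on average ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


def read_gold(handle, word1_col, word2_col, score_col):
    """Read a tab-separated gold file into {(word1, word2): score}.

    The column names are passed in because the real header is not
    documented in this record (hypothetical names are used below).
    """
    reader = csv.DictReader(handle, delimiter="\t")
    return {
        (row[word1_col], row[word2_col]): float(row[score_col])
        for row in reader
    }


def evaluate(gold, model_scores):
    """Correlate gold and model scores over the pairs both cover."""
    shared = [pair for pair in gold if pair in model_scores]
    rho = spearman(
        [gold[pair] for pair in shared],
        [model_scores[pair] for pair in shared],
    )
    return rho, len(shared)


# Invented toy data standing in for a gold file; NOT taken from SuperSim.
TOY_TSV = "word_1\tword_2\tmean\nw1\tw2\t9.0\nw3\tw4\t5.5\nw5\tw6\t1.0\n"
# Invented model similarities for the same three placeholder pairs.
TOY_MODEL = {("w1", "w2"): 0.8, ("w3", "w4"): 0.4, ("w5", "w6"): 0.1}

toy_gold = read_gold(io.StringIO(TOY_TSV), "word_1", "word_2", "mean")
toy_rho, toy_n = evaluate(toy_gold, TOY_MODEL)
print(f"pairs evaluated: {toy_n}, Spearman rho: {toy_rho:.3f}")
```

To try this on the actual files, one would open the TSV with open(path, encoding="utf-8", newline="") and pass the real column names after inspecting the header. Pairs missing from the model are skipped here, so the number of evaluated pairs should be reported alongside the correlation.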