Multilingual MigrationsKB: A Mulitlingual Knowledge Base of Migration related annotated Tweets

Multilingual MigrationskB (MGKB) is a mulitlingual extended version of English MGKB. The tweets geotagged with Geo location from 32 European Countries (Austria, Belgium, Bulgaria, Croatia, Cyprus, Czech, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Ireland, Italy, Latvia, Lithuania,...

Full description

Bibliographic Details
Main Author: Yiyi Chen
Format: Dataset
Language:unknown
Published: 2022
Subjects:
Online Access:https://zenodo.org/record/5918508
https://doi.org/10.5281/zenodo.5918508
id ftzenodo:oai:zenodo.org:5918508
record_format openpolar
spelling ftzenodo:oai:zenodo.org:5918508 2023-05-15T16:51:01+02:00 Multilingual MigrationsKB: A Mulitlingual Knowledge Base of Migration related annotated Tweets Yiyi Chen 2022-01-29 https://zenodo.org/record/5918508 https://doi.org/10.5281/zenodo.5918508 unknown info:eu-repo/grantAgreement/EC/Horizon 2020 Framework Programme - Research and Innovation action/882986/ doi:10.5281/zenodo.5918507 https://zenodo.org/record/5918508 https://doi.org/10.5281/zenodo.5918508 oai:zenodo.org:5918508 info:eu-repo/semantics/openAccess https://creativecommons.org/licenses/by/4.0/legalcode info:eu-repo/semantics/other dataset 2022 ftzenodo https://doi.org/10.5281/zenodo.591850810.5281/zenodo.5918507 2023-03-10T23:06:56Z Multilingual MigrationskB (MGKB) is a mulitlingual extended version of English MGKB. The tweets geotagged with Geo location from 32 European Countries (Austria, Belgium, Bulgaria, Croatia, Cyprus, Czech, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Ireland, Italy, Latvia, Lithuania, Luxembourg, Malta, Netherlands, Poland, Portugal, Romania, Slovakia, Slovenia, Spain, Sweden, Iceland, Liechtenstein, Norway, Switzerland, the United Kingdom) are extracted and filtered by 11 languages (English, French, Finnish, German, Greek, Dutch, Hungarian, Italian, Polish, Spain, Swedish). Metadata information about the tweets, such as Geo information (place name, coordinates, country code) are included. MGKB contains sentiments, offensive and hate speeches, topics, hashtags, user mentions in RDF format. The schema of MGKB is an extension of TweetsKB for migration related information. Moreover, to associate and represent the potential economic and social factors driving the migration flows, the data from Eurostat and FIBO ontology was used. To represent multilinguality, the CIDOC Conceptual Reference Model (CIDOC-CRM) is used. The extracted economic indicators, i.e., GDP Growth Rate, Total Unemployment Rate, Youth Unemployment Rate, Long-term Unemployment Rate and Income per househould, are connected with each tweet in RDF using geographical and temporal dimensions. For this version, the Multilingual MGKB is delivered separated by year. The extracted topic words are also published. Code: https://github.com/migrationsKB/MRL Please contact Yiyi Chen (yiyi.chen@partner.kit.edu) for pretrained models (Sentiment analysis/hate speech detection/ETM) if necessary. Dataset Iceland Zenodo Norway
institution Open Polar
collection Zenodo
op_collection_id ftzenodo
language unknown
description Multilingual MigrationskB (MGKB) is a mulitlingual extended version of English MGKB. The tweets geotagged with Geo location from 32 European Countries (Austria, Belgium, Bulgaria, Croatia, Cyprus, Czech, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Ireland, Italy, Latvia, Lithuania, Luxembourg, Malta, Netherlands, Poland, Portugal, Romania, Slovakia, Slovenia, Spain, Sweden, Iceland, Liechtenstein, Norway, Switzerland, the United Kingdom) are extracted and filtered by 11 languages (English, French, Finnish, German, Greek, Dutch, Hungarian, Italian, Polish, Spain, Swedish). Metadata information about the tweets, such as Geo information (place name, coordinates, country code) are included. MGKB contains sentiments, offensive and hate speeches, topics, hashtags, user mentions in RDF format. The schema of MGKB is an extension of TweetsKB for migration related information. Moreover, to associate and represent the potential economic and social factors driving the migration flows, the data from Eurostat and FIBO ontology was used. To represent multilinguality, the CIDOC Conceptual Reference Model (CIDOC-CRM) is used. The extracted economic indicators, i.e., GDP Growth Rate, Total Unemployment Rate, Youth Unemployment Rate, Long-term Unemployment Rate and Income per househould, are connected with each tweet in RDF using geographical and temporal dimensions. For this version, the Multilingual MGKB is delivered separated by year. The extracted topic words are also published. Code: https://github.com/migrationsKB/MRL Please contact Yiyi Chen (yiyi.chen@partner.kit.edu) for pretrained models (Sentiment analysis/hate speech detection/ETM) if necessary.
format Dataset
author Yiyi Chen
spellingShingle Yiyi Chen
Multilingual MigrationsKB: A Mulitlingual Knowledge Base of Migration related annotated Tweets
author_facet Yiyi Chen
author_sort Yiyi Chen
title Multilingual MigrationsKB: A Mulitlingual Knowledge Base of Migration related annotated Tweets
title_short Multilingual MigrationsKB: A Mulitlingual Knowledge Base of Migration related annotated Tweets
title_full Multilingual MigrationsKB: A Mulitlingual Knowledge Base of Migration related annotated Tweets
title_fullStr Multilingual MigrationsKB: A Mulitlingual Knowledge Base of Migration related annotated Tweets
title_full_unstemmed Multilingual MigrationsKB: A Mulitlingual Knowledge Base of Migration related annotated Tweets
title_sort multilingual migrationskb: a mulitlingual knowledge base of migration related annotated tweets
publishDate 2022
url https://zenodo.org/record/5918508
https://doi.org/10.5281/zenodo.5918508
geographic Norway
geographic_facet Norway
genre Iceland
genre_facet Iceland
op_relation info:eu-repo/grantAgreement/EC/Horizon 2020 Framework Programme - Research and Innovation action/882986/
doi:10.5281/zenodo.5918507
https://zenodo.org/record/5918508
https://doi.org/10.5281/zenodo.5918508
oai:zenodo.org:5918508
op_rights info:eu-repo/semantics/openAccess
https://creativecommons.org/licenses/by/4.0/legalcode
op_doi https://doi.org/10.5281/zenodo.591850810.5281/zenodo.5918507
_version_ 1766041134799257600