xLiD-Lexica: Cross-lingual Linked Data Lexica

We provide cross-lingual linked data lexica called xLiD-Lexica. The data set contains the cross-lingual groundings of linked data resources from the Linked Open Data cloud as RDF data, which can be easily integrated into the LOD data sources. In addition, we created a SPARQL endpoint over our xLiD-...

Bibliographic Details
Main Authors: Zhang, Lei, Färber, Michael, Rettinger, Achim
Format: Dataset
Language: unknown
Published: Zenodo 2014
Subjects:
Online Access: https://dx.doi.org/10.5281/zenodo.3544851
https://zenodo.org/record/3544851
id ftdatacite:10.5281/zenodo.3544851
record_format openpolar
institution Open Polar
collection DataCite Metadata Store (German National Library of Science and Technology)
op_collection_id ftdatacite
language unknown
description We provide cross-lingual linked data lexica called xLiD-Lexica. The data set contains the cross-lingual groundings of linked data resources from the Linked Open Data (LOD) cloud as RDF data, which can be easily integrated into the LOD data sources. In addition, we created a SPARQL endpoint over our xLiD-Lexica to allow users to access them easily using the SPARQL query language.

Multilingual and cross-lingual information access can be facilitated by the availability of such lexica, e.g., by allowing an easy mapping of natural language expressions in different languages to linked data resources from LOD. Many tasks in natural language processing, such as natural language generation, cross-lingual entity linking, text annotation and question answering, can benefit from our xLiD-Lexica.

More information can be found in the LREC'14 paper "xLiD-Lexica: Cross-lingual Linked Data Lexica" and on our website https://km.aifb.kit.edu/sites/xlid-lexica/ .

Please cite this data set as follows (see also DBLP): Lei Zhang, Michael Färber, Achim Rettinger. "xLiD-Lexica: Cross-lingual Linked Data Lexica". In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014). Reykjavik, Iceland, 2014, pp. 2101–2105.

Example queries:

1. Retrieve all entities with a surface form that contains "iPhone":

    SELECT ?resource ?label ?probability
    FROM <http://www.xlid-lexica.org>
    WHERE {
      ?resource <http://www.xlid-lexica.org/block> ?b1 .
      ?b1 <http://www.xlid-lexica.org/res#sf> ?sf .
      ?b1 <http://www.xlid-lexica.org/res#priorProbability> ?probability .
      ?sf <http://www.xlid-lexica.org/block> ?b2 .
      ?b2 <http://www.xlid-lexica.org/sf#label> ?label .
      ?label bif:contains "iPhone" .
    }
    ORDER BY DESC(?probability)
    LIMIT 100

2. Retrieve the top 100 resources for a given surface form ("iphone"):

    SELECT ?resource ?probability
    FROM <http://www.xlid-lexica.org>
    WHERE {
      ?resource <http://www.xlid-lexica.org/block> ?b1 .
      ?b1 <http://www.xlid-lexica.org/res#sf> ?sf .
      ?b1 <http://www.xlid-lexica.org/res#priorProbability> ?probability .
      ?sf <http://www.xlid-lexica.org/block> ?b2 .
      ?b2 <http://www.xlid-lexica.org/sf#label> "iphone"@en .
    }
    ORDER BY DESC(?probability)
    LIMIT 100

3. Retrieve the top 100 resources for a given surface form ("iphone", case-insensitive):

    SELECT ?resource ?probability
    FROM <http://www.xlid-lexica.org>
    WHERE {
      ?resource <http://www.xlid-lexica.org/block> ?b1 .
      ?b1 <http://www.xlid-lexica.org/res#sf> ?sf .
      ?b1 <http://www.xlid-lexica.org/res#priorProbability> ?probability .
      ?sf <http://www.xlid-lexica.org/block> ?b2 .
      ?b2 <http://www.xlid-lexica.org/sf#label> ?surfaceform .
      FILTER(regex(?surfaceform, "^iphone$", "i"))
    }
    ORDER BY DESC(?probability)
    LIMIT 100

4. Retrieve the top 100 English surface forms for a given entity (here, IPhone_5):

    SELECT ?label ?probability
    FROM <http://www.xlid-lexica.org>
    WHERE {
      <http://dbpedia.org/resource/IPhone_5> <http://www.xlid-lexica.org/block> ?b1 .
      ?b1 <http://www.xlid-lexica.org/res#sf> ?sf .
      ?b1 <http://www.xlid-lexica.org/res#priorProbability> ?probability .
      ?sf <http://www.xlid-lexica.org/block> ?b2 .
      ?b2 <http://www.xlid-lexica.org/sf#label> ?label .
      ?b2 <http://www.xlid-lexica.org/block#lang> "en" .
    }
    ORDER BY DESC(?probability)
    LIMIT 100
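
The example queries in the description can also be assembled programmatically. The following is a minimal Python sketch, not part of the data set, that builds the "top resources for a given surface form" query and a SPARQL Protocol GET URL for it. The endpoint address `http://example.org/sparql` is a placeholder (this record does not state the real endpoint URL), and the helper names `escape_literal`, `build_resource_query` and `build_request_url` are hypothetical.

```python
from urllib.parse import urlencode

# Graph IRI taken from the example queries in the description.
GRAPH_IRI = "http://www.xlid-lexica.org"
# Placeholder endpoint (assumption): replace with the actual SPARQL endpoint.
ENDPOINT = "http://example.org/sparql"


def escape_literal(text: str) -> str:
    """Escape backslashes, double quotes and newlines for a SPARQL string literal."""
    return (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
    )


def build_resource_query(surface_form: str, lang: str = "en", limit: int = 100) -> str:
    """Build the query for the resources linked to one exact surface form."""
    literal = f'"{escape_literal(surface_form)}"@{lang}'
    return " ".join([
        "SELECT ?resource ?probability",
        f"FROM <{GRAPH_IRI}>",
        "WHERE {",
        f"?resource <{GRAPH_IRI}/block> ?b1 .",
        f"?b1 <{GRAPH_IRI}/res#sf> ?sf .",
        f"?b1 <{GRAPH_IRI}/res#priorProbability> ?probability .",
        f"?sf <{GRAPH_IRI}/block> ?b2 .",
        f"?b2 <{GRAPH_IRI}/sf#label> {literal} .",
        "}",
        "ORDER BY DESC(?probability)",
        f"LIMIT {int(limit)}",
    ])


def build_request_url(endpoint: str, query: str) -> str:
    """Encode the query as a SPARQL Protocol GET request URL."""
    return f"{endpoint}?{urlencode({'query': query})}"


if __name__ == "__main__":
    # Print the URL only; no network request is made in this sketch.
    print(build_request_url(ENDPOINT, build_resource_query("iphone")))
```

Escaping the surface form before embedding it keeps quotation marks in user input from breaking the query. Sending the request and choosing a result format (for example via an `Accept` header) is left out because it depends on the endpoint.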
format Dataset
author Zhang, Lei
Färber, Michael
Rettinger, Achim
author_sort Zhang, Lei
title xLiD-Lexica: Cross-lingual Linked Data Lexica
publisher Zenodo
publishDate 2014
url https://dx.doi.org/10.5281/zenodo.3544851
https://zenodo.org/record/3544851
genre Iceland
genre_facet Iceland
op_relation https://zenodo.org/communities/linkeddata
https://zenodo.org/communities/natural-language-processing
https://dx.doi.org/10.5281/zenodo.3544852
op_rights Open Access
Creative Commons Attribution 4.0 International
https://creativecommons.org/licenses/by/4.0/legalcode
cc-by-4.0
info:eu-repo/semantics/openAccess
op_rightsnorm CC-BY
op_doi https://doi.org/10.5281/zenodo.3544851
https://doi.org/10.5281/zenodo.3544852
_version_ 1766043693732593664