xLiD-Lexica: Cross-lingual Linked Data Lexica

We provide cross-lingual linked data lexica called xLiD-Lexica. The data set contains the cross-lingual groundings of linked data resources from the Linked Open Data cloud as RDF data, which can be easily integrated into the LOD data sources. In addition, we created a SPARQL endpoint over our xLiD-...

Bibliographic Details
Main Authors: Zhang, Lei; Färber, Michael; Rettinger, Achim
Format: Dataset
Language: unknown
Published: Zenodo 2014
Subjects:
Online Access: https://dx.doi.org/10.5281/zenodo.3544851
https://zenodo.org/record/3544851
Description
Summary: We provide cross-lingual linked data lexica called xLiD-Lexica. The data set contains the cross-lingual groundings of linked data resources from the Linked Open Data (LOD) cloud as RDF data, which can be easily integrated into the LOD data sources. In addition, we created a SPARQL endpoint over our xLiD-Lexica to allow users to easily access them using the SPARQL query language.

Multilingual and cross-lingual information access can be facilitated by the availability of such lexica, e.g., by allowing for an easy mapping of natural language expressions in different languages to linked data resources from LOD. Many tasks in natural language processing, such as natural language generation, cross-lingual entity linking, text annotation and question answering, can benefit from our xLiD-Lexica.

More information can be found in the LREC'14 paper "xLiD-Lexica: Cross-lingual Linked Data Lexica" and on our website https://km.aifb.kit.edu/sites/xlid-lexica/ .

Please cite this data set as follows (see also DBLP): Lei Zhang, Michael Färber, Achim Rettinger. "xLiD-Lexica: Cross-lingual Linked Data Lexica". In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014). Reykjavik, Iceland, 2014, pp. 2101–2105.

Example queries:

1. Retrieve all entities with a surface form that contains "iPhone":

<code class="language-sql">
Select ?resource, ?label, ?probability
from <http://www.xlid-lexica.org>
where {
  ?resource <http://www.xlid-lexica.org/block> ?b1 .
  ?b1 <http://www.xlid-lexica.org/res#sf> ?sf .
  ?b1 <http://www.xlid-lexica.org/res#priorProbability> ?probability .
  ?sf <http://www.xlid-lexica.org/block> ?b2 .
  ?b2 <http://www.xlid-lexica.org/sf#label> ?label .
  ?label bif:contains "iPhone" .
}
order by DESC(?probability)
limit 100
</code>

2. Retrieve the top 100 resources for a given surface form ("iphone"):

<code class="language-sql">
Select ?resource, ?probability
from <http://www.xlid-lexica.org>
where {
  ?resource <http://www.xlid-lexica.org/block> ?b1 .
  ?b1 <http://www.xlid-lexica.org/res#sf> ?sf .
  ?b1 <http://www.xlid-lexica.org/res#priorProbability> ?probability .
  ?sf <http://www.xlid-lexica.org/block> ?b2 .
  ?b2 <http://www.xlid-lexica.org/sf#label> "iphone"@en .
}
order by DESC(?probability)
limit 100
</code>

3. Retrieve the top 100 resources for a given surface form ("iphone", case-insensitive):

<code class="language-sql">
Select ?resource, ?probability
from <http://www.xlid-lexica.org>
where {
  ?resource <http://www.xlid-lexica.org/block> ?b1 .
  ?b1 <http://www.xlid-lexica.org/res#sf> ?sf .
  ?b1 <http://www.xlid-lexica.org/res#priorProbability> ?probability .
  ?sf <http://www.xlid-lexica.org/block> ?b2 .
  ?b2 <http://www.xlid-lexica.org/sf#label> ?surfaceform .
  filter(regex(?surfaceform, "^iphone$", "i"))
}
limit 100
</code>

4. Retrieve the top 100 surface forms for a given entity (here IPhone_5):

<code class="language-sql">
Select ?label ?probability
from <http://www.xlid-lexica.org>
where {
  <http://dbpedia.org/resource/IPhone_5> <http://www.xlid-lexica.org/block> ?b1 .
  ?b1 <http://www.xlid-lexica.org/res#sf> ?sf .
  ?b1 <http://www.xlid-lexica.org/res#priorProbability> ?probability .
  ?sf <http://www.xlid-lexica.org/block> ?b2 .
  ?b2 <http://www.xlid-lexica.org/sf#label> ?label .
  ?b2 <http://www.xlid-lexica.org/block#lang> "en" .
}
order by DESC(?probability)
limit 100
</code>
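
The example queries above hard-code the surface form. As a minimal sketch of how example query 2 could be parameterized and sent to a SPARQL endpoint from Python, the following uses only the standard library. Several things here are assumptions rather than part of the record: the `ENDPOINT` URL is a placeholder (the record does not state the endpoint address), the helper names `escape_literal`, `build_query`, `parse_bindings` and `fetch` are hypothetical, and the sketch assumes the endpoint accepts GET requests with a `query` parameter, can return the SPARQL JSON results format, and stores `priorProbability` as a numeric literal.

```python
import json
import re
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Graph IRI taken from the example queries.
GRAPH = "http://www.xlid-lexica.org"
# Placeholder only: the record does not give the endpoint address.
ENDPOINT = "http://example.org/sparql"


def escape_literal(text):
    """Escape backslashes and double quotes for a SPARQL string literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def build_query(surface_form, lang="en", limit=100):
    """Build example query 2 for an arbitrary surface form and language tag."""
    if not re.fullmatch(r"[A-Za-z]+(-[A-Za-z0-9]+)*", lang):
        raise ValueError(f"invalid language tag: {lang!r}")
    label = escape_literal(surface_form)
    return "\n".join([
        f"select ?resource ?probability from <{GRAPH}> where {{",
        f"  ?resource <{GRAPH}/block> ?b1 .",
        f"  ?b1 <{GRAPH}/res#sf> ?sf .",
        f"  ?b1 <{GRAPH}/res#priorProbability> ?probability .",
        f"  ?sf <{GRAPH}/block> ?b2 .",
        f'  ?b2 <{GRAPH}/sf#label> "{label}"@{lang} .',
        f"}} order by desc(?probability) limit {int(limit)}",
    ])


def parse_bindings(payload):
    """Turn a SPARQL JSON results document into (resource, probability) pairs."""
    rows = json.loads(payload)["results"]["bindings"]
    # Assumes every probability value parses as a float.
    return [(r["resource"]["value"], float(r["probability"]["value"])) for r in rows]


def fetch(surface_form, endpoint=ENDPOINT):
    """Send the query with HTTP GET and return the parsed rows."""
    url = endpoint + "?" + urlencode({"query": build_query(surface_form)})
    request = Request(url, headers={"Accept": "application/sparql-results+json"})
    with urlopen(request, timeout=30) as response:
        return parse_bindings(response.read().decode("utf-8"))
```

Calling `fetch("iphone")` would only work once `ENDPOINT` is replaced with the real address of the xLiD-Lexica endpoint. The sketch lists the selected variables without commas, which is the standard SPARQL form.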