Correcting Errors in Digital Lexicographic Resources Using a Dictionary Manipulation Language
We describe a paradigm for combining manual and automatic error correction of noisy structured lexicographic data. Modifications to the structure and underlying text of the lexicographic data are expressed in a simple, interpreted programming language. Dictionary Manipulation Language (DML) commands...
Other Authors: | The Pennsylvania State University CiteSeerX Archives |
---|---|
Format: | Text |
Language: | English |
Subjects: | noisy structured data; error correction; digital lexicography |
Online Access: | http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.359.8412 http://lampsrv02.umiacs.umd.edu/pubs/Papers/davidzajic-11/davidzajic-11.pdf |
id |
ftciteseerx:oai:CiteSeerX.psu:10.1.1.359.8412 |
---|---|
record_format |
openpolar |
spelling |
ftciteseerx:oai:CiteSeerX.psu:10.1.1.359.8412 2023-05-15T16:01:16+02:00 Correcting Errors in Digital Lexicographic Resources Using a Dictionary Manipulation Language The Pennsylvania State University CiteSeerX Archives application/pdf http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.359.8412 http://lampsrv02.umiacs.umd.edu/pubs/Papers/davidzajic-11/davidzajic-11.pdf en eng http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.359.8412 http://lampsrv02.umiacs.umd.edu/pubs/Papers/davidzajic-11/davidzajic-11.pdf Metadata may be used without restrictions as long as the oai identifier remains attached to it. http://lampsrv02.umiacs.umd.edu/pubs/Papers/davidzajic-11/davidzajic-11.pdf noisy structured data error correction digital lexicography text ftciteseerx 2016-01-08T00:44:48Z We describe a paradigm for combining manual and automatic error correction of noisy structured lexicographic data. Modifications to the structure and underlying text of the lexicographic data are expressed in a simple, interpreted programming language. Dictionary Manipulation Language (DML) commands identify nodes by unique identifiers, and manipulations are performed using simple commands such as create, move, set text, etc. Corrected lexicons are produced by applying sequences of DML commands to the source version of the lexicon. DML commands can be written manually to repair one-off errors or generated automatically to correct recurring problems. We discuss advantages of the paradigm for the task of editing digital bilingual dictionaries. Text DML Unknown |
institution |
Open Polar |
collection |
Unknown |
op_collection_id |
ftciteseerx |
language |
English |
topic |
noisy structured data; error correction; digital lexicography |
description |
We describe a paradigm for combining manual and automatic error correction of noisy structured lexicographic data. Modifications to the structure and underlying text of the lexicographic data are expressed in a simple, interpreted programming language. Dictionary Manipulation Language (DML) commands identify nodes by unique identifiers, and manipulations are performed using simple commands such as create, move, set text, etc. Corrected lexicons are produced by applying sequences of DML commands to the source version of the lexicon. DML commands can be written manually to repair one-off errors or generated automatically to correct recurring problems. We discuss advantages of the paradigm for the task of editing digital bilingual dictionaries. |
author2 |
The Pennsylvania State University CiteSeerX Archives |
format |
Text |
title |
Correcting Errors in Digital Lexicographic Resources Using a Dictionary Manipulation Language |
url |
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.359.8412 http://lampsrv02.umiacs.umd.edu/pubs/Papers/davidzajic-11/davidzajic-11.pdf |
genre |
DML |
op_source |
http://lampsrv02.umiacs.umd.edu/pubs/Papers/davidzajic-11/davidzajic-11.pdf |
op_relation |
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.359.8412 http://lampsrv02.umiacs.umd.edu/pubs/Papers/davidzajic-11/davidzajic-11.pdf |
op_rights |
Metadata may be used without restrictions as long as the oai identifier remains attached to it. |
_version_ |
1766397207146135552 |
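
The description in this record says that DML commands identify nodes by unique identifiers, that manipulations use simple commands such as create, move, and set text, and that a corrected lexicon is produced by applying a sequence of commands to the source version. The record does not show the actual DML syntax, so the following is only a minimal sketch of that idea under stated assumptions: the command spellings (`create`, `move`, `set_text`), the argument order, the `Node` and `Lexicon` classes, and the sample identifiers (`e1`, `s1`, `t1`, ...) are all hypothetical and are not taken from the paper.

```python
from __future__ import annotations

import copy
import shlex
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Node:
    """One node of the lexicon tree (hypothetical structure)."""
    node_id: str
    tag: str
    text: str = ""
    parent: Optional[str] = None
    children: list[str] = field(default_factory=list)


class Lexicon:
    """A tree of nodes addressed by unique identifier."""

    def __init__(self) -> None:
        self.nodes: dict[str, Node] = {}

    def create(self, node_id: str, tag: str, parent_id: Optional[str] = None) -> None:
        if node_id in self.nodes:
            raise ValueError(f"duplicate node id: {node_id}")
        if parent_id is not None and parent_id not in self.nodes:
            raise KeyError(f"unknown parent id: {parent_id}")
        self.nodes[node_id] = Node(node_id, tag, parent=parent_id)
        if parent_id is not None:
            self.nodes[parent_id].children.append(node_id)

    def move(self, node_id: str, new_parent_id: str) -> None:
        # No cycle check here; a fuller interpreter would need one.
        node = self.nodes[node_id]
        new_parent = self.nodes[new_parent_id]
        if node.parent is not None:
            self.nodes[node.parent].children.remove(node_id)
        node.parent = new_parent_id
        new_parent.children.append(node_id)

    def set_text(self, node_id: str, text: str) -> None:
        self.nodes[node_id].text = text


def apply_commands(source: Lexicon, script: str) -> Lexicon:
    """Apply a command script to a copy of the source; the source is left untouched."""
    corrected = copy.deepcopy(source)
    handlers = {
        "create": corrected.create,
        "move": corrected.move,
        "set_text": corrected.set_text,
    }
    for line in script.splitlines():
        tokens = shlex.split(line, comments=True)  # '#' starts a comment
        if not tokens:
            continue
        op, *args = tokens
        if op not in handlers:
            raise ValueError(f"unknown command: {op}")
        handlers[op](*args)
    return corrected


# Source lexicon with two errors: translation t1 sits under the wrong
# entry and its text is misspelled.
source = Lexicon()
source.create("e1", "entry")
source.create("s1", "sense", "e1")
source.create("e2", "entry")
source.create("t1", "translation", "e2")
source.set_text("t1", "huose")

script = """
# manual one-off repairs (hypothetical syntax)
move t1 s1
set_text t1 "house"
create x1 example s1
set_text x1 "a big house"
"""

corrected = apply_commands(source, script)
```

Because the script is applied to a copy, the source lexicon stays unchanged and the same command sequence can be replayed at any time. A script like this could be written by hand for a single error or emitted by a program that detects a recurring pattern, which matches the two uses the description names.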