Effect of Language and Error Models on Efficiency of Finite-State Spell-Checking and Correction*
We inspect the viability of finite-state spell-checking and contextless correction of non-word errors in morphologically different languages. Overviewing previous work, we conduct large-scale tests involving three languages — covering a broad spectrum of morphological features; English, Finnish and...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Text |
Language: | English |
Published: |
2012
|
Subjects: | |
Online Access: | http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.303.1476 http://www.helsinki.fi/~tapirine/publications/Pirinen-2012-fsmnlp.pdf |
id |
ftciteseerx:oai:CiteSeerX.psu:10.1.1.303.1476 |
---|---|
record_format |
openpolar |
spelling |
ftciteseerx:oai:CiteSeerX.psu:10.1.1.303.1476 2023-05-15T16:31:08+02:00 Effect of Language and Error Models on Efficiency of Finite-State Spell-Checking and Correction* Tommi A Pirinen Sam Hardwick The Pennsylvania State University CiteSeerX Archives 2012 application/pdf http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.303.1476 http://www.helsinki.fi/~tapirine/publications/Pirinen-2012-fsmnlp.pdf en eng http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.303.1476 http://www.helsinki.fi/~tapirine/publications/Pirinen-2012-fsmnlp.pdf Metadata may be used without restrictions as long as the oai identifier remains attached to it. http://www.helsinki.fi/~tapirine/publications/Pirinen-2012-fsmnlp.pdf text 2012 ftciteseerx 2016-01-07T22:09:57Z We inspect the viability of finite-state spell-checking and contextless correction of non-word errors in morphologically different languages. Overviewing previous work, we conduct large-scale tests involving three languages — covering a broad spectrum of morphological features; English, Finnish and Greenlandic — and a variety of error models and algorithms, including proposed improvements of our own. Special reference is made to on-line threeway composition of the input, the error model and the language model. Tests are run on real-world text acquired from freely available sources. We show that the finite-state approaches discussed are sufficiently fast for high-quality correction, even for Greenlandic which, due to its morphological complexity, is a difficult task for non-finite-state approaches. 1 Text greenlandic Unknown |
institution |
Open Polar |
collection |
Unknown |
op_collection_id |
ftciteseerx |
language |
English |
description |
We inspect the viability of finite-state spell-checking and contextless correction of non-word errors in morphologically different languages. Overviewing previous work, we conduct large-scale tests involving three languages — covering a broad spectrum of morphological features; English, Finnish and Greenlandic — and a variety of error models and algorithms, including proposed improvements of our own. Special reference is made to on-line threeway composition of the input, the error model and the language model. Tests are run on real-world text acquired from freely available sources. We show that the finite-state approaches discussed are sufficiently fast for high-quality correction, even for Greenlandic which, due to its morphological complexity, is a difficult task for non-finite-state approaches. 1 |
author2 |
The Pennsylvania State University CiteSeerX Archives |
format |
Text |
author |
Tommi A Pirinen Sam Hardwick |
spellingShingle |
Tommi A Pirinen Sam Hardwick Effect of Language and Error Models on Efficiency of Finite-State Spell-Checking and Correction* |
author_facet |
Tommi A Pirinen Sam Hardwick |
author_sort |
Tommi A Pirinen |
title |
Effect of Language and Error Models on Efficiency of Finite-State Spell-Checking and Correction* |
title_short |
Effect of Language and Error Models on Efficiency of Finite-State Spell-Checking and Correction* |
title_full |
Effect of Language and Error Models on Efficiency of Finite-State Spell-Checking and Correction* |
title_fullStr |
Effect of Language and Error Models on Efficiency of Finite-State Spell-Checking and Correction* |
title_full_unstemmed |
Effect of Language and Error Models on Efficiency of Finite-State Spell-Checking and Correction* |
title_sort |
effect of language and error models on efficiency of finite-state spell-checking and correction* |
publishDate |
2012 |
url |
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.303.1476 http://www.helsinki.fi/~tapirine/publications/Pirinen-2012-fsmnlp.pdf |
genre |
greenlandic |
genre_facet |
greenlandic |
op_source |
http://www.helsinki.fi/~tapirine/publications/Pirinen-2012-fsmnlp.pdf |
op_relation |
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.303.1476 http://www.helsinki.fi/~tapirine/publications/Pirinen-2012-fsmnlp.pdf |
op_rights |
Metadata may be used without restrictions as long as the oai identifier remains attached to it. |
_version_ |
1766020916094959616 |