Numerals and what counts
This study discusses the way different numerals and related expressions are currently annotated in the Universal Dependencies project, with a specific focus on the Uralic language family and only occasional references to the other language groups. We analyse different annotation conventions between...
Main Authors: | , , |
---|---|
Other Authors: | , , , |
Format: | Conference Object |
Language: | English |
Published: |
2022
|
Subjects: | |
Online Access: | http://hdl.handle.net/10138/343000 |
_version_ | 1824233579617452032 |
---|---|
author | Rueter, Jack Partanen, Niko Pirinen, Tommi A |
author2 | Lhoneux, Miryam de Tsarfaty, Reut Language Technology The National Library of Finland, Library Network Services |
author_facet | Rueter, Jack Partanen, Niko Pirinen, Tommi A |
author_sort | Rueter, Jack |
collection | HELDA – University of Helsinki Open Repository |
description | This study discusses the way different numerals and related expressions are currently annotated in the Universal Dependencies project, with a specific focus on the Uralic language family and only occasional references to the other language groups. We analyse different annotation conventions between individual treebanks, and aim to highlight some areas where further development work and systematization could prove beneficial. At the same time, the Universal Dependencies project already offers a wide range of conventions to mark nuanced variation in numerals and counting expressions, and the harmonization of conventions between different languages could be the next step to take. The discussion here makes specific reference to Universal Dependencies version 2.8, and some differences found may already have been harmonized in version 2.9. Regardless of whether this takes place or not, we believe that the study still forms an important documentation of this period in the project. Peer reviewed |
format | Conference Object |
genre | karelian sami |
genre_facet | karelian sami |
id | ftunivhelsihelda:oai:helda.helsinki.fi:10138/343000 |
institution | Open Polar |
language | English |
op_collection_id | ftunivhelsihelda |
op_relation | Fifth Workshop on Universal Dependencies 978-1-955917-17-9 Unknown funder Rueter , J , Partanen , N & Pirinen , T A 2021 , Numerals and what counts . in M D Lhoneux & R Tsarfaty (eds) , Fifth Workshop on Universal Dependencies : Proceedings . The Association for Computational Linguistics , Stroudsburg , pp. 151–159 , Workshop on Universal Dependencies , Sofia , 21/03/2022 . workshop http://hdl.handle.net/10138/343000 |
op_rights | cc_by info:eu-repo/semantics/openAccess openAccess |
publishDate | 2022 |
record_format | openpolar |
spelling | ftunivhelsihelda:oai:helda.helsinki.fi:10138/343000 2025-02-16T15:05:58+00:00 Numerals and what counts Rueter, Jack Partanen, Niko Pirinen, Tommi A Lhoneux, Miryam de Tsarfaty, Reut Language Technology The National Library of Finland, Library Network Services 2022-04-25T12:29:02Z 9 application/pdf http://hdl.handle.net/10138/343000 eng eng Fifth Workshop on Universal Dependencies 978-1-955917-17-9 Unknown funder Rueter , J , Partanen , N & Pirinen , T A 2021 , Numerals and what counts . in M D Lhoneux & R Tsarfaty (eds) , Fifth Workshop on Universal Dependencies : Proceedings . The Association for Computational Linguistics , Stroudsburg , pp. 151–159 , Workshop on Universal Dependencies , Sofia , 21/03/2022 . workshop http://hdl.handle.net/10138/343000 cc_by info:eu-repo/semantics/openAccess openAccess Languages universal dependencies numerals treebanks Morphological annotation Uralic languages Erzya language Moksha language Olonets-Karelian Karelian language Komi-Zyrian Komi-Permyak language Finnish language Estonian Language Skolt Sami language Northern Sami language Hungarian language syntax Conference contribution publishedVersion 2022 ftunivhelsihelda 2025-01-21T16:11:30Z This study discusses the way different numerals and related expressions are currently annotated in the Universal Dependencies project, with a specific focus on the Uralic language family and only occasional references to the other language groups. We analyse different annotation conventions between individual treebanks, and aim to highlight some areas where further development work and systematization could prove beneficial. At the same time, the Universal Dependencies project already offers a wide range of conventions to mark nuanced variation in numerals and counting expressions, and the harmonization of conventions between different languages could be the next step to take. The discussion here makes specific reference to Universal Dependencies version 2.8, and some differences found may already have been harmonized in version 2.9. Regardless of whether this takes place or not, we believe that the study still forms an important documentation of this period in the project. Peer reviewed Conference Object karelian sami HELDA – University of Helsinki Open Repository |
spellingShingle | Languages universal dependencies numerals treebanks Morphological annotation Uralic languages Erzya language Moksha language Olonets-Karelian Karelian language Komi-Zyrian Komi-Permyak language Finnish language Estonian Language Skolt Sami language Northern Sami language Hungarian language syntax Rueter, Jack Partanen, Niko Pirinen, Tommi A Numerals and what counts |
title | Numerals and what counts |
title_full | Numerals and what counts |
title_fullStr | Numerals and what counts |
title_full_unstemmed | Numerals and what counts |
title_short | Numerals and what counts |
title_sort | numerals and what counts |
topic | Languages universal dependencies numerals treebanks Morphological annotation Uralic languages Erzya language Moksha language Olonets-Karelian Karelian language Komi-Zyrian Komi-Permyak language Finnish language Estonian Language Skolt Sami language Northern Sami language Hungarian language syntax |
topic_facet | Languages universal dependencies numerals treebanks Morphological annotation Uralic languages Erzya language Moksha language Olonets-Karelian Karelian language Komi-Zyrian Komi-Permyak language Finnish language Estonian Language Skolt Sami language Northern Sami language Hungarian language syntax |
url | http://hdl.handle.net/10138/343000 |