Numerals and what counts

This study discusses the way different numerals and related expressions are currently annotated in the Universal Dependencies project, with a specific focus on the Uralic language family and only occasional references to the other language groups. We analyse different annotation conventions between...

Bibliographic Details
Main Authors: Rueter, Jack; Partanen, Niko; Pirinen, Tommi A
Other Authors: Lhoneux, Miryam de; Tsarfaty, Reut; Language Technology; The National Library of Finland, Library Network Services
Format: Conference Object
Language: English
Published: 2022
Online Access: http://hdl.handle.net/10138/343000
author Rueter, Jack
Partanen, Niko
Pirinen, Tommi A
author2 Lhoneux, Miryam de
Tsarfaty, Reut
Language Technology
The National Library of Finland, Library Network Services
collection HELDA – University of Helsinki Open Repository
description This study discusses the way different numerals and related expressions are currently annotated in the Universal Dependencies project, with a specific focus on the Uralic language family and only occasional references to the other language groups. We analyse different annotation conventions between individual treebanks, and aim to highlight some areas where further development work and systematization could prove beneficial. At the same time, the Universal Dependencies project already offers a wide range of conventions to mark nuanced variation in numerals and counting expressions, and the harmonization of conventions between different languages could be the next step to take. The discussion here makes specific reference to Universal Dependencies version 2.8, and some differences found may already have been harmonized in version 2.9. Regardless of whether this takes place or not, we believe that the study still forms an important documentation of this period in the project. Peer reviewed
format Conference Object
genre karelian
sami
id ftunivhelsihelda:oai:helda.helsinki.fi:10138/343000
institution Open Polar
language English
op_collection_id ftunivhelsihelda
op_relation Fifth Workshop on Universal Dependencies
978-1-955917-17-9
Unknown funder
Rueter, J, Partanen, N & Pirinen, T A 2021, Numerals and what counts. in M D Lhoneux & R Tsarfaty (eds), Fifth Workshop on Universal Dependencies: Proceedings. The Association for Computational Linguistics, Stroudsburg, pp. 151–159, Workshop on Universal Dependencies, Sofia, 21/03/2022.
workshop
http://hdl.handle.net/10138/343000
op_rights cc_by
info:eu-repo/semantics/openAccess
openAccess
publishDate 2022
record_format openpolar
spelling ftunivhelsihelda:oai:helda.helsinki.fi:10138/343000 2025-02-16T15:05:58+00:00 Numerals and what counts Rueter, Jack Partanen, Niko Pirinen, Tommi A Lhoneux, Miryam de Tsarfaty, Reut Language Technology The National Library of Finland, Library Network Services 2022-04-25T12:29:02Z 9 application/pdf http://hdl.handle.net/10138/343000 eng eng Fifth Workshop on Universal Dependencies 978-1-955917-17-9 Unknown funder Rueter , J , Partanen , N & Pirinen , T A 2021 , Numerals and what counts . in M D Lhoneux & R Tsarfaty (eds) , Fifth Workshop on Universal Dependencies : Proceedings . The Association for Computational Linguistics , Stroudsburg , pp. 151–159 , Workshop on Universal Dependencies , Sofia , 21/03/2022 . workshop http://hdl.handle.net/10138/343000 cc_by info:eu-repo/semantics/openAccess openAccess Languages universal dependencies numerals treebanks Morphological annotation Uralic languages Erzya language Moksha language Olonets-Karelian Karelian language Komi-Zyrian Komi-Permyak language Finnish language Estonian Language Skolt Sami language Northern Sami language Hungarian language syntax Conference contribution publishedVersion 2022 ftunivhelsihelda 2025-01-21T16:11:30Z This study discusses the way different numerals and related expressions are currently annotated in the Universal Dependencies project, with a specific focus on the Uralic language family and only occasional references to the other language groups. We analyse different annotation conventions between individual treebanks, and aim to highlight some areas where further development work and systematization could prove beneficial. At the same time, the Universal Dependencies project already offers a wide range of conventions to mark nuanced variation in numerals and counting expressions, and the harmonization of conventions between different languages could be the next step to take. 
The discussion here makes specific reference to Universal Dependencies version 2.8, and some differences found may already have been harmonized in version 2.9. Regardless of whether this takes place or not, we believe that the study still forms an important documentation of this period in the project. Peer reviewed Conference Object karelian sami HELDA – University of Helsinki Open Repository
title Numerals and what counts
topic Languages
universal dependencies
numerals
treebanks
Morphological annotation
Uralic languages
Erzya language
Moksha language
Olonets-Karelian
Karelian language
Komi-Zyrian
Komi-Permyak language
Finnish language
Estonian Language
Skolt Sami language
Northern Sami language
Hungarian language
syntax
url http://hdl.handle.net/10138/343000