Neural models for morphological generation, analysis and lemmatization in 22 languages
Morphological models for generation, lemmatization and analysis in 22 languages. The models are trained in OpenNMT-py https://github.com/OpenNMT/OpenNMT-py . Feed one word at a time, split into characters (kissa -> k i s s a) Supported languages:German (deu), Kven (fkv), Komi-Zyrian (kpv), Mokhsa...
Main Authors: | , , , |
---|---|
Format: | Other/Unknown Material |
Language: | Finnish |
Published: |
Zenodo
2020
|
Subjects: | |
Online Access: | https://doi.org/10.5281/zenodo.3926769 |
id |
ftzenodo:oai:zenodo.org:3926769 |
---|---|
record_format |
openpolar |
spelling |
ftzenodo:oai:zenodo.org:3926769 2024-09-15T18:16:21+00:00 Neural models for morphological generation, analysis and lemmatization in 22 languages Hämäläinen, Mika Partanen, Niko Rueter, Jack Alnajjar, Khalid 2020-07-01 https://doi.org/10.5281/zenodo.3926769 fin fin Zenodo https://doi.org/10.5281/zenodo.3926768 https://doi.org/10.5281/zenodo.3926769 oai:zenodo.org:3926769 info:eu-repo/semantics/openAccess Creative Commons Attribution 4.0 International https://creativecommons.org/licenses/by/4.0/legalcode morphology fst endangered languages neural models info:eu-repo/semantics/other 2020 ftzenodo https://doi.org/10.5281/zenodo.392676910.5281/zenodo.3926768 2024-07-25T14:59:19Z Morphological models for generation, lemmatization and analysis in 22 languages. The models are trained in OpenNMT-py https://github.com/OpenNMT/OpenNMT-py . Feed one word at a time, split into characters (kissa -> k i s s a) Supported languages:German (deu), Kven (fkv), Komi-Zyrian (kpv), Mokhsa (mdf), Mansi (mns), Erzya (myv), Norwegian Bokmål (nob), Russian (rus), South Sami (sma), Lule Sami (smj), Skolt Sami (sms), Võro (vro), Finnish (fin), Komi-Permyak (koi), Latvian (lav), Eastern Mari (mhr), Western Mari (mrj), Namonuito (nmt), Olonets-Karelian (olo), Pite Sami (sje), Northern Sami (sme), Inari Sami (smn) and Udmurt (udm) Cite: Hämäläinen, M., Partanen, N., Rueter, J., & Alnajjar, K. (2021). Neural Morphology Dataset and Models for Multiple Languages, from the Large to the Endangered. In Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa 2021) Other/Unknown Material karelian sami Mansi Zenodo |
institution |
Open Polar |
collection |
Zenodo |
op_collection_id |
ftzenodo |
language |
Finnish |
topic |
morphology fst endangered languages neural models |
spellingShingle |
morphology fst endangered languages neural models Hämäläinen, Mika Partanen, Niko Rueter, Jack Alnajjar, Khalid Neural models for morphological generation, analysis and lemmatization in 22 languages |
topic_facet |
morphology fst endangered languages neural models |
description |
Morphological models for generation, lemmatization and analysis in 22 languages. The models are trained in OpenNMT-py https://github.com/OpenNMT/OpenNMT-py . Feed one word at a time, split into characters (kissa -> k i s s a) Supported languages:German (deu), Kven (fkv), Komi-Zyrian (kpv), Mokhsa (mdf), Mansi (mns), Erzya (myv), Norwegian Bokmål (nob), Russian (rus), South Sami (sma), Lule Sami (smj), Skolt Sami (sms), Võro (vro), Finnish (fin), Komi-Permyak (koi), Latvian (lav), Eastern Mari (mhr), Western Mari (mrj), Namonuito (nmt), Olonets-Karelian (olo), Pite Sami (sje), Northern Sami (sme), Inari Sami (smn) and Udmurt (udm) Cite: Hämäläinen, M., Partanen, N., Rueter, J., & Alnajjar, K. (2021). Neural Morphology Dataset and Models for Multiple Languages, from the Large to the Endangered. In Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa 2021) |
format |
Other/Unknown Material |
author |
Hämäläinen, Mika Partanen, Niko Rueter, Jack Alnajjar, Khalid |
author_facet |
Hämäläinen, Mika Partanen, Niko Rueter, Jack Alnajjar, Khalid |
author_sort |
Hämäläinen, Mika |
title |
Neural models for morphological generation, analysis and lemmatization in 22 languages |
title_short |
Neural models for morphological generation, analysis and lemmatization in 22 languages |
title_full |
Neural models for morphological generation, analysis and lemmatization in 22 languages |
title_fullStr |
Neural models for morphological generation, analysis and lemmatization in 22 languages |
title_full_unstemmed |
Neural models for morphological generation, analysis and lemmatization in 22 languages |
title_sort |
neural models for morphological generation, analysis and lemmatization in 22 languages |
publisher |
Zenodo |
publishDate |
2020 |
url |
https://doi.org/10.5281/zenodo.3926769 |
genre |
karelian sami Mansi |
genre_facet |
karelian sami Mansi |
op_relation |
https://doi.org/10.5281/zenodo.3926768 https://doi.org/10.5281/zenodo.3926769 oai:zenodo.org:3926769 |
op_rights |
info:eu-repo/semantics/openAccess Creative Commons Attribution 4.0 International https://creativecommons.org/licenses/by/4.0/legalcode |
op_doi |
https://doi.org/10.5281/zenodo.392676910.5281/zenodo.3926768 |
_version_ |
1810454353049288704 |