NRC systems for the 2020 Inuktitut–English news translation task

We describe the National Research Council of Canada (NRC) submissions for the 2020 Inuktitut–English shared task on news translation at the Fifth Conference on Machine Translation (WMT20). Our submissions consist of ensembled domain-specific finetuned transformer models, trained using the Nunavut Ha...

Full description

Bibliographic Details
Main Authors: Knowles, Rebecca, Stewart, Darlene, Larkin, Samuel, Littell, Patrick
Format: Article in Journal/Newspaper
Language:English
Published: Association of Computational Linguistics 2020
Subjects:
Online Access:https://nrc-publications.canada.ca/eng/view/accepted/?id=e06a1d9c-5574-4ea1-8b93-1ab28090e851
https://nrc-publications.canada.ca/eng/view/object/?id=e06a1d9c-5574-4ea1-8b93-1ab28090e851
https://nrc-publications.canada.ca/fra/voir/objet/?id=e06a1d9c-5574-4ea1-8b93-1ab28090e851
id ftnrccanada:oai:cisti-icist.nrc-cnrc.ca:cistinparc:e06a1d9c-5574-4ea1-8b93-1ab28090e851
record_format openpolar
spelling ftnrccanada:oai:cisti-icist.nrc-cnrc.ca:cistinparc:e06a1d9c-5574-4ea1-8b93-1ab28090e851 2023-05-15T16:55:32+02:00 NRC systems for the 2020 Inuktitut–English news translation task Knowles, Rebecca Stewart, Darlene Larkin, Samuel Littell, Patrick 2020-11-19 text https://nrc-publications.canada.ca/eng/view/accepted/?id=e06a1d9c-5574-4ea1-8b93-1ab28090e851 https://nrc-publications.canada.ca/eng/view/object/?id=e06a1d9c-5574-4ea1-8b93-1ab28090e851 https://nrc-publications.canada.ca/fra/voir/objet/?id=e06a1d9c-5574-4ea1-8b93-1ab28090e851 eng eng Association of Computational Linguistics 5th Conference on Machine Translation (WMT), The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 19-20, 2020 [Held Online], ISBN: 978-1-948087-81-0, Publication date: 2020-11-19, Pages: 155–169 article 2020 ftnrccanada 2021-09-01T06:36:01Z We describe the National Research Council of Canada (NRC) submissions for the 2020 Inuktitut–English shared task on news translation at the Fifth Conference on Machine Translation (WMT20). Our submissions consist of ensembled domain-specific finetuned transformer models, trained using the Nunavut Hansard and news data and, in the case of Inuktitut–English, backtranslated news and parliamentary data. In this work we explore challenges related to the relatively small amount of parallel data, morphological complexity, and domain shifts. Peer reviewed: Yes NRC publication: Yes Article in Journal/Newspaper inuktitut Nunavut National Research Council Canada: NRC Publications Archive Canada Nunavut
institution Open Polar
collection National Research Council Canada: NRC Publications Archive
op_collection_id ftnrccanada
language English
description We describe the National Research Council of Canada (NRC) submissions for the 2020 Inuktitut–English shared task on news translation at the Fifth Conference on Machine Translation (WMT20). Our submissions consist of ensembled domain-specific finetuned transformer models, trained using the Nunavut Hansard and news data and, in the case of Inuktitut–English, backtranslated news and parliamentary data. In this work we explore challenges related to the relatively small amount of parallel data, morphological complexity, and domain shifts. Peer reviewed: Yes NRC publication: Yes
format Article in Journal/Newspaper
author Knowles, Rebecca
Stewart, Darlene
Larkin, Samuel
Littell, Patrick
spellingShingle Knowles, Rebecca
Stewart, Darlene
Larkin, Samuel
Littell, Patrick
NRC systems for the 2020 Inuktitut–English news translation task
author_facet Knowles, Rebecca
Stewart, Darlene
Larkin, Samuel
Littell, Patrick
author_sort Knowles, Rebecca
title NRC systems for the 2020 Inuktitut–English news translation task
title_short NRC systems for the 2020 Inuktitut–English news translation task
title_full NRC systems for the 2020 Inuktitut–English news translation task
title_fullStr NRC systems for the 2020 Inuktitut–English news translation task
title_full_unstemmed NRC systems for the 2020 Inuktitut–English news translation task
title_sort nrc systems for the 2020 inuktitut–english news translation task
publisher Association of Computational Linguistics
publishDate 2020
url https://nrc-publications.canada.ca/eng/view/accepted/?id=e06a1d9c-5574-4ea1-8b93-1ab28090e851
https://nrc-publications.canada.ca/eng/view/object/?id=e06a1d9c-5574-4ea1-8b93-1ab28090e851
https://nrc-publications.canada.ca/fra/voir/objet/?id=e06a1d9c-5574-4ea1-8b93-1ab28090e851
geographic Canada
Nunavut
geographic_facet Canada
Nunavut
genre inuktitut
Nunavut
genre_facet inuktitut
Nunavut
op_relation 5th Conference on Machine Translation (WMT), The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 19-20, 2020 [Held Online], ISBN: 978-1-948087-81-0, Publication date: 2020-11-19, Pages: 155–169
_version_ 1766046546860703744