NRC systems for the 2020 Inuktitut–English news translation task
We describe the National Research Council of Canada (NRC) submissions for the 2020 Inuktitut–English shared task on news translation at the Fifth Conference on Machine Translation (WMT20). Our submissions consist of ensembled domain-specific finetuned transformer models, trained using the Nunavut Ha...
Main Authors: | , , , |
---|---|
Format: | Article in Journal/Newspaper |
Language: | English |
Published: |
Association of Computational Linguistics
2020
|
Subjects: | |
Online Access: | https://nrc-publications.canada.ca/eng/view/accepted/?id=e06a1d9c-5574-4ea1-8b93-1ab28090e851 https://nrc-publications.canada.ca/eng/view/object/?id=e06a1d9c-5574-4ea1-8b93-1ab28090e851 https://nrc-publications.canada.ca/fra/voir/objet/?id=e06a1d9c-5574-4ea1-8b93-1ab28090e851 |
id |
ftnrccanada:oai:cisti-icist.nrc-cnrc.ca:cistinparc:e06a1d9c-5574-4ea1-8b93-1ab28090e851 |
---|---|
record_format |
openpolar |
spelling |
ftnrccanada:oai:cisti-icist.nrc-cnrc.ca:cistinparc:e06a1d9c-5574-4ea1-8b93-1ab28090e851 2023-05-15T16:55:32+02:00 NRC systems for the 2020 Inuktitut–English news translation task Knowles, Rebecca Stewart, Darlene Larkin, Samuel Littell, Patrick 2020-11-19 text https://nrc-publications.canada.ca/eng/view/accepted/?id=e06a1d9c-5574-4ea1-8b93-1ab28090e851 https://nrc-publications.canada.ca/eng/view/object/?id=e06a1d9c-5574-4ea1-8b93-1ab28090e851 https://nrc-publications.canada.ca/fra/voir/objet/?id=e06a1d9c-5574-4ea1-8b93-1ab28090e851 eng eng Association of Computational Linguistics 5th Conference on Machine Translation (WMT), The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 19-20, 2020 [Held Online], ISBN: 978-1-948087-81-0, Publication date: 2020-11-19, Pages: 155–169 article 2020 ftnrccanada 2021-09-01T06:36:01Z We describe the National Research Council of Canada (NRC) submissions for the 2020 Inuktitut–English shared task on news translation at the Fifth Conference on Machine Translation (WMT20). Our submissions consist of ensembled domain-specific finetuned transformer models, trained using the Nunavut Hansard and news data and, in the case of Inuktitut–English, backtranslated news and parliamentary data. In this work we explore challenges related to the relatively small amount of parallel data, morphological complexity, and domain shifts. Peer reviewed: Yes NRC publication: Yes Article in Journal/Newspaper inuktitut Nunavut National Research Council Canada: NRC Publications Archive Canada Nunavut |
institution |
Open Polar |
collection |
National Research Council Canada: NRC Publications Archive |
op_collection_id |
ftnrccanada |
language |
English |
description |
We describe the National Research Council of Canada (NRC) submissions for the 2020 Inuktitut–English shared task on news translation at the Fifth Conference on Machine Translation (WMT20). Our submissions consist of ensembled domain-specific finetuned transformer models, trained using the Nunavut Hansard and news data and, in the case of Inuktitut–English, backtranslated news and parliamentary data. In this work we explore challenges related to the relatively small amount of parallel data, morphological complexity, and domain shifts. Peer reviewed: Yes NRC publication: Yes |
format |
Article in Journal/Newspaper |
author |
Knowles, Rebecca Stewart, Darlene Larkin, Samuel Littell, Patrick |
spellingShingle |
Knowles, Rebecca Stewart, Darlene Larkin, Samuel Littell, Patrick NRC systems for the 2020 Inuktitut–English news translation task |
author_facet |
Knowles, Rebecca Stewart, Darlene Larkin, Samuel Littell, Patrick |
author_sort |
Knowles, Rebecca |
title |
NRC systems for the 2020 Inuktitut–English news translation task |
title_short |
NRC systems for the 2020 Inuktitut–English news translation task |
title_full |
NRC systems for the 2020 Inuktitut–English news translation task |
title_fullStr |
NRC systems for the 2020 Inuktitut–English news translation task |
title_full_unstemmed |
NRC systems for the 2020 Inuktitut–English news translation task |
title_sort |
nrc systems for the 2020 inuktitut–english news translation task |
publisher |
Association of Computational Linguistics |
publishDate |
2020 |
url |
https://nrc-publications.canada.ca/eng/view/accepted/?id=e06a1d9c-5574-4ea1-8b93-1ab28090e851 https://nrc-publications.canada.ca/eng/view/object/?id=e06a1d9c-5574-4ea1-8b93-1ab28090e851 https://nrc-publications.canada.ca/fra/voir/objet/?id=e06a1d9c-5574-4ea1-8b93-1ab28090e851 |
geographic |
Canada Nunavut |
geographic_facet |
Canada Nunavut |
genre |
inuktitut Nunavut |
genre_facet |
inuktitut Nunavut |
op_relation |
5th Conference on Machine Translation (WMT), The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 19-20, 2020 [Held Online], ISBN: 978-1-948087-81-0, Publication date: 2020-11-19, Pages: 155–169 |
_version_ |
1766046546860703744 |