Speech Technology for Minority Languages: the Case of Irish (Gaelic)

PUBLISHED Pittsburgh Abstract?Unit selection is a data-driven approach to speech synthesis that concatenates pieces of recorded speech from a large database in order to create novel sentences. Many corpora are available in the English language, including the Arctic database [1], which allows a user...

Full description

Bibliographic Details
Main Authors: NI CHASAIDE, AILBHE, GOBL, CHRISTER
Format: Conference Object
Language:English
Published: 2006
Subjects:
Online Access:http://hdl.handle.net/2262/39404
http://people.tcd.ie/anichsid
http://people.tcd.ie/cegobl
http://tcd.academia.edu/documents/0027/8616/corpus.pdf
id fttrinitycoll:oai:tara.tcd.ie:2262/39404
record_format openpolar
spelling fttrinitycoll:oai:tara.tcd.ie:2262/39404 2023-05-15T14:54:03+02:00 Speech Technology for Minority Languages: the Case of Irish (Gaelic) Proceedings of the 9th International Conference on Spoken Language Processing INTERSPEECH 2006 NI CHASAIDE, AILBHE GOBL, CHRISTER 2006 181 184 http://hdl.handle.net/2262/39404 http://people.tcd.ie/anichsid http://people.tcd.ie/cegobl http://tcd.academia.edu/documents/0027/8616/corpus.pdf en eng N? Chasaide, A., Wogan, J., ? Raghallaigh, B., N? Bhriain, ?., Zoerner, E., Berthelsen, H. and Gobl, C., Speech Technology for Minority Languages: the Case of Irish (Gaelic), Proceedings of the 9th International Conference on Spoken Language Processing, INTERSPEECH 2006, Pittsburgh, 2006, 181 - 184 Y http://hdl.handle.net/2262/39404 http://people.tcd.ie/anichsid http://people.tcd.ie/cegobl 45524 http://tcd.academia.edu/documents/0027/8616/corpus.pdf Y speech synthesis corpus design Arctic Irish Making Ireland Conference Paper scholarly_publications refereed_publications 2006 fttrinitycoll 2020-02-16T13:48:58Z PUBLISHED Pittsburgh Abstract?Unit selection is a data-driven approach to speech synthesis that concatenates pieces of recorded speech from a large database in order to create novel sentences. Many corpora are available in the English language, including the Arctic database [1], which allows a user to create small, reliable speech synthesisers using only a small set of recorded sentences. Such resources for minority languages are scarce however, despite their increasing importance for the survival of such languages. This paper describes the current research in creating efficient Irish language corpora for speech synthesis. Corpus design techniques are discussed, in particular, two methods of data reduction that are applied to an aligned spoken corpus of Irish in order to create smaller, more efficient speech corpora. The CAB ' OGA'I II project is funded by Foras na Gaeilge. Conference Object Arctic The University of Dublin, Trinity College: TARA (Trinity's Access to Research Archive) Arctic
institution Open Polar
collection The University of Dublin, Trinity College: TARA (Trinity's Access to Research Archive)
op_collection_id fttrinitycoll
language English
topic speech synthesis
corpus design
Arctic
Irish
Making Ireland
spellingShingle speech synthesis
corpus design
Arctic
Irish
Making Ireland
NI CHASAIDE, AILBHE
GOBL, CHRISTER
Speech Technology for Minority Languages: the Case of Irish (Gaelic)
topic_facet speech synthesis
corpus design
Arctic
Irish
Making Ireland
description PUBLISHED Pittsburgh Abstract?Unit selection is a data-driven approach to speech synthesis that concatenates pieces of recorded speech from a large database in order to create novel sentences. Many corpora are available in the English language, including the Arctic database [1], which allows a user to create small, reliable speech synthesisers using only a small set of recorded sentences. Such resources for minority languages are scarce however, despite their increasing importance for the survival of such languages. This paper describes the current research in creating efficient Irish language corpora for speech synthesis. Corpus design techniques are discussed, in particular, two methods of data reduction that are applied to an aligned spoken corpus of Irish in order to create smaller, more efficient speech corpora. The CAB ' OGA'I II project is funded by Foras na Gaeilge.
format Conference Object
author NI CHASAIDE, AILBHE
GOBL, CHRISTER
author_facet NI CHASAIDE, AILBHE
GOBL, CHRISTER
author_sort NI CHASAIDE, AILBHE
title Speech Technology for Minority Languages: the Case of Irish (Gaelic)
title_short Speech Technology for Minority Languages: the Case of Irish (Gaelic)
title_full Speech Technology for Minority Languages: the Case of Irish (Gaelic)
title_fullStr Speech Technology for Minority Languages: the Case of Irish (Gaelic)
title_full_unstemmed Speech Technology for Minority Languages: the Case of Irish (Gaelic)
title_sort speech technology for minority languages: the case of irish (gaelic)
publishDate 2006
url http://hdl.handle.net/2262/39404
http://people.tcd.ie/anichsid
http://people.tcd.ie/cegobl
http://tcd.academia.edu/documents/0027/8616/corpus.pdf
geographic Arctic
geographic_facet Arctic
genre Arctic
genre_facet Arctic
op_relation N? Chasaide, A., Wogan, J., ? Raghallaigh, B., N? Bhriain, ?., Zoerner, E., Berthelsen, H. and Gobl, C., Speech Technology for Minority Languages: the Case of Irish (Gaelic), Proceedings of the 9th International Conference on Spoken Language Processing, INTERSPEECH 2006, Pittsburgh, 2006, 181 - 184
Y
http://hdl.handle.net/2262/39404
http://people.tcd.ie/anichsid
http://people.tcd.ie/cegobl
45524
http://tcd.academia.edu/documents/0027/8616/corpus.pdf
op_rights Y
_version_ 1766325738063003648