El-WOZ: a client-server wizard-of-oz open-source interface

International audience Wizard of Oz (WOZ) prototyping employs a human wizard to simulate anticipated functions of a future system. In Natural Language Processing this method is usually used to obtain early feedback on dialogue designs, to collect language corpora, or to explore interaction strategie...

Full description

Bibliographic Details
Main Authors: Pellegrini, Thomas, Hedayati, Vahid, Costa, Angela
Other Authors: Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio (IRIT-SAMoVA), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento em Lisboa (INESC-ID), Instituto Superior Técnico, Universidade Técnica de Lisboa (IST)-Instituto de Engenharia de Sistemas e Computadores (INESC), Centro de Linguística da Universidade Nova de Lisboa (CLUNL), Faculty of Social and Human Sciences (NOVAFCSH), Universidade Nova de Lisboa = NOVA University Lisbon (NOVA)-Universidade Nova de Lisboa = NOVA University Lisbon (NOVA)
Format: Conference Object
Language:English
Published: HAL CCSD 2014
Subjects:
Online Access:https://hal.archives-ouvertes.fr/hal-01145413
https://hal.archives-ouvertes.fr/hal-01145413/document
https://hal.archives-ouvertes.fr/hal-01145413/file/Pellegrini_13044.pdf
id ftccsdartic:oai:HAL:hal-01145413v1
record_format openpolar
institution Open Polar
collection Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe)
op_collection_id ftccsdartic
language English
topic Wizard-of-oz
European Portuguese elderly speech
Speech recording interface
[INFO.INFO-GR]Computer Science [cs]/Graphics [cs.GR]
[INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
[INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV]
[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
spellingShingle Wizard-of-oz
European Portuguese elderly speech
Speech recording interface
[INFO.INFO-GR]Computer Science [cs]/Graphics [cs.GR]
[INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
[INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV]
[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
Pellegrini, Thomas
Hedayati, Vahid
Costa, Angela
El-WOZ: a client-server wizard-of-oz open-source interface
topic_facet Wizard-of-oz
European Portuguese elderly speech
Speech recording interface
[INFO.INFO-GR]Computer Science [cs]/Graphics [cs.GR]
[INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
[INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV]
[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
description International audience Wizard of Oz (WOZ) prototyping employs a human wizard to simulate anticipated functions of a future system. In Natural Language Processing this method is usually used to obtain early feedback on dialogue designs, to collect language corpora, or to explore interaction strategies. Yet, existing tools often require complex client-server configurations and setup routines, or suffer from compatibility problems with different platforms. Integrated solutions, which may also be used by designers and researchers without technical background, are missing. In this paper we present a framework for multi-lingual dialog research, which combines speech recognition and synthesis with WOZ. All components are open source and adaptable toIn this paper, we present a speech recording interface developed in the context of a project on automatic speech recognition for elderly native speakers of European Portuguese. In order to collect spontaneous speech in a situation of interaction with a machine, this interface was designed as a Wizard-of-Oz (WOZ) plateform. In this setup, users interact with a fake automated dialog system controled by a human wizard. It was implemented as a client-server application and the subjects interact with a talking head. The human wizard chooses pre-defined questions or sentences in a graphical user interface, which are then synthesized and spoken aloud by the avatar on the client side. A small spontaneous speech corpus was collected in a daily center. Eight speakers between 75 and 90 years old were recorded. They appreciated the interface and felt at ease with the avatar. Manual orthographic transcriptions were created for the total of about 45 minutes of speech. different application scenarios
author2 Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio (IRIT-SAMoVA)
Institut de recherche en informatique de Toulouse (IRIT)
Université Toulouse 1 Capitole (UT1)
Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3)
Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP)
Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1)
Université Fédérale Toulouse Midi-Pyrénées
Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento em Lisboa (INESC-ID)
Instituto Superior Técnico, Universidade Técnica de Lisboa (IST)-Instituto de Engenharia de Sistemas e Computadores (INESC)
Centro de Linguística da Universidade Nova de Lisboa (CLUNL)
Faculty of Social and Human Sciences (NOVAFCSH)
Universidade Nova de Lisboa = NOVA University Lisbon (NOVA)-Universidade Nova de Lisboa = NOVA University Lisbon (NOVA)
format Conference Object
author Pellegrini, Thomas
Hedayati, Vahid
Costa, Angela
author_facet Pellegrini, Thomas
Hedayati, Vahid
Costa, Angela
author_sort Pellegrini, Thomas
title El-WOZ: a client-server wizard-of-oz open-source interface
title_short El-WOZ: a client-server wizard-of-oz open-source interface
title_full El-WOZ: a client-server wizard-of-oz open-source interface
title_fullStr El-WOZ: a client-server wizard-of-oz open-source interface
title_full_unstemmed El-WOZ: a client-server wizard-of-oz open-source interface
title_sort el-woz: a client-server wizard-of-oz open-source interface
publisher HAL CCSD
publishDate 2014
url https://hal.archives-ouvertes.fr/hal-01145413
https://hal.archives-ouvertes.fr/hal-01145413/document
https://hal.archives-ouvertes.fr/hal-01145413/file/Pellegrini_13044.pdf
op_coverage Reykyavik, Iceland
genre Iceland
genre_facet Iceland
op_source Language Resources and Evaluation Conference - LREC 2014
https://hal.archives-ouvertes.fr/hal-01145413
Language Resources and Evaluation Conference - LREC 2014, May 2014, Reykyavik, Iceland. pp. 279-282
op_relation hal-01145413
https://hal.archives-ouvertes.fr/hal-01145413
https://hal.archives-ouvertes.fr/hal-01145413/document
https://hal.archives-ouvertes.fr/hal-01145413/file/Pellegrini_13044.pdf
OATAO: 13044
op_rights info:eu-repo/semantics/OpenAccess
_version_ 1766042774802530304
spelling ftccsdartic:oai:HAL:hal-01145413v1 2023-05-15T16:52:29+02:00 El-WOZ: a client-server wizard-of-oz open-source interface Pellegrini, Thomas Hedayati, Vahid Costa, Angela Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio (IRIT-SAMoVA) Institut de recherche en informatique de Toulouse (IRIT) Université Toulouse 1 Capitole (UT1) Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3) Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP) Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1) Université Fédérale Toulouse Midi-Pyrénées Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento em Lisboa (INESC-ID) Instituto Superior Técnico, Universidade Técnica de Lisboa (IST)-Instituto de Engenharia de Sistemas e Computadores (INESC) Centro de Linguística da Universidade Nova de Lisboa (CLUNL) Faculty of Social and Human Sciences (NOVAFCSH) Universidade Nova de Lisboa = NOVA University Lisbon (NOVA)-Universidade Nova de Lisboa = NOVA University Lisbon (NOVA) Reykyavik, Iceland 2014-05-26 https://hal.archives-ouvertes.fr/hal-01145413 https://hal.archives-ouvertes.fr/hal-01145413/document https://hal.archives-ouvertes.fr/hal-01145413/file/Pellegrini_13044.pdf en eng HAL CCSD hal-01145413 https://hal.archives-ouvertes.fr/hal-01145413 https://hal.archives-ouvertes.fr/hal-01145413/document https://hal.archives-ouvertes.fr/hal-01145413/file/Pellegrini_13044.pdf OATAO: 13044 info:eu-repo/semantics/OpenAccess Language Resources and Evaluation Conference - LREC 2014 https://hal.archives-ouvertes.fr/hal-01145413 Language Resources and Evaluation Conference - LREC 2014, May 2014, Reykyavik, Iceland. pp. 279-282 Wizard-of-oz European Portuguese elderly speech Speech recording interface [INFO.INFO-GR]Computer Science [cs]/Graphics [cs.GR] [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing [INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV] [INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV] [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI] info:eu-repo/semantics/conferenceObject Conference papers 2014 ftccsdartic 2021-10-24T12:06:17Z International audience Wizard of Oz (WOZ) prototyping employs a human wizard to simulate anticipated functions of a future system. In Natural Language Processing this method is usually used to obtain early feedback on dialogue designs, to collect language corpora, or to explore interaction strategies. Yet, existing tools often require complex client-server configurations and setup routines, or suffer from compatibility problems with different platforms. Integrated solutions, which may also be used by designers and researchers without technical background, are missing. In this paper we present a framework for multi-lingual dialog research, which combines speech recognition and synthesis with WOZ. All components are open source and adaptable toIn this paper, we present a speech recording interface developed in the context of a project on automatic speech recognition for elderly native speakers of European Portuguese. In order to collect spontaneous speech in a situation of interaction with a machine, this interface was designed as a Wizard-of-Oz (WOZ) plateform. In this setup, users interact with a fake automated dialog system controled by a human wizard. It was implemented as a client-server application and the subjects interact with a talking head. The human wizard chooses pre-defined questions or sentences in a graphical user interface, which are then synthesized and spoken aloud by the avatar on the client side. A small spontaneous speech corpus was collected in a daily center. Eight speakers between 75 and 90 years old were recorded. They appreciated the interface and felt at ease with the avatar. Manual orthographic transcriptions were created for the total of about 45 minutes of speech. different application scenarios Conference Object Iceland Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe)