OATAO - Open Archive Toulouse Archive Ouverte Open Access Week

El-WOZ: a client-server wizard-of-oz open-source interface

Pellegrini, Thomas and Hedayati, Vahid and Costa, Angela El-WOZ: a client-server wizard-of-oz open-source interface. (2014) In: Language Resources and Evaluation Conference - LREC 2014, 26 May 2014 - 31 May 2014 (Reykyavik, Iceland).

[img]
Preview
(Document in English)

PDF (Author's version) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
420kB

Abstract

Wizard of Oz (WOZ) prototyping employs a human wizard to simulate anticipated functions of a future system. In Natural Language Processing this method is usually used to obtain early feedback on dialogue designs, to collect language corpora, or to explore interaction strategies. Yet, existing tools often require complex client-server configurations and setup routines, or suffer from compatibility problems with different platforms. Integrated solutions, which may also be used by designers and researchers without technical background, are missing. In this paper we present a framework for multi-lingual dialog research, which combines speech recognition and synthesis with WOZ. All components are open source and adaptable toIn this paper, we present a speech recording interface developed in the context of a project on automatic speech recognition for elderly native speakers of European Portuguese. In order to collect spontaneous speech in a situation of interaction with a machine, this interface was designed as a Wizard-of-Oz (WOZ) plateform. In this setup, users interact with a fake automated dialog system controled by a human wizard. It was implemented as a client-server application and the subjects interact with a talking head. The human wizard chooses pre-defined questions or sentences in a graphical user interface, which are then synthesized and spoken aloud by the avatar on the client side. A small spontaneous speech corpus was collected in a daily center. Eight speakers between 75 and 90 years old were recorded. They appreciated the interface and felt at ease with the avatar. Manual orthographic transcriptions were created for the total of about 45 minutes of speech. different application scenarios

Item Type:Conference or Workshop Item (Paper)
Additional Information:The LREC 2014 Proceedings are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
HAL Id:hal-01145413
Audience (conference):International conference proceedings
Uncontrolled Keywords:
Institution:French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE)
Other partners > Instituto de Engenharia de Sistemas e Computadores - Investigação e Desenvolvimento - INESC-ID (PORTUGAL)
Université de Toulouse > Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE)
Université de Toulouse > Université Toulouse III - Paul Sabatier - UT3 (FRANCE)
Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE)
Other partners > Universidade Nova de Lisboa - UNL (PORTUGAL)
Laboratory name:
Statistics:download
Deposited On:24 Apr 2015 09:23

Repository Staff Only: item control page