Regional Information Center for Science and Technology - Creating conversational interfaces for children

DocumentCode :

1253794

Title :

Creating conversational interfaces for children

Author :

Narayanan, Shrikanth ; Potamianos, Alexandros

Author_Institution :

Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA

Volume :

Issue :

fYear :

2002

fDate :

2/1/2002

Firstpage :

Lastpage :

Abstract :

Creating conversational interfaces for children is challenging in several respects. These include acoustic modeling for automatic speech recognition (ASR), language and dialog modeling, and multimodal-multimedia user interface design. First, issues in ASR of children's speech are introduced by an analysis of developmental changes in the spectral and temporal characteristics of the speech signal, using data obtained from 456 children, ages 5 to 18 years. Acoustic modeling adaptation and vocal tract normalization algorithms that yielded state-of-the-art ASR performance on children's speech are described. Second, an experiment designed to better understand how children interact with machines using spoken language is described. Realistic conversational multimedia interaction data were obtained from 160 children who played a voice-activated computer game in a Wizard of Oz (WoZ) scenario. Results of using these data in developing novel language and dialog models, as well as in a unified maximum likelihood framework for acoustic decoding in ASR and semantic classification for spoken language understanding, are described. Leveraging the lessons learned from the WoZ study and a concurrent user experience evaluation, a multimedia personal agent prototype for children was designed. The architecture and application details are described. Informal evaluation by children was positive, especially for the animated agent and the speech interface.

Keywords :

graphical user interfaces; maximum likelihood decoding; multimedia systems; speech recognition; speech-based user interfaces; ASR; Wizard of Oz scenario; WoZ scenario; acoustic decoding; acoustic modeling; acoustic modeling adaptation; automatic speech recognition; children; conversational interfaces; developmental changes; dialog modeling; dialog models; language modeling; language models; multimedia personal agent prototype; multimodal-multimedia user interface design; semantic classification; spectral characteristics; speech signal; spoken language; temporal characteristics; unified maximum likelihood framework; vocal tract normalization algorithms; voice-activated computer game; Application software; Automatic speech recognition; Internet; Multimedia systems; Natural languages; Pediatrics; Prototypes; Speech analysis; Statistics; User interfaces;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

IEEE

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/89.985544

Filename :

985544

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1253794