Title :
Designing a robust speech and gaze multimodal system for diverse users
Author :
Zhang, Qiaohui ; Go, Kentaro ; Imamiya, Atsumi ; Mao, Xiaoyang
Author_Institution :
Dept. of Comput. & Media Eng., Yamanashi Univ., Kofu, Japan
Abstract :
The recognition errors make recognition-based systems brittle, and lead to usability problems. Multimodal system is generally believed as an effective means of being able to contribute to error avoidance and recovery. This work explores how to combine gaze and speech, which are two error-prone modes, in order to get a robust multimodal architecture. Combining the two overcomes imperfections of recognition techniques, compensates for drawbacks of a single mode, resolves the language ambiguity, and leads to a much more effective system. In addition, we try to employ a new performance criterion about the error-handling ability to analyze and assess the multimodal integration strategies. With this new measure approach, not only the benefits of mutual disambiguation of individual input signals within the multimodal architecture are demonstrated, but also the condition under which the multimodal system becomes the most effective is identified.
Keywords :
error handling; human computer interaction; image recognition; speech recognition; error avoidance; error recovery; error-handling ability; error-prone mode; eye tracking; gaze multimodal system; human computer interaction; multimodal integration; recognition error; recognition-based system; robust multimodal architecture; robust speech system; speech input; speech multimodal system; Computer architecture; Computer errors; Computer interfaces; Design engineering; Error analysis; Error correction; Mice; Performance analysis; Robustness; Speech recognition;
Conference_Titel :
Information Reuse and Integration, 2003. IRI 2003. IEEE International Conference on
Print_ISBN :
0-7803-8242-0
DOI :
10.1109/IRI.2003.1251437