DocumentCode :
2995468
Title :
Building an application framework for speech and pen input integration in multimodal learning interfaces
Author :
Vo, Minh Tue ; Wood, Cindy
Author_Institution :
Interactive Syst. Labs., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
6
fYear :
1996
fDate :
7-10 May 1996
Firstpage :
3545
Abstract :
While significant advances have been made improve speech recognition performance, and gesture and handwriting recognition, speech- and pen-based systems have still not found broad acceptance in everyday life. One reason for this is the inflexibility of each input modality when used alone. Human communication is very natural and flexible because we can take advantage of a multiplicity of communication signals working in concert to supply complementary information or increase robustness with redundancy. We present a multimodal interface capable of jointly interpreting speech, pen-based gestures, and handwriting in the context of an appointment scheduling application. The interpretation engine based on semantic frame merging correctly interprets 80% of a multimodal data set assuming perfect speech and gesture/handwriting recognition; in the presence of recognition errors the interpretation performance is in the range of 35-62%. A dialog processing scheme uses task domain knowledge to guide the user in supplying information and permits human-computer interactions to span several related multimodal input events
Keywords :
character recognition; graphical user interfaces; learning (artificial intelligence); natural language interfaces; scheduling; speech recognition; application framework; appointment scheduling application; dialog processing scheme; handwriting; handwriting recognition; human-computer interactions; input modality; interpretation performance; multimodal learning interfaces; pen input integration; semantic frame merging; speech input integration; speech recognition; task domain knowledge; Calendars; Engines; Error correction; Handwriting recognition; Interactive systems; Laboratories; Merging; Personal digital assistants; Speech recognition; Writing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
ISSN :
1520-6149
Print_ISBN :
0-7803-3192-3
Type :
conf
DOI :
10.1109/ICASSP.1996.550794
Filename :
550794
Link To Document :
بازگشت