Title :
Building an application framework for speech and pen input integration in multimodal learning interfaces
Author :
Vo, Minh Tue ; Wood, Cindy
Author_Institution :
Interactive Syst. Labs., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
While significant advances have been made improve speech recognition performance, and gesture and handwriting recognition, speech- and pen-based systems have still not found broad acceptance in everyday life. One reason for this is the inflexibility of each input modality when used alone. Human communication is very natural and flexible because we can take advantage of a multiplicity of communication signals working in concert to supply complementary information or increase robustness with redundancy. We present a multimodal interface capable of jointly interpreting speech, pen-based gestures, and handwriting in the context of an appointment scheduling application. The interpretation engine based on semantic frame merging correctly interprets 80% of a multimodal data set assuming perfect speech and gesture/handwriting recognition; in the presence of recognition errors the interpretation performance is in the range of 35-62%. A dialog processing scheme uses task domain knowledge to guide the user in supplying information and permits human-computer interactions to span several related multimodal input events
Keywords :
character recognition; graphical user interfaces; learning (artificial intelligence); natural language interfaces; scheduling; speech recognition; application framework; appointment scheduling application; dialog processing scheme; handwriting; handwriting recognition; human-computer interactions; input modality; interpretation performance; multimodal learning interfaces; pen input integration; semantic frame merging; speech input integration; speech recognition; task domain knowledge; Calendars; Engines; Error correction; Handwriting recognition; Interactive systems; Laboratories; Merging; Personal digital assistants; Speech recognition; Writing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
0-7803-3192-3
DOI :
10.1109/ICASSP.1996.550794