DocumentCode :
2512155
Title :
Multimodal Human Computer Interaction with MIDAS Intelligent Infokiosk
Author :
Karpov, Alexey ; Ronzhin, Andrey ; Kipyatkova, Irina ; Ronzhin, Alexander ; Akarun, Lale
Author_Institution :
St. Petersburg Inst. for Inf. & Autom., RAS, St. Petersburg, Russia
fYear :
2010
fDate :
23-26 Aug. 2010
Firstpage :
3862
Lastpage :
3865
Abstract :
In this paper, we present an intelligent information kiosk called MIDAS (Multimodal Interactive-Dialogue Automaton for Self-service), including its hardware and software architecture, stages of deployment of speech recognition and synthesis technologies. MIDAS uses the methodology Wizard of Oz (WOZ) that allows an expert to correct speech recognition results and control the dialogue flow. User statistics of the multimodal human computer interaction (HCI) have been analyzed for the operation of the kiosk in the automatic and automated modes. The infokiosk offers information about the structure and staff of laboratories, the location and phones of departments and employees of the institution. The multimodal user interface is provided with a touch screen, natural speech input and head and manual gestures, both for ordinary and physically handicapped users.
Keywords :
human computer interaction; interactive systems; software architecture; speech recognition; speech synthesis; speech-based user interfaces; touch sensitive screens; HCI; MIDAS intelligent infokiosk; WOZ; Wizard of Oz; dialogue flow; hardware architecture; head gestures; intelligent information kiosk; manual gestures; multimodal human computer interaction; multimodal interactive-dialogue automaton for self-service; multimodal user interface; natural speech input; ordinary users; physically handicapped users; software architecture; speech recognition; speech synthesis technology; touch screen; Data models; Grammar; Hidden Markov models; Human computer interaction; Laboratories; Speech; Speech recognition; artificial intelligence; automatic speech recognition; human-computer interaction; infokiosk; multimodal user interfaces; speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
ISSN :
1051-4651
Print_ISBN :
978-1-4244-7542-1
Type :
conf
DOI :
10.1109/ICPR.2010.941
Filename :
5597644
Link To Document :
بازگشت