Title : 
Vision-based multimodal human-computer interaction using hand and head gestures
         
        
            Author : 
Agrawal, Ankit ; Raj, Ranga ; Porwal, S.
         
        
            Author_Institution : 
Indian Inst. of Inf. Technol., Allahabad, India
         
        
        
        
        
        
            Abstract : 
Gestures are used in day to day life like nodding and waving without us being aware of them. It has become an important part in the communication among the humans. In the recent years new methods of Human Computer Interaction (HCI) are being developed. Some of them are based on interaction with machines through hand, head, facial expressions, voice, touch and many are still the current topic of research. However relying on just one of them reduces the accuracy of the whole HCI and is also limiting the options available to users. The objective of this paper is thus to use two of the important modes of interaction - hand and head to control any application running on computer using Computer Vision algorithms. From input video stream, hand is segmented and the corresponding gesture is being recognized based on the shape and pattern of movement of hand. For head gesture recognition, head is first detected and then optical flow method is used to get the movement of head which is then recognized by finite state automata. Using the user interface of the software, an operator can control any interactive application (say VLC player, Image browser etc) using hand and head gestures which in turn are automatically mapped to the mouse and keyboard events through Windows API. The proposed multimodal approach is particularly useful to communicate with computers and other electronic appliances from a distance where mouse and keyboard are not convenient to work with.
         
        
            Keywords : 
computer vision; finite state machines; gesture recognition; human computer interaction; image segmentation; video signal processing; HCI; Windows API; computer vision algorithm; facial expression; finite state automata; hand gestures; hand segmentation; head gesture recognition; input video stream; interactive application; keyboard event; mouse event; optical flow method; user interface; vision based multimodal human computer interaction; Accuracy; Computers; Conferences; Gesture recognition; Head; Human computer interaction; Magnetic heads; Hand gesture; Head gesture; Multimodal interface; Personalized HCI; Vision algorithms;
         
        
        
        
            Conference_Titel : 
Information & Communication Technologies (ICT), 2013 IEEE Conference on
         
        
            Conference_Location : 
JeJu Island
         
        
            Print_ISBN : 
978-1-4673-5759-3
         
        
        
            DOI : 
10.1109/CICT.2013.6558300