• DocumentCode
    618489
  • Title

    Vision-based multimodal human-computer interaction using hand and head gestures

  • Author

    Agrawal, Ankit ; Raj, Ranga ; Porwal, S.

  • Author_Institution
    Indian Inst. of Inf. Technol., Allahabad, India
  • fYear
    2013
  • fDate
    11-12 April 2013
  • Firstpage
    1288
  • Lastpage
    1292
  • Abstract
    Gestures are used in day to day life like nodding and waving without us being aware of them. It has become an important part in the communication among the humans. In the recent years new methods of Human Computer Interaction (HCI) are being developed. Some of them are based on interaction with machines through hand, head, facial expressions, voice, touch and many are still the current topic of research. However relying on just one of them reduces the accuracy of the whole HCI and is also limiting the options available to users. The objective of this paper is thus to use two of the important modes of interaction - hand and head to control any application running on computer using Computer Vision algorithms. From input video stream, hand is segmented and the corresponding gesture is being recognized based on the shape and pattern of movement of hand. For head gesture recognition, head is first detected and then optical flow method is used to get the movement of head which is then recognized by finite state automata. Using the user interface of the software, an operator can control any interactive application (say VLC player, Image browser etc) using hand and head gestures which in turn are automatically mapped to the mouse and keyboard events through Windows API. The proposed multimodal approach is particularly useful to communicate with computers and other electronic appliances from a distance where mouse and keyboard are not convenient to work with.
  • Keywords
    computer vision; finite state machines; gesture recognition; human computer interaction; image segmentation; video signal processing; HCI; Windows API; computer vision algorithm; facial expression; finite state automata; hand gestures; hand segmentation; head gesture recognition; input video stream; interactive application; keyboard event; mouse event; optical flow method; user interface; vision based multimodal human computer interaction; Accuracy; Computers; Conferences; Gesture recognition; Head; Human computer interaction; Magnetic heads; Hand gesture; Head gesture; Multimodal interface; Personalized HCI; Vision algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information & Communication Technologies (ICT), 2013 IEEE Conference on
  • Conference_Location
    JeJu Island
  • Print_ISBN
    978-1-4673-5759-3
  • Type

    conf

  • DOI
    10.1109/CICT.2013.6558300
  • Filename
    6558300