• DocumentCode
    2137263
  • Title

    A gesture-driven computer interface using Kinect

  • Author

    Lai, Kam ; Konrad, Janusz ; Ishwar, Prakash

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Boston Univ., Boston, MA, USA
  • fYear
    2012
  • fDate
    22-24 April 2012
  • Firstpage
    185
  • Lastpage
    188
  • Abstract
    Automatic recognition of human actions from video has been studied for many years. Although still very difficult in uncontrolled scenarios, it has been successful in more restricted settings (e.g., fixed viewpoint, no occlusions) with recognition rates approaching 100%. However, the best-performing methods are complex and computationally-demanding and thus not well-suited for real-time deployments. This paper proposes to leverage the Kinect camera for close-range gesture recognition using two methods. Both methods use feature vectors that are derived from the skeleton model provided by the Kinect SDK in real-time. Although both methods perform nearest-neighbor classification, one method does this in the space of features using the Euclidean distance metric, while the other method does this in the space of feature covariances using a log-Euclidean metric. Both methods recognize 8 hand gestures in real time achieving correct-classification rates of over 99% on a dataset of 20 subjects but the method based on Euclidean distance requires feature-vector collections to be of the same size, is sensitive to temporal misalignment, and has higher computation and storage requirements.
  • Keywords
    covariance analysis; gesture recognition; human computer interaction; infrared imaging; pattern classification; video signal processing; Euclidean distance metric; Kinect SDK; Kinect camera; automatic human action recognition; close-range gesture recognition; feature covariances; feature vectors; gesture-driven computer interface; hand gesture recognition; human computer interaction; log-Euclidean metric; nearest-neighbor classification; skeleton model; storage temporal; temporal misalignment; Cameras; Covariance matrix; Humans; Joints; Real time systems; Vectors; Gesture recognition; Human action recognition; Human-computer interaction; Kinect camera;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Analysis and Interpretation (SSIAI), 2012 IEEE Southwest Symposium on
  • Conference_Location
    Santa Fe, NM
  • Print_ISBN
    978-1-4673-1831-0
  • Electronic_ISBN
    978-1-4673-1829-7
  • Type

    conf

  • DOI
    10.1109/SSIAI.2012.6202484
  • Filename
    6202484