• DocumentCode
    607699
  • Title

    Turkish Sign Language recognition using spatio-temporal features on Kinect RGB video sequences and depth maps

  • Author

    Memis, A. ; Albayrak, Sahin

  • Author_Institution
    Bilgisayar Muhendisligi Bolumu, Yildiz Teknik Univ., İstanbul, Turkey
  • fYear
    2013
  • fDate
    24-26 April 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This paper presents a Turkish Sign Language recognition system that uses spatio-temporal features on Kinect sensor RGB video sequences and depth maps. Proposed system uses cumulative motion images which based on motion differences and represent the temporal characteristics of dynamic signs in motion sequences. Cumulative motion images represent the whole motions of signers. 2-D Discrete Cosine Transform (DCT) is applied to cumulative sign images in order to obtain spatial features of signs and transformed images that represent the energy density of signs are obtained. Two transform images are obtained by applying referred methods to both of RGB video sequences and depth maps seperately. Feature vectors of dynamic signs are produced by combining a certain amount of DCT coefficients that contain higher energy via zig-zag scanning on transform images. K-Nearist Neighbor classifier with Manhattan distance used for recognition process. System performance is evaluated on a sign database that contains 1002 signs belongs to 111 words in three different categories of Turkish Sign Language (TID). Proposed sign language recognition system has a recognition rate about %90.
  • Keywords
    discrete cosine transforms; image colour analysis; image motion analysis; image sequences; sign language recognition; video signal processing; 2D discrete cosine transform; K-nearist neighbor classifier; Kinect sensor RGB video sequences; Manhattan distance; Turkish sign language recognition; cumulative motion images; cumulative sign images; depth maps; energy density; feature vectors; motion differences; sign database; spatio-temporal features; transform images; zig-zag scanning; Assistive technology; Discrete cosine transforms; Gesture recognition; Hidden Markov models; Video sequences; Kinect sensor; Turkish Sign Language; depth maps; dynamic signs; sign language recognition; spatio-temporal features;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Communications Applications Conference (SIU), 2013 21st
  • Conference_Location
    Haspolat
  • Print_ISBN
    978-1-4673-5562-9
  • Electronic_ISBN
    978-1-4673-5561-2
  • Type

    conf

  • DOI
    10.1109/SIU.2013.6531360
  • Filename
    6531360