• DocumentCode
    2263865
  • Title

    Automatic sign segmentation from continuous signing via multiple sequence alignment

  • Author

    Santemiz, Pinar ; Aran, Oya ; Saraclar, Murat ; Akarun, Lale

  • Author_Institution
    Dept. of Comput. Eng., Bogazici Univ., Istanbul, Turkey
  • fYear
    2009
  • fDate
    Sept. 27 2009-Oct. 4 2009
  • Firstpage
    2001
  • Lastpage
    2008
  • Abstract
    In order to build a sign language recognition framework, one needs to collect sign databases that contain multiple samples of isolated signs, which is a hard and time consuming task. In this study, our aim is to obtain such a database by automatically extracting isolated signs from continuous signing, recorded from the broadcast news for the hearing-impaired. We present an unsupervised, multiple alignment-based approach for sign segmentation. Among the modalities used to form a sign, hand gestures carry most of the information, manifested as hand motion and shape. To handle these two sources of information, we experimented with different feature sets, with different fusion methods on different alignment approaches: feature concatenation on Dynamic Time Warping (DTW) and Hidden Markov Models (HMMs), modeling via coupled and parallel HMMs, and sequential fusion of DTW and HMM. Our experiments on Turkish broadcast news videos show that (1) using low level shape descriptors is suitable for the alignment task, (2) the highest accuracy is obtained by modeling the signs with HMM using the intervals found previously by DTW.
  • Keywords
    gesture recognition; handicapped aids; hidden Markov models; image segmentation; natural language processing; automatic sign segmentation; continuous signing; dynamic time warping; feature concatenation; hearing-impaired; hidden Markov models; multiple alignment-based approach; multiple sequence alignment; sequential fusion; sign databases; sign language recognition; unsupervised approach; Broadcasting; Data engineering; Data mining; Databases; Dictionaries; Handicapped aids; Hidden Markov models; Shape; Speech; Videos;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision Workshops (ICCV Workshops), 2009 IEEE 12th International Conference on
  • Conference_Location
    Kyoto
  • Print_ISBN
    978-1-4244-4442-7
  • Electronic_ISBN
    978-1-4244-4441-0
  • Type

    conf

  • DOI
    10.1109/ICCVW.2009.5457527
  • Filename
    5457527