DocumentCode
2720005
Title
Advances in phonetics-based sub-unit modeling for transcription alignment and sign language recognition
Author
Pitsikalis, Vassilis ; Theodorakis, Stavros ; Vogler, Christian ; Maragos, Petros
Author_Institution
Sch. of Electr. & Comput. Eng., Nat. Tech. Univ. of Athens, Athens, Greece
fYear
2011
fDate
20-25 June 2011
Firstpage
1
Lastpage
6
Abstract
We explore novel directions for incorporating phonetic transcriptions into sub-unit based statistical models for sign language recognition. First, we employ a new symbolic processing approach for converting sign language annotations, based on HamNoSys symbols, into structured sequences of labels according to the Posture-Detention-Transition-Steady Shift phonetic model. Next, we exploit these labels, and their correspondence with visual features to construct phonetics-based statistical sub-unit models. We also align these sequences, via the statistical sub-unit construction and decoding, to the visual data to extract time boundary information that they would lack otherwise. The resulting phonetic sub-units offer new perspectives for sign language analysis, phonetic modeling, and automatic recognition. We evaluate this approach via sign language recognition experiments on an extended Lemmas Corpus of Greek Sign Language, which results not only in improved performance compared to pure data-driven approaches, but also in meaningful phonetic sub-unit models that can be further exploited in interdisciplinary sign language analysis.
Keywords
gesture recognition; image coding; speech processing; speech recognition; statistical analysis; Greek sign language; HamNoSys symbols; Lemmas corpus; automatic recognition; decoding; phonetic based statistical subunit model; phonetic transcription alignment; posture-detention-transition steady shift phonetic model; sign language annotation; sign language recognition; statistical models; structured sequences; symbolic processing approach; time boundary information extraction; visual data; Feature extraction; Handicapped aids; Hidden Markov models; Speech recognition; Training; Trajectory; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition Workshops (CVPRW), 2011 IEEE Computer Society Conference on
Conference_Location
Colorado Springs, CO
ISSN
2160-7508
Print_ISBN
978-1-4577-0529-8
Type
conf
DOI
10.1109/CVPRW.2011.5981681
Filename
5981681
Link To Document