DocumentCode
680708
Title
Augmented classification of Japanese visemes and hierarchical weighted discrimination for visual speech recognition
Author
Okita, Shinsuke ; Mitsukura, Yasue ; Hamada, Nozomu
Author_Institution
Dept. of Syst. Design Eng., Keio Univ., Yokohama, Japan
fYear
2013
fDate
13-15 Dec. 2013
Firstpage
62
Lastpage
67
Abstract
Visemes have been studied for applications such as automatic speech recognition, speech animation synthesis, and speaker verification. A viseme is a visually identifiable unit of utterance, that is, the visual-domain equivalent of the phoneme in the audio domain. The classification and discrimination of visemes remain important topics. This paper focuses on the number of classification units and on a discrimination procedure for Japanese visemes. We extend the number of visemes from 6 to 9 to expand the word representation by viseme sequences, and then propose hierarchical weighted discrimination using multiple discriminative analysis (MDA) to enhance discriminative ability. To verify and discuss the effectiveness of our proposals, viseme discrimination and word recognition experiments were conducted. The results confirmed the validity of the proposed methods.
Keywords
image classification; image representation; natural language processing; speech recognition; Japanese visemes; MDA; audio domain; augmented classification; discriminative ability enhancement; equivalent unit; hierarchical weighted discrimination; multiple discriminative analysis; phoneme; utterance unit; visual domain; visual speech recognition; visually identifiable unit; word recognition; word representation; Accuracy; Conferences; Feature extraction; Speech; Speech recognition; Support vector machine classification; Visualization; image processing; pattern recognition; visemes
fLanguage
English
Publisher
IEEE
Conference_Titel
Systems, Process & Control (ICSPC), 2013 IEEE Conference on
Conference_Location
Kuala Lumpur
Print_ISBN
978-1-4799-2208-6
Type
conf
DOI
10.1109/SPC.2013.6735104
Filename
6735104