Title :
Augmented classification of Japanese visemes and hierarchical weighted discrimination for visual speech recognition
Author :
Okita, Shinsuke ; Mitsukura, Yasue ; Hamada, Nozomu
Author_Institution :
Dept. of Syst. Design Eng., Keio Univ., Yokohama, Japan
Abstract :
For the purpose of automatic speech recognition and speech animation synthesis, speaker verification and so on, there have been studies on `viseme´. Viseme is a visually identifiable unit of utterance or the equivalent unit in the visual domain of the phoneme in audio domain. The classification and the discrimination method of visemes are still important topics. This paper focuses on the number of classification units and a discrimination procedure of Japanese visemes: We extend the number of visemes from 6 to 9 to expanse the word representation by their series, then propose the hierarchical weighted discrimination using multiple discriminative analysis (MDA) to enhance the discriminative ability. In order to verify and discuss the availability of our proposals, visemes discrimination and word recognition experiments were conducted. From these results, the validity of the proposed methods was confirmed.
Keywords :
image classification; image representation; natural language processing; speech recognition; Japanese visemes; MDA; audio domain; augmented classification; discriminative ability enhancement; equivalent unit; hierarchical weighted discrimination; multiple discriminative analysis; phoneme; utterance unit; visual domain; visual speech recognition; visually identifiable unit; word recognition; word representation; Accuracy; Conferences; Feature extraction; Speech; Speech recognition; Support vector machine classification; Visualization; image processing; pattern recognition; visemes; visual speech recognition;
Conference_Titel :
Systems, Process & Control (ICSPC), 2013 IEEE Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4799-2208-6
DOI :
10.1109/SPC.2013.6735104