Augmented classification of Japanese visemes and hierarchical weighted discrimination for visual speech recognition

Author

Okita, Shinsuke ; Mitsukura, Yasue ; Hamada, Nozomu

Author_Institution

Dept. of Syst. Design Eng., Keio Univ., Yokohama, Japan

fYear

2013

fDate

13-15 Dec. 2013

Firstpage

62

Lastpage

67

Abstract

Visemes have been studied for applications such as automatic speech recognition, speech animation synthesis, and speaker verification. A viseme is a visually identifiable unit of utterance, the visual-domain equivalent of the phoneme in the audio domain. The classification and discrimination of visemes remain important topics. This paper focuses on the number of classification units and on a discrimination procedure for Japanese visemes. We extend the number of visemes from 6 to 9 to expand the word representation by viseme sequences, and then propose hierarchical weighted discrimination using multiple discriminative analysis (MDA) to enhance discriminative ability. To verify and discuss the effectiveness of these proposals, viseme discrimination and word recognition experiments were conducted. The results confirmed the validity of the proposed methods.

Keywords

image classification; image representation; natural language processing; speech recognition; Japanese visemes; MDA; audio domain; augmented classification; discriminative ability enhancement; equivalent unit; hierarchical weighted discrimination; multiple discriminative analysis; phoneme; utterance unit; visual domain; visual speech recognition; visually identifiable unit; word recognition; word representation; Accuracy; Conferences; Feature extraction; Speech; Speech recognition; Support vector machine classification; Visualization; image processing; pattern recognition; visemes

fLanguage

English

Publisher

IEEE

Conference_Titel

Systems, Process & Control (ICSPC), 2013 IEEE Conference on

Conference_Location

Kuala Lumpur

Print_ISBN

978-1-4799-2208-6

Type

conf

DOI

10.1109/SPC.2013.6735104

Filename

6735104