Title :
Visual feature extraction for isolated word visual only speech recognition of Vietnamese
Author :
Nguyen Thien Chuong ; Chaloupka, J.
Author_Institution :
Inst. of Inf. Technol. & Electron., Tech. Univ. of Liberec, Liberec, Czech Republic
Abstract :
This paper presents our research on visual feature extraction with some special treatment for dealing with Vietnamese language. The effect of linear discriminant analysis (LDA) when training with different sets of basic class will be examined. For improving the visual features, we proposed two types of visual front end for automatic lip-reading: (a) 1-Stage LDA visual front end; and (b) hierarchical LDA (HLDA) visual front end. We also compare four different types of visual feature on an isolated word visual only speech recognition of Vietnamese task using our recorded audio-visual speech database. Experiments on our database show that the proposed visual front end improves up to 8% of recognition accuracy and the HLDA visual front end outperform the other.
Keywords :
audio databases; feature extraction; image recognition; natural language processing; speech recognition; visual databases; 1-stage LDA visual front end; Vietnamese language; audio-visual speech database; automatic lip-reading; hierarchical LDA visual front end; isolated word visual only speech recognition; linear discriminant analysis; recognition accuracy; visual feature extraction; Discrete cosine transforms; Feature extraction; Mouth; Principal component analysis; Training; Vectors; Visualization; Audio-visual speech recognition; LDA; Vietnamese language. visual feature; isolated word recognition;
Conference_Titel :
Telecommunications and Signal Processing (TSP), 2013 36th International Conference on
Conference_Location :
Rome
Print_ISBN :
978-1-4799-0402-0
DOI :
10.1109/TSP.2013.6613974