Regional Information Center for Science and Technology - A PCA Based Visual DCT Feature Extraction Method for Lip-Reading

DocumentCode :

2975507

Title :

A PCA Based Visual DCT Feature Extraction Method for Lip-Reading

Author :

Hong, Xiaopeng ; Yao, Hongxun ; Wan, Yuqi ; Chen, Rong

Author_Institution :

Harbin Institute of Technology, China

fYear :

2006

fDate :

Dec. 2006

Firstpage :

321

Lastpage :

326

Abstract :

This paper proposes a PCA-based method for reducing the dimensionality of DCT coefficients in visual-only lip-reading systems. A three-stage, pixel-based visual front end is adopted. First, DCT or block-based DCT features are extracted. Second, Principal Component Analysis (PCA) is applied for dimension reduction. Finally, all feature vectors are normalized to a uniform scale. The three-stage method is compared with PCA alone and with two DCT-based approaches whose features are selected manually. In these baseline approaches, PCA coefficients are selected according to energy, while the reduction of DCT coefficients favors the components in the top-left corner. Experiments show that PCA-based dimension reduction improves recognition accuracy when the final dimension is below a certain value. They also show that DCT and block-based DCT perform similarly on the lip-reading task, slightly outperforming PCA.
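
The three stages described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names (`dct_matrix`, `dct2`, `dct_top_left`, `pca_fit`, `normalize`, `extract_features`) are hypothetical, whole-image DCT is used instead of the block-based variant, z-score scaling stands in for the unspecified "uniform scale" normalization, and a square top-left block stands in for the manual DCT selection, whose exact shape the abstract does not give.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II basis matrix of size n x n."""
    k = np.arange(n)[:, None]   # frequency index (rows)
    i = np.arange(n)[None, :]   # sample index (columns)
    m = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    m[0, :] = np.sqrt(1.0 / n)  # DC row uses a different scale factor
    return m

def dct2(image):
    """Stage 1: 2-D DCT-II of one grayscale mouth image."""
    h, w = image.shape
    return dct_matrix(h) @ image @ dct_matrix(w).T

def dct_top_left(image, k):
    """Manual baseline (assumed form): keep the k x k low-frequency corner."""
    return dct2(image)[:k, :k].ravel()

def pca_fit(features, n_components):
    """Stage 2: mean and leading principal directions via SVD."""
    mean = features.mean(axis=0)
    # Rows of vt are principal directions, ordered by decreasing variance.
    _, _, vt = np.linalg.svd(features - mean, full_matrices=False)
    return mean, vt[:n_components]

def normalize(features, eps=1e-12):
    """Stage 3: z-score each dimension (an assumed choice of scaling)."""
    return (features - features.mean(axis=0)) / (features.std(axis=0) + eps)

def extract_features(frames, n_components):
    """Run DCT, PCA reduction, then normalization on a stack of frames."""
    dct_feats = np.stack([dct2(f).ravel() for f in frames])
    mean, basis = pca_fit(dct_feats, n_components)
    reduced = (dct_feats - mean) @ basis.T
    return normalize(reduced)
```

Given an array `frames` of shape `(num_frames, height, width)`, `extract_features(frames, 5)` would return one 5-dimensional normalized vector per frame, while `dct_top_left(frames[0], 4)` would return the 16 lowest-frequency DCT coefficients of a single frame for comparison.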

Keywords :

Computer science; Data mining; Discrete cosine transforms; Discrete wavelet transforms; Feature extraction; Frequency; Linear discriminant analysis; Pixel; Principal component analysis; Speech recognition;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Intelligent Information Hiding and Multimedia Signal Processing, 2006. IIH-MSP '06. International Conference on

Conference_Location :

Pasadena, CA, USA

Print_ISBN :

0-7695-2745-0

Type :

conf

DOI :

10.1109/IIH-MSP.2006.265008

Filename :

4041728

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2975507