مرکز منطقه ای اطلاع رساني علوم و فناوري - A method of visual speech feature area localization

DocumentCode :

2736982

Title :

A method of visual speech feature area localization

Author :

Meng, Shan ; Zhang, Youwei

Author_Institution :

Inst. of Inf. Sci., Wuyi Univ., Guangdong, China

Volume :

fYear :

2003

fDate :

14-17 Dec. 2003

Firstpage :

1173

Abstract :

In audio-visual bimodal man-machine interaction, extracting Region Of Interest (ROI) that carries visual speech features is a very crucial step. In this paper, our work about ROI localization is described in detail. First, we propose a simplified human skin color model to segment input images and estimate the location of human face. When we locate ROI in the human face area, the traditional linear methods´ performance cannot satisfy system´s need, especially for unseen subjects. Then we propose a new localization method that is a combination of Support Vector Machine (SVM) and Distance of Likelihood in Feature Space (DLFS) derived from Kernel Principal Component Analysis (KPCA). Results show that the new method outperformed traditional linear ones. All experiments are based on Chinese Audio-Visual Speech Database2 (CAVSD).

Keywords :

face recognition; image segmentation; principal component analysis; speech recognition; support vector machines; SVM; audio visual bimodal man machine interaction; chinese audio visual speech database2; distance of likelihood in feature space; human face location; human skin color model; image segmentation; kernel principal component analysis; region of interest; support vector machines; visual speech feature area localization; Face; Humans; Image segmentation; Kernel; Man machine systems; Principal component analysis; Skin; Spatial databases; Speech; Support vector machines;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks and Signal Processing, 2003. Proceedings of the 2003 International Conference on

Conference_Location :

Nanjing

Print_ISBN :

0-7803-7702-8

Type :

conf

DOI :

10.1109/ICNNSP.2003.1281078

Filename :

1281078

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2736982