DocumentCode :
2951565
Title :
Automatic Extraction of Geometric Lip Features with Application to Multi-Modal Speaker Identification
Author :
Arsic, Ivana ; Vilagut, Roger ; Thiran, Jean-Philippe
Author_Institution :
Signal Process. Inst., Ecole Polytech. Fed. de Lausanne
fYear :
2006
fDate :
9-12 July 2006
Firstpage :
161
Lastpage :
164
Abstract :
In this paper we consider the problem of automatic extraction of the geometric lip features for the purposes of multi-modal speaker identification. The use of visual information from the mouth region can be of great importance for improving the speaker identification system performance in noisy conditions. We propose a novel method for automated lip features extraction that utilizes color space transformation and a fuzzy-based c-means clustering technique. Using the obtained visual cues closed-set audio-visual speaker identification experiments are performed on the CUAVE database, showing promising results
Keywords :
audio-visual systems; feature extraction; fuzzy logic; pattern clustering; speaker recognition; visual databases; CUAVE database; automatic extraction; closed-set audio-visual system; color space transformation; fuzzy-based c-means clustering technique; geometric lip feature; multimodal speaker identification; visual cue; visual information; Audio databases; Data mining; Face detection; Feature extraction; Mouth; Robustness; Spatial databases; Speech recognition; System performance; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2006 IEEE International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
1-4244-0366-7
Electronic_ISBN :
1-4244-0367-7
Type :
conf
DOI :
10.1109/ICME.2006.262594
Filename :
4036561
Link To Document :
بازگشت