DocumentCode
2951565
Title
Automatic Extraction of Geometric Lip Features with Application to Multi-Modal Speaker Identification
Author
Arsic, Ivana ; Vilagut, Roger ; Thiran, Jean-Philippe
Author_Institution
Signal Process. Inst., Ecole Polytech. Fed. de Lausanne
fYear
2006
fDate
9-12 July 2006
Firstpage
161
Lastpage
164
Abstract
In this paper we consider the problem of automatic extraction of the geometric lip features for the purposes of multi-modal speaker identification. The use of visual information from the mouth region can be of great importance for improving the speaker identification system performance in noisy conditions. We propose a novel method for automated lip features extraction that utilizes color space transformation and a fuzzy-based c-means clustering technique. Using the obtained visual cues closed-set audio-visual speaker identification experiments are performed on the CUAVE database, showing promising results
Keywords
audio-visual systems; feature extraction; fuzzy logic; pattern clustering; speaker recognition; visual databases; CUAVE database; automatic extraction; closed-set audio-visual system; color space transformation; fuzzy-based c-means clustering technique; geometric lip feature; multimodal speaker identification; visual cue; visual information; Audio databases; Data mining; Face detection; Feature extraction; Mouth; Robustness; Spatial databases; Speech recognition; System performance; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo, 2006 IEEE International Conference on
Conference_Location
Toronto, Ont.
Print_ISBN
1-4244-0366-7
Electronic_ISBN
1-4244-0367-7
Type
conf
DOI
10.1109/ICME.2006.262594
Filename
4036561
Link To Document