Automatic Extraction of Geometric Lip Features with Application to Multi-Modal Speaker Identification

Author

Arsic, Ivana ; Vilagut, Roger ; Thiran, Jean-Philippe

Author_Institution

Signal Process. Inst., Ecole Polytech. Fed. de Lausanne

fYear

2006

fDate

9-12 July 2006

Firstpage

161

Lastpage

164

Abstract

In this paper we consider the problem of automatic extraction of the geometric lip features for the purposes of multi-modal speaker identification. The use of visual information from the mouth region can be of great importance for improving the speaker identification system performance in noisy conditions. We propose a novel method for automated lip features extraction that utilizes color space transformation and a fuzzy-based c-means clustering technique. Using the obtained visual cues closed-set audio-visual speaker identification experiments are performed on the CUAVE database, showing promising results

Keywords

audio-visual systems; feature extraction; fuzzy logic; pattern clustering; speaker recognition; visual databases; CUAVE database; automatic extraction; closed-set audio-visual system; color space transformation; fuzzy-based c-means clustering technique; geometric lip feature; multimodal speaker identification; visual cue; visual information; Audio databases; Data mining; Face detection; Feature extraction; Mouth; Robustness; Spatial databases; Speech recognition; System performance; Testing;

fLanguage

English

Publisher

ieee

Conference_Titel

Multimedia and Expo, 2006 IEEE International Conference on

Conference_Location

Toronto, Ont.

Print_ISBN

1-4244-0366-7

Electronic_ISBN

1-4244-0367-7

Type

conf

DOI

10.1109/ICME.2006.262594

Filename

4036561