• DocumentCode
    2951565
  • Title

    Automatic Extraction of Geometric Lip Features with Application to Multi-Modal Speaker Identification

  • Author

    Arsic, Ivana ; Vilagut, Roger ; Thiran, Jean-Philippe

  • Author_Institution
    Signal Process. Inst., Ecole Polytech. Fed. de Lausanne
  • fYear
    2006
  • fDate
    9-12 July 2006
  • Firstpage
    161
  • Lastpage
    164
  • Abstract
    In this paper we consider the problem of automatic extraction of the geometric lip features for the purposes of multi-modal speaker identification. The use of visual information from the mouth region can be of great importance for improving the speaker identification system performance in noisy conditions. We propose a novel method for automated lip features extraction that utilizes color space transformation and a fuzzy-based c-means clustering technique. Using the obtained visual cues closed-set audio-visual speaker identification experiments are performed on the CUAVE database, showing promising results
  • Keywords
    audio-visual systems; feature extraction; fuzzy logic; pattern clustering; speaker recognition; visual databases; CUAVE database; automatic extraction; closed-set audio-visual system; color space transformation; fuzzy-based c-means clustering technique; geometric lip feature; multimodal speaker identification; visual cue; visual information; Audio databases; Data mining; Face detection; Feature extraction; Mouth; Robustness; Spatial databases; Speech recognition; System performance; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2006 IEEE International Conference on
  • Conference_Location
    Toronto, Ont.
  • Print_ISBN
    1-4244-0366-7
  • Electronic_ISBN
    1-4244-0367-7
  • Type

    conf

  • DOI
    10.1109/ICME.2006.262594
  • Filename
    4036561