• DocumentCode
    1783992
  • Title
    A lip geometry approach for feature-fusion based audio-visual speech recognition
  • Author
    Ibrahim, M.Z. ; Mulvaney, D.J.
  • Author_Institution
    Sch. of Electron., Electr. & Syst. Eng., Loughborough Univ., Loughborough, UK
  • fYear
    2014
  • fDate
    21-23 May 2014
  • Firstpage
    644
  • Lastpage
    647
  • Abstract
    This paper describes a feature-fusion audio-visual speech recognition (AVSR) system that extracts lip geometry from the mouth region using a combination of a skin color filter, border following and a convex hull, and performs classification using a hidden Markov model. By defining a small number of highly descriptive geometrical features relevant to the recognition task, the approach avoids the poor scalability (termed the 'curse of dimensionality') that is often associated with feature-fusion AVSR methods. The paper compares the new approach with conventional appearance-based methods, namely the discrete cosine transform and principal component analysis, operating under simulated ambient noise conditions that affect the spoken phrases. The experimental results demonstrate that, in the presence of audio noise, the geometrical method significantly improves speech recognition accuracy compared with appearance-based approaches, despite requiring significantly fewer features.
  • Keywords
    geometry; hidden Markov models; speech recognition; AVSR system; DCT; HMM; PCA; appearance-based methods; audio noise; border following; convex hull; discrete cosine transform; feature-fusion based audio-visual speech recognition; hidden Markov model; lip geometry approach; mouth region; principal component analysis techniques; simulated ambient noise conditions; skin color filter; spoken phrases; Feature extraction; Hidden Markov models; Mouth; Signal to noise ratio; Speech recognition; Visualization; Lip geometry; OpenCV; audio-visual speech recognition; feature fusion;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Communications, Control and Signal Processing (ISCCSP), 2014 6th International Symposium on
  • Conference_Location
    Athens
  • Type
    conf
  • DOI
    10.1109/ISCCSP.2014.6877957
  • Filename
    6877957
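
The abstract's geometric pipeline (convex hull of the lip boundary, a few geometric features, feature-level fusion with audio) can be illustrated with a minimal sketch. This is not the authors' implementation: the skin color filter and border-following steps are assumed to have already produced a list of boundary points, the chosen features (width, height, area) are hypothetical stand-ins for the paper's unspecified feature set, and the names `convex_hull`, `lip_features` and `fuse` are invented for this illustration. Fusion is shown as plain concatenation, the simplest form of feature fusion.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def _cross(o: Point, a: Point, b: Point) -> float:
    """Z-component of the cross product (a - o) x (b - o)."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def convex_hull(points: Sequence[Point]) -> List[Point]:
    """Convex hull via Andrew's monotone chain; the hull vertices are
    returned in a consistent winding order, starting at the leftmost point."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower: List[Point] = []
    for p in pts:
        while len(lower) >= 2 and _cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    upper: List[Point] = []
    for p in reversed(pts):
        while len(upper) >= 2 and _cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    # The last point of each chain is the first point of the other.
    return lower[:-1] + upper[:-1]


def lip_features(hull: Sequence[Point]) -> List[float]:
    """Hypothetical geometric features: hull width, height and area."""
    xs = [x for x, _ in hull]
    ys = [y for _, y in hull]
    width = max(xs) - min(xs)
    height = max(ys) - min(ys)
    # Shoelace formula over consecutive hull vertices (wrapping around).
    ring = list(hull)
    pairs = zip(ring, ring[1:] + ring[:1])
    area = 0.5 * abs(sum(x1 * y2 - x2 * y1 for (x1, y1), (x2, y2) in pairs))
    return [width, height, area]


def fuse(audio: Sequence[float], visual: Sequence[float]) -> List[float]:
    """Feature-level fusion by concatenating the two feature vectors."""
    return list(audio) + list(visual)


# Example: boundary points of a toy mouth region, with one interior point.
boundary = [(0, 0), (4, 0), (4, 2), (0, 2), (2, 1)]
visual = lip_features(convex_hull(boundary))
fused = fuse([0.1, 0.2], visual)  # audio values are placeholders
```

Because the visual vector here has only three entries, the fused vector stays short, which is the scalability argument the abstract makes against high-dimensional appearance-based features.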