• DocumentCode
    705266
  • Title

    Class-specific classifiers in audio-visual speech recognition

  • Author

    Estellers, Virginia ; Baggenstoss, Paul M. ; Thiran, Jean-Philippe

  • Author_Institution
    Signal Process. Lab. (LTS5), Ecole Polytech. Fed. de Lausanne, Lausanne, Switzerland
  • fYear
    2010
  • fDate
    23-27 Aug. 2010
  • Firstpage
    1998
  • Lastpage
    2002
  • Abstract
    In this paper, class-specific classifiers for audio, visual and audiovisual speech recognition systems are developed and compared with traditional Bayes classifiers. We use state-of-the-art feature extraction methods and develop traditional and class-specific classifiers for speech recognition, showing the benefits of a class-specific method on each modality for speaker dependent and independent set-ups. Experiments with a reference audio-visual database show a general increase in the systems performance by the introduction of class-specific techniques on both visual and audio-visual modalities.
  • Keywords
    audio-visual systems; feature extraction; pattern classification; speech recognition; audio-visual modality; audio-visual speech recognition; class specific classifier; feature extraction method; reference audio-visual database; speaker dependent setup; speaker independent setup; Hidden Markov models; Mel frequency cepstral coefficient; Principal component analysis; Signal to noise ratio; Speech recognition; Transforms; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2010 18th European
  • Conference_Location
    Aalborg
  • ISSN
    2219-5491
  • Type

    conf

  • Filename
    7096539