Class-specific classifiers in audio-visual speech recognition

Author

Estellers, Virginia ; Baggenstoss, Paul M. ; Thiran, Jean-Philippe

Author_Institution

Signal Process. Lab. (LTS5), Ecole Polytech. Fed. de Lausanne, Lausanne, Switzerland

fYear

2010

fDate

23-27 Aug. 2010

Firstpage

1998

Lastpage

2002

Abstract

In this paper, class-specific classifiers for audio, visual and audio-visual speech recognition systems are developed and compared with traditional Bayes classifiers. We use state-of-the-art feature extraction methods and develop traditional and class-specific classifiers for speech recognition, showing the benefits of a class-specific method on each modality for speaker-dependent and speaker-independent set-ups. Experiments with a reference audio-visual database show a general increase in system performance when class-specific techniques are introduced in both the visual and audio-visual modalities.

Keywords

audio-visual systems; feature extraction; pattern classification; speech recognition; audio-visual modality; audio-visual speech recognition; class specific classifier; feature extraction method; reference audio-visual database; speaker dependent setup; speaker independent setup; Hidden Markov models; Mel frequency cepstral coefficient; Principal component analysis; Signal to noise ratio; Speech recognition; Transforms; Visualization;

fLanguage

English

Publisher

IEEE

Conference_Titel

Signal Processing Conference, 2010 18th European

Conference_Location

Aalborg

ISSN

2219-5491

Type

conf

Filename

7096539