• DocumentCode
    1125045
  • Title

    Information Theoretic Feature Extraction for Audio-Visual Speech Recognition

  • Author

    Gurban, Mihai ; Thiran, Jean-Philippe

  • Author_Institution
    Signal Process. Lab. (LTS5), Ecole Polytech. Federate de Lausanne, Ecublens, Switzerland
  • Volume
    57
  • Issue
    12
  • fYear
    2009
  • Firstpage
    4765
  • Lastpage
    4776
  • Abstract
    The problem of feature selection has been thoroughly analyzed in the context of pattern classification, with the purpose of avoiding the curse of dimensionality. However, in the context of multimodal signal processing, this problem has been studied less. Our approach to feature extraction is based on information theory, with an application on multimodal classification, in particular audio-visual speech recognition. Contrary to previous work in information theoretic feature selection applied to multimodal signals, our proposed methods penalize features for their redundancy, achieving more compact feature sets and better performance. We propose two greedy selection algorithms, one that penalizes a proportion of feature redundancy, while the other uses conditional mutual information as an evaluation measure, for the selection of visual features for audio-visual speech recognition. Our features perform better than linear discriminant analysis, the most usual transform for dimensionality reduction in the field, across a wide range of dimensionality values and combined with audio at different quality levels.
  • Keywords
    audio-visual systems; feature extraction; greedy algorithms; image recognition; signal classification; speech recognition; audio quality level; audio-visual speech recognition; conditional mutual information; dimensionality reduction; feature redundancy; greedy selection algorithms; information theoretic feature extraction; multimodal classification; multimodal signal processing; pattern classification; Audio–visual speech recognition; feature selection; mutual information;
  • fLanguage
    English
  • Journal_Title
    Signal Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1053-587X
  • Type

    jour

  • DOI
    10.1109/TSP.2009.2026513
  • Filename
    5153314