• DocumentCode
    454728
  • Title

    Limited Training Data Robust Speech Recognition Using Kernel-Based Acoustic Models

  • Author

    Schafföner, Martin ; Krüger, Sven E. ; Andelic, Edin ; Katz, Marcel ; Wendemuth, Andreas

  • Author_Institution
    Dept. of Electr. Eng. & Inf. Technol., Otto-von-Guericke-Univ., Magdeburg
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    Contemporary automatic speech recognition uses hidden-Markov-models (HMMs) to model the temporal structure of speech where one HMM is used for each phonetic unit. The states of the HMMs are associated with state-conditional probability density functions (PDFs) which are typically realized using mixtures of Gaussian PDFs (GMMs). Training of GMMs is error-prone especially if training data size is limited. This paper evaluates two new methods of modeling state-conditional PDFs using probabilistically interpreted support vector machines and kernel Fisher discriminants. Extensive experiments on the RMI (P. Price et al., 1988) corpus yield substantially improved recognition rates compared to traditional GMMs. Due to their generalization ability, our new methods reduce the word error rate by up to 13% using the complete training set and up to 33% when the training set size is reduced
  • Keywords
    acoustics; probability; speech recognition; support vector machines; automatic speech recognition; kernel Fisher discriminants; kernel-based acoustic models; robust speech recognition; support vector machines; word error rate; Automatic speech recognition; Hidden Markov models; Information technology; Kernel; Probability density function; Robustness; Speech recognition; Support vector machine classification; Support vector machines; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660226
  • Filename
    1660226