• DocumentCode
    3484603
  • Title

    Extending noise robust structured support vector machines to larger vocabulary tasks

  • Author

    Zhang, Shi-Xiong ; Gales, M.J.F.

  • Author_Institution
    Eng. Dept., Cambridge Univ., Cambridge, UK
  • fYear
    2011
  • fDate
    11-15 Dec. 2011
  • Firstpage
    18
  • Lastpage
    23
  • Abstract
    This paper describes a structured SVM framework suitable for noise-robust medium/large vocabulary speech recognition. Several theoretical and practical extensions to previous work on small vocabulary tasks are detailed. The joint feature space based on word models is extended to allow context-dependent triphone models to be used. By interpreting the structured SVM as a large margin log-linear model, illustrates that there is an implicit assumption that the prior of the discriminative parameter is a zero mean Gaussian. However, depending on the definition of likelihood feature space, a non-zero prior may be more appropriate. A general Gaussian prior is incorporated into the large margin training criterion in a form that allows the cutting plan algorithm to be directly applied. To further speed up the training process, 1-slack algorithm, caching competing hypothesis and parallelization strategies are also proposed. The performance of structured SVMs is evaluated on noise corrupted medium vocabulary speech recognition task: AURORA 4.
  • Keywords
    Gaussian processes; speech recognition; support vector machines; 1-slack algorithm; AURORA 4; SVM framework; caching competing hypothesis; context-dependent triphone models; cutting plan algorithm; discriminative parameter; general Gaussian prior; joint feature space; large margin log-linear model; large margin training criterion; likelihood feature space; medium-large vocabulary speech recognition; noise robust structured support vector machines; nonzero prior; parallelization strategies; word models; zero mean Gaussian; Hidden Markov models; Irrigation; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
  • Conference_Location
    Waikoloa, HI
  • Print_ISBN
    978-1-4673-0365-1
  • Electronic_ISBN
    978-1-4673-0366-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2011.6163898
  • Filename
    6163898