• DocumentCode
    3484589
  • Title

    Frame-level AnyBoost for LVCSR with the MMI Criterion

  • Author

    Tachibana, Ryuki ; Fukuda, Takashi ; Chaudhari, Upendra ; Ramabhadran, Bhuvana ; Zhan, Puming

  • Author_Institution
    IBM Res. - Tokyo, Tokyo, Japan
  • fYear
    2011
  • fDate
    11-15 Dec. 2011
  • Firstpage
    12
  • Lastpage
    17
  • Abstract
    This paper propose a variant of AnyBoost for a large vocabulary continuous speech recognition (LVCSR) task. AnyBoost is an efficient algorithm to train an ensemble of weak learners by gradient descent for an objective function.We present a novel training procedure that trains acoustic models via the MMI criterion using data that is weighted proportional to the summation of the posterior functions of previous round of weak learners. Optimized for system combination by n-best ROVER at runtime, data weights for a new weak learner are computed as a weighted summation of posteriors of previous weak learners. We compare a frame-based version and a sentence-based version of our proposed algorithm with a frame-based AdaBoost algorithm. We will present results on a voice search task trained with different amounts of data with gains of 5.1% to 7.5% relative in WER can be obtained by three rounds of boosting.
  • Keywords
    learning (artificial intelligence); speech recognition; LVCSR; MMI criterion; frame level AnyBoost; large vocabulary continuous speech recognition; n-best ROVER; weak learners; Acoustics; Boosting; Computational modeling; Data models; Lattices; Training; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
  • Conference_Location
    Waikoloa, HI
  • Print_ISBN
    978-1-4673-0365-1
  • Electronic_ISBN
    978-1-4673-0366-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2011.6163897
  • Filename
    6163897