• DocumentCode
    2769275
  • Title

    A study on soft margin estimation for LVCSR

  • Author

    Li, Jinyu ; Yan, Zhi-Jie ; Lee, Chin-Hui ; Wang, Ren-Hua

  • Author_Institution
    Georgia Inst. of Technol., Atlanta
  • fYear
    2007
  • fDate
    9-13 Dec. 2007
  • Firstpage
    268
  • Lastpage
    271
  • Abstract
    We extend our previous work on soft margin estimation (SME) to large vocabulary continuous speech recognition in two aspects. The first is to use the extended Baum-Welch method to replace the conventional generalized probabilistic descent algorithm for optimization. The second is to compare SME with minimum classification error (MCE) training with the same implementation details in order to show that it is indeed the margin component in the objective function with margin-based utterance and frame selection that contributes to the success of SME. Tested on the 5 k-word Wall Street Journal task, all the SME methods work better than MCE. The best SME approach achieves a relative word error rate reduction of about 19% over our best baseline performance. This enhancement can only be demonstrated because of our use of margin-based objective function and the extended Baum-Welch parameter optimization method.
  • Keywords
    parameter estimation; speech recognition; vocabulary; Baum Welch method; LVCSR; extended Baum Welch parameter optimization method; large vocabulary continuous speech recognition; margin based objective function; minimum classification error; probabilistic descent algorithm; soft margin estimation; Acoustic testing; Automatic speech recognition; Error analysis; Hidden Markov models; Lattices; Maximum likelihood estimation; Mutual information; Optimization methods; Speech recognition; Vocabulary; discriminative training; extended Baum-Welch; hidden Markov model; lattice; soft margin estimation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
  • Conference_Location
    Kyoto
  • Print_ISBN
    978-1-4244-1746-9
  • Electronic_ISBN
    978-1-4244-1746-9
  • Type

    conf

  • DOI
    10.1109/ASRU.2007.4430122
  • Filename
    4430122