• DocumentCode
    1224350
  • Title

    Approximate Test Risk Bound Minimization Through Soft Margin Estimation

  • Author

    Li, Jinyu ; Yuan, Ming ; Lee, Chin-Hui

  • Author_Institution
    Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA
  • Volume
    15
  • Issue
    8
  • fYear
    2007
  • Firstpage
    2393
  • Lastpage
    2404
  • Abstract
    Inspired by the great success of margin-based classifiers, there is a trend to incorporate the margin concept into hidden Markov modeling for speech recognition. Several attempts based on margin maximization were proposed recently. In this paper, a new discriminative learning framework, called soft margin estimation (SME), is proposed for estimating the parameters of continuous-density hidden Markov models. The proposed method makes direct use of the successful ideas of soft margin in support vector machines to improve generalization capability and decision feedback learning in minimum classification error training to enhance model separation in classifier design. SME is illustrated from a perspective of statistical learning theory. By including a margin in formulating the SME objective function, SME is capable of directly minimizing an approximate test risk bound. Frame selection, utterance selection, and discriminative separation are unified into a single objective function that can be optimized using the generalized probabilistic descent algorithm. Tested on the TIDIGITS connected digit recognition task, the proposed SME approach achieves a string accuracy of 99.43%. On the 5 k-word Wall Street Journal task, SME obtains relative word error rate reductions of about 10% over our best baseline results in different experimental configurations. We believe this is the first attempt to show the effectiveness of margin-based acoustic modeling for large vocabulary continuous speech recognition in a hidden Markov model framework. Further improvements are expected because the approximate test risk bound minimization principle offers a flexible and rigorous framework to facilitate incorporation of new margin-based optimization criteria into hidden Markov model training.
  • Keywords
    feedback; generalisation (artificial intelligence); hidden Markov models; learning (artificial intelligence); minimisation; parameter estimation; signal classification; speech recognition; support vector machines; approximate test risk bound minimization; continuous-density hidden Markov models; decision feedback learning; generalization capability; minimum classification error training; optimization; parameter estimation; soft margin estimation; speech recognition; support vector machines; Error analysis; Feedback; Hidden Markov models; Machine learning; Parameter estimation; Speech recognition; Statistical learning; Support vector machine classification; Support vector machines; Testing; Discriminative training (DT); soft margin estimation (SME); statistical learning; test risk;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2007.906178
  • Filename
    4317566