• DocumentCode
    2703057
  • Title

    Approximate Test Risk Minimization Through Soft Margin Estimation

  • Author

    Jinyu Li ; Siniscalchi, Sabato Marco ; Chin-Hui Lee

  • Author_Institution
    Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
  • Volume
    4
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    In a recent study, we proposed soft margin estimation (SME) to learn parameters of continuous density hidden Markov models (HMMs). Our earlier experiments with connect digit recognition have shown that SME offers great advantages over other state-of-the-art discriminative training methods. In this paper, we illustrate SME from a perspective of statistical learning theory and show that by including a margin in formulating the SME objective function it is capable of directly minimizing the approximate test risk, while most other training methods intent to minimize only the empirical risks. We test SME on the 5k-word Wall Street Journal task, and find the proposed approach achieves a relative word error rate reduction of about 10% over our best baseline results in different experimental configurations. We believe this is the first attempt to show the effectiveness of margin-based acoustic modeling for large vocabulary continuous speech recognition. We also expect further performance improvements in the future because the approximate test risk minimization principle offers a flexible and yet rigorous framework to facilitate easy incorporation of new margin-based optimization criteria into HMM training.
  • Keywords
    hidden Markov models; risk analysis; speech recognition; statistical analysis; HMM; digit recognition; discriminative training methods; hidden Markov models; large vocabulary continuous speech recognition; margin-based acoustic modeling; risk minimization; soft margin estimation; statistical learning theory; word error rate reduction; Acoustic testing; Automatic speech recognition; Error analysis; Hidden Markov models; Lattices; Machine learning algorithms; Maximum likelihood estimation; Risk management; Speech recognition; Statistical learning; lattice; soft margin estimation; statistical learning; test risk;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.366997
  • Filename
    4218185