• DocumentCode
    1326232
  • Title

    Margin-Based Discriminative Training for String Recognition

  • Author

    Heigold, Georg ; Dreuw, Philippe ; Hahn, Stefan ; Schlüter, Ralf ; Ney, Hermann

  • Author_Institution
    Comput. Sci. Dept., RWTH Aachen Univ., Aachen, Germany
  • Volume
    4
  • Issue
    6
  • fYear
    2010
  • Firstpage
    917
  • Lastpage
    925
  • Abstract
    Typical training criteria for string recognition like for example minimum phone error (MPE) and maximum mutual information (MMI) in speech recognition are based on a (regularized) loss function. In contrast, large-margin classifiers-the de-facto standard in machine learning-maximize the separation margin. An additional loss term penalizes misclassified samples. This paper shows how typical training criteria like for example MPE or MMI can be extended to incorporate the margin concept, and that such modified training criteria are smooth approximations to support vector machines with the respective loss function. The proposed approach takes advantage of the generalization bounds of large-margin classifiers while keeping the efficient framework for conventional discriminative training. This allows us to directly evaluate the utility of the margin term for string recognition. Experimental results are presented using the proposed modified training criteria for different tasks from speech recognition (including large-vocabulary continuous speech recognition tasks trained on up to 1500-h audio data), part-of-speech tagging, and handwriting recognition.
  • Keywords
    approximation theory; handwriting recognition; learning (artificial intelligence); pattern classification; smoothing methods; speech recognition; string matching; support vector machines; discriminative training; handwriting recognition; large-margin classifier; loss function; machine learning; margin-based discriminative training; part-of-speech tagging; smooth approximation; speech recognition; string recognition; support vector machine; Approximation methods; Handwriting recognition; Hidden Markov models; Optimization; Parameter estimation; Speech recognition; Training; Handwriting recognition; large-vocabulary continuous speech recognition; margin-based training; part-of-speech tagging;
  • fLanguage
    English
  • Journal_Title
    Selected Topics in Signal Processing, IEEE Journal of
  • Publisher
    ieee
  • ISSN
    1932-4553
  • Type

    jour

  • DOI
    10.1109/JSTSP.2010.2076110
  • Filename
    5575382