• DocumentCode
    3133284
  • Title

    Margin-enhanced maximum mutual information estimation for hidden Markov models

  • Author

    Kim, Sungwoong ; Yun, Sungrack ; Yoo, Chang D.

  • Author_Institution
    Sch. of Electr. Eng. & Comput. Sci., Korea Adv. Inst. of Sci. & Technol., Daejeon, South Korea
  • fYear
    2009
  • fDate
    5-8 July 2009
  • Firstpage
    1347
  • Lastpage
    1351
  • Abstract
    A discriminative training algorithm to estimate continuous-density hidden Markov model (CDHMM) for automatic speech recognition is considered. The algorithm is based on the criterion, called margin-enhanced maximum mutual information (MEMMI), and it estimates the CDHMM parameters by maximizing the weighted sum of the maximum mutual information objective function and the large margin objective function. The MEMMI is motivated by the criterion used in such classifier as the soft margin support vector machine that maximizes the weighted sum of the empirical risk function and the margin-related generalization function. The algorithm is an iterative procedure, and at each stage, it updates the parameters by placing different weights on the utterances according to their log likelihood margins: incorrectly-classified (negative margin) utterances are emphasized more than correctly-classified utterances. The MEMMI leads to a simple objective function that can be optimized easily by a gradient ascent algorithm maintaining a probabilistic model. Experimental results show that the recognition accuracy of the MEMMI is better than other discriminative training criteria, such as the approximated maximum mutual information (AMMI), the maximum classification error (MCE), and the soft large margin estimation (SLME) on the TIDIGITS database.
  • Keywords
    gradient methods; hidden Markov models; pattern classification; speech recognition; MEMMI; TIDIGITS database; approximated maximum mutual information; automatic speech recognition; continuous-density hidden Markov model; discriminative training algorithm; empirical risk function; gradient ascent algorithm; iterative procedure; large margin objective function; log likelihood margin; margin-enhanced maximum mutual information estimation; margin-related generalization function; maximum classification error; maximum mutual information objective function; probabilistic model; soft large margin estimation; soft margin support vector machine; Automatic speech recognition; Databases; Error analysis; Hidden Markov models; Industrial electronics; Iterative algorithms; Mutual information; Parameter estimation; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Industrial Electronics, 2009. ISIE 2009. IEEE International Symposium on
  • Conference_Location
    Seoul
  • Print_ISBN
    978-1-4244-4347-5
  • Electronic_ISBN
    978-1-4244-4349-9
  • Type

    conf

  • DOI
    10.1109/ISIE.2009.5221315
  • Filename
    5221315