• DocumentCode
    2769403
  • Title

    A compact semidefinite programming (SDP) formulation for large margin estimation of HMMS in speech recognition

  • Author

    Yin, Yan ; Jiang, Hui

  • Author_Institution
    York Univ., Toronto
  • fYear
    2007
  • fDate
    9-13 Dec. 2007
  • Firstpage
    312
  • Lastpage
    317
  • Abstract
    In this paper, we study a new semidefinite programming (SDP) formulation to improve optimization efficiency for large margin estimation (LME) of HMMs in speech recognition. We re-formulate the same LME problem as smaller-scale SDP problems to speed up the SDP-based LME training, especially for large model sets. In the new formulation, instead of building the SDP problem from a single huge variable matrix, we consider to formulate the SDP problem based on many small independent variable matrices, each of which is built separately from a Gaussian mean vector. Moreover, we propose to further decompose feature vectors and Gaussian mean vectors according to static, delta and accelerate components to build even more compact variable matrices. This method can significantly reduce the total number of free variables and result in much smaller SDP problem even for the same model set. The proposed new LME/SDP methods have been evaluated on a connected digit string recognition task using the TIDIGITS database. Experimental results show that it can significantly improve optimization efficiency (about 30-50 times faster for large model sets) and meanwhile it can provide slightly better optimization accuracy and recognition performance than our previous SDP formulation.
  • Keywords
    Gaussian processes; hidden Markov models; learning (artificial intelligence); mathematical programming; matrix algebra; speech recognition; vectors; Gaussian mean vector; Gaussian mixture HMM; SDP formulation; SDP-based LME training; feature vector decomposition; independent variable matrices; large margin estimation; optimization efficiency; semidefinite programming; smaller-scale SDP problems; speech recognition; Acceleration; Computer science; Constraint optimization; Databases; Hidden Markov models; Matrix decomposition; Maximum likelihood estimation; Minimax techniques; Optimization methods; Speech recognition; Automatic Speech Recognition; Convex Optimization; Convex Relaxation; Discriminative training; Large Margin Estimation (LME); Semidefinite Programming (SDP);
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
  • Conference_Location
    Kyoto
  • Print_ISBN
    978-1-4244-1746-9
  • Electronic_ISBN
    978-1-4244-1746-9
  • Type

    conf

  • DOI
    10.1109/ASRU.2007.4430130
  • Filename
    4430130