• DocumentCode
    3752124
  • Title

    Distance metric learning for kernel density-based acoustic model under limited training data conditions

  • Author

    Van Hai Do;Xiong Xiao;Eng Siong Chng;Haizhou Li

  • Author_Institution
    School of Computer Engineering, Nanyang Technological University, Singapore
  • fYear
    2015
  • Firstpage
    54
  • Lastpage
    58
  • Abstract
    Kernel density model works well for limited training data in acoustic modeling. In this paper, we improve the kernel density-based acoustic model for low resource language speech recognition. In our previous study, we demonstrated the effectiveness of the kernel density-based acoustic model on discriminative features such as cross-lingual bottleneck features. In this paper, we propose to learn a Mahalanobis-based distance, which is equivalent to a full rank linear feature transformation, to minimize training data frame classification error. Experimental results on the Wall Street Journal (WSJ) task show that the proposed Mahalanobis-based distance learning results in significant improvements over the Euclidean distance. The kernel density acoustic model with the Mahalanobis-based distance also outperforms deep neural network acoustic model significantly in limited training data cases.
  • Keywords
    "Hidden Markov models","Kernel","Training data","Acoustics","Measurement","Data models","Training"
  • Publisher
    ieee
  • Conference_Titel
    Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2015 Asia-Pacific
  • Type

    conf

  • DOI
    10.1109/APSIPA.2015.7415373
  • Filename
    7415373