DocumentCode
3752124
Title
Distance metric learning for kernel density-based acoustic model under limited training data conditions
Author
Van Hai Do;Xiong Xiao;Eng Siong Chng;Haizhou Li
Author_Institution
School of Computer Engineering, Nanyang Technological University, Singapore
fYear
2015
Firstpage
54
Lastpage
58
Abstract
Kernel density model works well for limited training data in acoustic modeling. In this paper, we improve the kernel density-based acoustic model for low resource language speech recognition. In our previous study, we demonstrated the effectiveness of the kernel density-based acoustic model on discriminative features such as cross-lingual bottleneck features. In this paper, we propose to learn a Mahalanobis-based distance, which is equivalent to a full rank linear feature transformation, to minimize training data frame classification error. Experimental results on the Wall Street Journal (WSJ) task show that the proposed Mahalanobis-based distance learning results in significant improvements over the Euclidean distance. The kernel density acoustic model with the Mahalanobis-based distance also outperforms deep neural network acoustic model significantly in limited training data cases.
Keywords
"Hidden Markov models","Kernel","Training data","Acoustics","Measurement","Data models","Training"
Publisher
ieee
Conference_Titel
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2015 Asia-Pacific
Type
conf
DOI
10.1109/APSIPA.2015.7415373
Filename
7415373
Link To Document