DocumentCode :
2181253
Title :
Exploring nuisance attribute projection and score normalization for GLDS-SVM based automatic mispronunciation detection method
Author :
Li, Hongyan ; Huang, Shen ; Wang, ShiJin ; Liang, JiaEn ; Xu, Bo
Author_Institution :
Inst. of Autom., Chinese Acad. of Sci., Beijing, China
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
5668
Lastpage :
5671
Abstract :
In the task of mispronunciation detection, the cross-speaker degradation and some other confusing nuisances are the challenging problems demanding prompt solution. In this paper, we will attempt to remove the non-pronunciation variations in the GLDS-SVM expansion space by using nuisance attribute projection strategy, in order to increase the separating capacity between different phoneme instances. Moreover, different kinds of score normalization methods with softmax, posterior probability vector (PPV), Z-norm and T-norm are comparatively discussed. The experiments on three kinds of speech corpora demonstrate the effectiveness of the above methods, and the performance improvement is not very significant, but sustainable.
Keywords :
probability; speaker recognition; support vector machines; GLDS-SVM expansion; cross-speaker degradation; generalized linear discriminant sequence kernel; mispronunciation detection; nuisance attribute projection; posterior probability vector; score normalization; speech corpora; Computational modeling; Kernel; Speaker recognition; Speech; Support vector machines; Testing; Training; Automatic mispronunciation detection; generalized linear discriminant sequence; nuisance attribute projection; score normalization; support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947646
Filename :
5947646
Link To Document :
بازگشت