Title :
Exploring nuisance attribute projection and score normalization for GLDS-SVM based automatic mispronunciation detection method
Author :
Li, Hongyan ; Huang, Shen ; Wang, ShiJin ; Liang, JiaEn ; Xu, Bo
Author_Institution :
Inst. of Autom., Chinese Acad. of Sci., Beijing, China
Abstract :
In the task of mispronunciation detection, the cross-speaker degradation and some other confusing nuisances are the challenging problems demanding prompt solution. In this paper, we will attempt to remove the non-pronunciation variations in the GLDS-SVM expansion space by using nuisance attribute projection strategy, in order to increase the separating capacity between different phoneme instances. Moreover, different kinds of score normalization methods with softmax, posterior probability vector (PPV), Z-norm and T-norm are comparatively discussed. The experiments on three kinds of speech corpora demonstrate the effectiveness of the above methods, and the performance improvement is not very significant, but sustainable.
Keywords :
probability; speaker recognition; support vector machines; GLDS-SVM expansion; cross-speaker degradation; generalized linear discriminant sequence kernel; mispronunciation detection; nuisance attribute projection; posterior probability vector; score normalization; speech corpora; Computational modeling; Kernel; Speaker recognition; Speech; Support vector machines; Testing; Training; Automatic mispronunciation detection; generalized linear discriminant sequence; nuisance attribute projection; score normalization; support vector machines;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947646