DocumentCode :
3752242
Title :
Automatic assessment of non-native accent degrees using phonetic level posterior and duration features from multiple languages
Author :
Shushan Chen;Yiming Zhou;Ming Li
Author_Institution :
SYSU-CMU Joint Institute of Engineering, Sun Yat-Sen University, Guangzhou, China
fYear :
2015
Firstpage :
156
Lastpage :
159
Abstract :
This paper presents an automatic non-native accent assessment approach using phonetic level posterior and duration features. In this method, instead of using conventional MFCC trained Gaussian Mixture Models (GMM), we use phonetic phoneme states as tokens to calculate the posterior probability and zero-oder Baum-Welch statistics. Phoneme recognizers from five languages are employed to extract phonetic level features. It is shown that features based on these five languages´ phoneme recognizers are complementary for capturing non-native information and phoneme duration based features are most effective in this task. The final proposed fusion system achieved 0.6089 Spearman´s Correlation Coefficient on the test set, which outperformed the openSMILE baseline by 43.3%.
Keywords :
"Feature extraction","Speech","Probability","Mel frequency cepstral coefficient","Correlation","Training"
Publisher :
ieee
Conference_Titel :
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2015 Asia-Pacific
Type :
conf
DOI :
10.1109/APSIPA.2015.7415493
Filename :
7415493
Link To Document :
بازگشت