DocumentCode
3752242
Title
Automatic assessment of non-native accent degrees using phonetic level posterior and duration features from multiple languages
Author
Shushan Chen;Yiming Zhou;Ming Li
Author_Institution
SYSU-CMU Joint Institute of Engineering, Sun Yat-Sen University, Guangzhou, China
fYear
2015
Firstpage
156
Lastpage
159
Abstract
This paper presents an automatic non-native accent assessment approach using phonetic level posterior and duration features. In this method, instead of using conventional MFCC trained Gaussian Mixture Models (GMM), we use phonetic phoneme states as tokens to calculate the posterior probability and zero-oder Baum-Welch statistics. Phoneme recognizers from five languages are employed to extract phonetic level features. It is shown that features based on these five languages´ phoneme recognizers are complementary for capturing non-native information and phoneme duration based features are most effective in this task. The final proposed fusion system achieved 0.6089 Spearman´s Correlation Coefficient on the test set, which outperformed the openSMILE baseline by 43.3%.
Keywords
"Feature extraction","Speech","Probability","Mel frequency cepstral coefficient","Correlation","Training"
Publisher
ieee
Conference_Titel
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2015 Asia-Pacific
Type
conf
DOI
10.1109/APSIPA.2015.7415493
Filename
7415493
Link To Document