DocumentCode :
3412448
Title :
Tone pronunciation quality scoring of Mandarin multi-syllable words
Author :
Zhang, Junbo ; Wu, Hemin ; Yan, Yonghong
Author_Institution :
ThinkIT Speech Lab., CAS, Beijing, China
fYear :
2010
fDate :
24-28 Oct. 2010
Firstpage :
545
Lastpage :
548
Abstract :
This paper discusses tone pronunciation scoring for Mandarin multi-syllable words in Computer Assisted Language Learning (CALL) System. A commonly used tone evaluation method is using GMM to model various pitch sequence. Because the pattern of pitch sequence will change a lot in the multisyllable context, tone models trained on mono-tone database will not have good performance on multi-syllable speech. Scoring accuracy drops greatly due to tonal sandhi from mono-syllable to multi-syllable words. We proposed three major methods to solve the problem. The first is to train GMM with tri-syllable FO trace instead of mono-syllable´s. The second is not only to model FO contour´s trend, but also to model FO value, and we use normalization to make sure that FO values reflect tones. The third is to use linear regression to simulate the FO contour trend. Some minor improvements are also introduced. After these methods are taken, the tone recognition correct rate is improved from 41% to 82%.
Keywords :
natural language processing; speech processing; speech recognition; Mandarin multisyllable words; computer assisted language learning system; evaluation method; linear regression; mono-tone database; multisyllable context; multisyllable speech; pitch sequence; tone model; tone pronunciation quality scoring; tone recognition correct rate; Accuracy; Computational modeling; Equations; Hidden Markov models; Mathematical model; Speech; Speech recognition; CALL; FO; GMM; Pronunciation Quality; Tone Evaluation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing (ICSP), 2010 IEEE 10th International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-5897-4
Type :
conf
DOI :
10.1109/ICOSP.2010.5656359
Filename :
5656359
Link To Document :
بازگشت