Title :
An Mandarin Pronunciation Quality Assessment System Using Two Kinds of Acoustic Models
Author :
Ge, Fengpei ; Lu, Li ; Liu, Changliang ; Pan, Fuping ; Dong, Bin ; Yan, Yonghong
Author_Institution :
ThinkIT Speech Lab., Chinese Acad. of Sci. Beijing, Beijing, China
Abstract :
This paper presents our Mandarin pronunciation quality assessment system for the examination of Putonghua Shuiping Kaoshi (PSK) and investigates some measures to improve the assessment accuracy. In this paper, a selective speaker adaptation method is studied. In the adaptation module, we select well pronounced speech as the adaptation data, and adopt Maximum Likelihood Linear Regression (MLLR) to update the speaker-independent (SI) acoustic model. Besides the triphone based acoustic model, the monophone based acoustic model is also applied to our system. Further improvements are obtained by combining posterior probabilities computed with triphone and monophone based acoustic models using Support Vector Machine (SVM) to assess the goodness of pronunciations. The experiment results show that the average correlation coefficient (ACC) between machine and the human scores achieves 0.8549, almost equivalent to ACC between different experts. The improved system achieves usable performance in actual applications.
Keywords :
maximum likelihood estimation; regression analysis; speech processing; support vector machines; Mandarin pronunciation quality assessment system; Putonghua Shuiping Kaoshi; acoustic models; average correlation coefficient; maximum likelihood linear regression; monophone based acoustic model; posterior probabilities; selective speaker adaptation; speaker-independent acoustic model; support vector machine; triphone based acoustic model; Acoustic measurements; Decoding; Hidden Markov models; Humans; Maximum likelihood linear regression; Natural languages; Quality assessment; Speech; Support vector machines; Viterbi algorithm;
Conference_Titel :
Research Challenges in Computer Science, 2009. ICRCCS '09. International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3927-0
Electronic_ISBN :
978-1-4244-5410-5
DOI :
10.1109/ICRCCS.2009.25