DocumentCode :
3427371
Title :
Detecting tone errors in continuous Mandarin speech
Author :
Zhang, Yan-Bin ; Chu, Min ; Huang, Chao ; Liang, Man-Gui
Author_Institution :
Inf. Technol. Inst., Beijing Jiaotong Univ., Beijing
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
5065
Lastpage :
5068
Abstract :
This paper proposes a new approach for detecting tone errors in continuous Mandarin speech. In the training phase, tone variations are modeled with context-dependent MSD-HMMs, which consider six contextual factors instead of the two used in a traditional triphone HMM. In the evaluation phase, the goodness of tone pronunciation is measured by the Kullback-Leibler divergence (KLD) between the expected tone model and the most representative tone model. When the KLD between the two models exceeds a threshold, the tone is flagged as a pronunciation error. On the ROC curve, the equal error rate is 2.6%.
Keywords :
hidden Markov models; natural language processing; speech processing; Kullback-Leibler divergence; MSD-HMM; ROC curve; continuous Mandarin speech; multi-space distribution-hidden Markov models; tone error detection; tone model; tone pronunciation; tone variations; Asia; Chaos; Computer errors; Context modeling; Error analysis; Hidden Markov models; Information technology; Natural languages; Phase measurement; Speech; Context Depended Tone Model (CDTM); Kullback-Leibler Divergence (KLD); Tone Error Detection;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518797
Filename :
4518797
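Illustrative sketch :
The decision rule described in the abstract (flag a tone when the KLD between the expected and the most representative tone model exceeds a threshold) can be sketched as follows. This is not the authors' implementation: the paper uses MSD-HMMs, whereas this sketch assumes each tone model has been reduced to a single diagonal-covariance Gaussian so that the KLD has a closed form. The function names, the direction KL(expected || representative), and the threshold value are all hypothetical.

```python
import math


def kld_diag_gaussian(mean_p, var_p, mean_q, var_q):
    """Closed-form KL(P || Q), in nats, for two diagonal-covariance Gaussians.

    Assumption: each tone model is summarized by one Gaussian; the paper's
    MSD-HMMs are richer and have no such simple closed form.
    """
    total = 0.0
    for mp, vp, mq, vq in zip(mean_p, var_p, mean_q, var_q):
        total += 0.5 * (math.log(vq / vp) + (vp + (mp - mq) ** 2) / vq - 1.0)
    return total


def is_tone_error(expected, representative, threshold):
    """Return True when the divergence between the two models exceeds threshold.

    `expected` and `representative` are (mean, variance) pairs of equal-length
    sequences; `threshold` is a hypothetical tuning parameter.
    """
    kld = kld_diag_gaussian(expected[0], expected[1],
                            representative[0], representative[1])
    return kld > threshold


# Hypothetical one-dimensional models with unit variance.
expected_model = ([0.0], [1.0])
matching_model = ([0.0], [1.0])
shifted_model = ([1.0], [1.0])

same_kld = kld_diag_gaussian(*expected_model, *matching_model)
shift_kld = kld_diag_gaussian(*expected_model, *shifted_model)
```

Sweeping the threshold over a labeled test set and recording false-accept and false-reject rates at each value is one way to trace an ROC curve of the kind the abstract refers to; the equal error rate is the point where those two rates coincide.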