DocumentCode
3427371
Title
Detecting tone errors in continuous Mandarin speech
Author
Zhang, Yan-Bin ; Chu, Min ; Huang, Chao ; Liang, Man-Gui
Author_Institution
Inf. Technol. Inst., Beijing Jiaotong Univ., Beijing
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
5065
Lastpage
5068
Abstract
This paper proposes a new approach for detecting tone errors in continuous Mandarin speech. In the training phase, tone variations are modeled with context-depended MSD-HMM which considers six contextual factors instead of two in traditional triphone HMM. In the evaluation phase, the goodness of tone pronunciation is measured by Kullback-Leibler divergence (KLD) between the expected tone model and the most representative tone model. When the KLD between the two models is larger than a threshold, the tone is detected as a pronunciation error. In the ROC curve, we get the equal error rate at 2.6%.
Keywords
hidden Markov models; natural language processing; speech processing; Kullback-Leibler divergence; MSD-HMM; ROC curve; continuous Mandarin speech; multi-space distribution-hidden Markov models; tone error detection; tone model; tone pronunciation; tone variations; Asia; Chaos; Computer errors; Context modeling; Error analysis; Hidden Markov models; Information technology; Natural languages; Phase measurement; Speech; Context Depended Tone Model (CDTM); Kullback-Leibler Divergence (KLD); Tone Error Detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518797
Filename
4518797
Link To Document