Title :
Detecting tone errors in continuous Mandarin speech
Author :
Zhang, Yan-Bin ; Chu, Min ; Huang, Chao ; Liang, Man-Gui
Author_Institution :
Inf. Technol. Inst., Beijing Jiaotong Univ., Beijing
fDate :
March 31 2008-April 4 2008
Abstract :
This paper proposes a new approach for detecting tone errors in continuous Mandarin speech. In the training phase, tone variations are modeled with context-dependent MSD-HMMs, which consider six contextual factors instead of the two used in traditional triphone HMMs. In the evaluation phase, the goodness of tone pronunciation is measured by the Kullback-Leibler divergence (KLD) between the expected tone model and the most representative tone model. When the KLD between the two models exceeds a threshold, the tone is flagged as a pronunciation error. On the ROC curve, the equal error rate is 2.6%.
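The thresholding step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the paper computes the KLD between MSD-HMM tone models, whereas this example assumes each tone model is reduced to a single diagonal-covariance Gaussian so that the divergence has a closed form. The function names, the direction of the divergence (expected model first), and the threshold value are all hypothetical.

```python
import math


def gaussian_kld(mean_p, var_p, mean_q, var_q):
    """KL(p || q) for two diagonal-covariance Gaussians (closed form).

    Simplifying assumption: each tone model is one Gaussian,
    not an MSD-HMM as in the paper.
    """
    total = 0.0
    for mp, vp, mq, vq in zip(mean_p, var_p, mean_q, var_q):
        total += math.log(vq / vp) + (vp + (mp - mq) ** 2) / vq - 1.0
    return 0.5 * total


def is_tone_error(expected, best, threshold):
    """Flag a tone as mispronounced when the divergence between the
    expected model and the most representative model exceeds the
    threshold. Each model is a (means, variances) pair."""
    return gaussian_kld(*expected, *best) > threshold


# Hypothetical one-dimensional models and threshold, for illustration only.
expected_model = ([0.0], [1.0])
best_model = ([1.0], [1.0])
print(is_tone_error(expected_model, best_model, threshold=0.3))
```

Sweeping the threshold over a labeled test set trades false alarms against missed errors, which is how an ROC curve and an equal error rate such as the one reported above would be obtained.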
Keywords :
hidden Markov models; natural language processing; speech processing; Kullback-Leibler divergence; MSD-HMM; ROC curve; continuous Mandarin speech; multi-space distribution-hidden Markov models; tone error detection; tone model; tone pronunciation; tone variations; Asia; Chaos; Computer errors; Context modeling; Error analysis; Hidden Markov models; Information technology; Natural languages; Phase measurement; Speech; Context Dependent Tone Model (CDTM); Kullback-Leibler Divergence (KLD); Tone Error Detection
Conference_Title :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518797