Title :
Tone variation modeling for fluent Mandarin tone recognition based on clustering
Author_Institution :
Graduate Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Abstract :
Tone recognition for fluent Mandarin speech has always been a very difficult problem, because the complicated tone behavior is difficult to analyze. In this paper, a new method of modeling tone variation for fluent Mandarin tone recognition by clustering training data into few subsets and weighting the likelihood computed by inter-syllabic features (Lin et al. (2003)) is proposed. Experimental results indicate that the tone recognition accuracy can be improved significantly by this method and one modification of the method is robust and has less computation. Our tone variation modeling method is shown to improve the recognition rate from 91.3% to 95.2%.
Keywords :
feature extraction; hidden Markov models; learning (artificial intelligence); maximum likelihood estimation; pattern clustering; speech processing; speech recognition; HMM; fluent Mandarin speech; fluent Mandarin tone recognition; inter-syllabic features; likelihood weighting; recognition rate; tone variation modeling; training data clustering; Clustering algorithms; Hidden Markov models; Natural languages; Neural networks; Partitioning algorithms; Robustness; Speech analysis; Speech recognition; Training data; Vector quantization;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326140