DocumentCode :
2576765
Title :
Methods for improving robustness of decision tree in Mandarin speech recognition
Author :
Xu, Xianghua ; Zhu, Jie ; Guo, Qiang
Author_Institution :
Dept. of Electron. Eng., Shanghai Jiao Tong Univ., China
Volume :
3
fYear :
2004
fDate :
27-30 June 2004
Firstpage :
1975
Abstract :
Phonetic decision tree based state tying has been widely used in most large vocabulary continuous speech recognition (LVCSR) systems. However, in most cases, the samples of different leaf nodes are very unbalanced, which may affect the recognition performance. In This work, node merging techniques are proposed to alleviate the problem and further decrease the number of senones. On the other hand, in order to lessen the impact of rare triphones on the quality of the decision tree based state tying and improve the accuracy of every final senone, two methods of dealing with rare triphones are added to hidden Markov model (HMM) acoustic modeling before state tying. Experimental results show that these methods greatly improve the robustness of the decision tree and can achieve better performance with even fewer parameters.
Keywords :
decision trees; hidden Markov models; natural languages; speech recognition; HMM acoustic modeling; LVCSR; Mandarin speech recognition robustness; hidden Markov models; large vocabulary continuous speech recognition systems; leaf node sample balance; node merging techniques; phonetic decision tree based state tying; rare triphones; senones number reduction; Context modeling; Decision trees; Frequency; Hidden Markov models; Maximum likelihood estimation; Merging; Robustness; Speech recognition; Training data; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2004. ICME '04. 2004 IEEE International Conference on
Print_ISBN :
0-7803-8603-5
Type :
conf
DOI :
10.1109/ICME.2004.1394649
Filename :
1394649
Link To Document :
بازگشت