Title :
Methods for improving robustness of decision tree in Mandarin speech recognition
Author :
Xu, Xianghua ; Zhu, Jie ; Guo, Qiang
Author_Institution :
Dept. of Electron. Eng., Shanghai Jiao Tong Univ., China
Abstract :
Phonetic decision tree based state tying has been widely used in most large vocabulary continuous speech recognition (LVCSR) systems. However, in most cases, the samples of different leaf nodes are very unbalanced, which may affect the recognition performance. In This work, node merging techniques are proposed to alleviate the problem and further decrease the number of senones. On the other hand, in order to lessen the impact of rare triphones on the quality of the decision tree based state tying and improve the accuracy of every final senone, two methods of dealing with rare triphones are added to hidden Markov model (HMM) acoustic modeling before state tying. Experimental results show that these methods greatly improve the robustness of the decision tree and can achieve better performance with even fewer parameters.
Keywords :
decision trees; hidden Markov models; natural languages; speech recognition; HMM acoustic modeling; LVCSR; Mandarin speech recognition robustness; hidden Markov models; large vocabulary continuous speech recognition systems; leaf node sample balance; node merging techniques; phonetic decision tree based state tying; rare triphones; senones number reduction; Context modeling; Decision trees; Frequency; Hidden Markov models; Maximum likelihood estimation; Merging; Robustness; Speech recognition; Training data; Vocabulary;
Conference_Titel :
Multimedia and Expo, 2004. ICME '04. 2004 IEEE International Conference on
Print_ISBN :
0-7803-8603-5
DOI :
10.1109/ICME.2004.1394649