Title :
A unified approach of incorporating general features in decision tree based acoustic modeling
Author :
Reichl, Wolfgang ; Chou, Wu
Author_Institution :
Lucent Technol., AT&T Bell Labs., Murray Hill, NJ, USA
Abstract :
A unified maximum likelihood framework of incorporating phonetic and non-phonetic features in decision tree based acoustic modeling is proposed. Unlike phonetic features, non-phonetic features in this context are those features, which cannot be derived from the phoneme identities. Although non-phonetic features are used in speech recognition, they are often treated separately and based on various heuristics. In our approach, non-phonetic features are included as additional tags to the decision tree clustering. Moreover, the proposed tagged decision tree is based on the full training data, and therefore, it alleviates the problem of training data depletion in building specific feature dependent acoustic models. Experimental results indicate that up to 10% word error rate reduction can be achieved in a large vocabulary (Wall Street Journal) speech recognition task based on the proposed approach
Keywords :
acoustic signal processing; decision trees; feature extraction; hidden Markov models; maximum likelihood estimation; speech recognition; HMM; Wall Street Journal; decision tree based acoustic modeling; decision tree clustering; experimental results; feature dependent acoustic models; general features; large vocabulary speech recognition task; nonphonetic features; phonetic features; tagged decision tree; training data; unified maximum likelihood; word error rate reduction; Decision trees; Degradation; Error analysis; Loudspeakers; Maximum likelihood estimation; Noise level; Speech recognition; Tagging; Training data; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.759731