DocumentCode :
302315
Title :
An integrated model of acoustics and language using semantic classification trees
Author :
Noth, E. ; De Mori, R. ; Fischer, J. ; Gebhard, A. ; Harbeck, S. ; Kompe, R. ; Kuhn, R. ; Niemann, Helen ; Mast, M.
Author_Institution :
Lehrstuhl fur Mustererkennung, Erlangen-Nurnberg Univ., Germany
Volume :
1
fYear :
1996
fDate :
7-10 May 1996
Firstpage :
419
Abstract :
We propose multilevel semantic classification trees to combine different information sources for predicting speech events (e.g. word chains, phrases, etc.). Traditionally in speech recognition systems these information sources (acoustic evidence, language model) are calculated independently and combined via Bayes rule. The proposed approach allows one to combine sources of different types it is no longer necessary for each source to yield a probability. Moreover the tree can look at several information sources simultaneously. The approach is demonstrated for the prediction of prosodically marked phrase boundaries, combining information about the spoken word chain, word category information, prosodic parameters, and the result of a neural network predicting the boundary on the basis of acoustic-prosodic features. The recognition rates of up to 90% for the two class problem boundary vs. no boundary are already comparable to results achieved with the above mentioned Bayes rule approach that combines the acoustic classifier with a 5-gram categorical language model. This is remarkable, since so far only a small set of questions combining information from different sources have been implemented
Keywords :
acoustic signal processing; grammars; natural languages; semantic networks; speech processing; speech recognition; trees (mathematics); 5-gram categorical language model; Bayes rule; acoustic classifier; acoustic evidence; acoustic-prosodic features; boundary; information sources; integrated model; neural network; phrases; prosodic parameters; prosodically marked phrase boundaries; recognition rates; semantic classification trees; speech events prediction; speech recognition systems; two class problem; word category information; word chains; Acoustics; Classification tree analysis; Computer science; Context modeling; Educational institutions; Electronic mail; Irrigation; Natural languages; Neural networks; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
ISSN :
1520-6149
Print_ISBN :
0-7803-3192-3
Type :
conf
DOI :
10.1109/ICASSP.1996.541122
Filename :
541122
Link To Document :
بازگشت