Title :
VERBMOBIL: the use of prosody in the linguistic components of a speech understanding system
Author :
Nöth, Elmar ; Batliner, Anton ; Kiessling, Andreas ; Kompe, Ralf ; Niemann, Heinrich
Author_Institution :
Lehrstuhl fur Mustererkennung, Erlangen-Nurnberg Univ., Germany
fDate :
9/1/2000 12:00:00 AM
Abstract :
We show how prosody can be used in speech understanding systems. This is demonstrated with the VERBMOBIL speech to-speech translation system which, to our knowledge, is the first complete system which successfully uses prosodic information in the linguistic analysis. Prosody is used by computing probabilities for clause boundaries, accentuation, and different types, of sentence mood for each of the word hypotheses computed by the word recognizer. These probabilities guide the search of the linguistic analysis. Disambiguation is already achieved during the analysis and not by a prosodic verification of different linguistic hypotheses. So far, the most useful prosodic information is provided by clause boundaries. These are detected with a recognition rate of 94%. For the parsing of word hypotheses graphs, the use of clause boundary probabilities yields a speed-up of 92% and a 96% reduction of alternative readings
Keywords :
language translation; linguistics; natural languages; probability; speech processing; VERBMOBIL; accentuation; alternative readings reduction; clause boundaries; disambiguation; experiments; linguistic analysis; linguistic components; probabilities; prosodic information; recognition rate; sentence mood; speech to-speech translation system; speech understanding system; stochastic language models; syntactic analysis; word hypotheses graph; word recognizer; Data mining; Decoding; Educational technology; Emotion recognition; Frequency; Humans; Information analysis; Mood; Rhythm; Speech analysis;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on