DocumentCode
294578
Title
Acoustic and language modeling of human and nonhuman noises for human-to-human spontaneous speech recognition
Author
Schultz, T. ; Rogina, I.
Author_Institution
Interactive Syst. Lab., Karlsruhe Univ., Germany
Volume
1
fYear
1995
fDate
9-12 May 1995
Firstpage
293
Abstract
Several improvements of our speech-to-speech translation system JANUS on spontaneous human-to-human dialogs are presented. Common phenomena in spontaneous speech are described, followed by a classification of different types of noise. To handle the variety of spontaneous effects in human-to-human dialogs, special noise models are introduced representing both human and nonhuman noise, as well as word fragments. It is shown that both the acoustic and the language modeling of the noise increase the recognition performance significantly. In the experiments, a clustering of the noise classes is performed and the resulting cluster variants are compared, thus allowing one to determine the best tradeoff between the sensitivity and trainability of the models.
Keywords
acoustic signal processing; interactive systems; language translation; natural languages; speech processing; speech recognition; JANUS; acoustic modeling; cluster variants; experiments; human noise; human-to-human dialogs; human-to-human spontaneous speech recognition; language modeling; noise classes clustering; noise classification; noise models; nonhuman noise; recognition performance; sensitivity; speech-to-speech translation system; trainability; word fragments; Acoustic noise; Acoustic testing; Databases; Error analysis; Hidden Markov models; Humans; Interactive systems; Natural languages; Speech recognition; Speech synthesis
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location
Detroit, MI
ISSN
1520-6149
Print_ISBN
0-7803-2431-5
Type
conf
DOI
10.1109/ICASSP.1995.479531
Filename
479531