Acoustic and language modeling of human and nonhuman noises for human-to-human spontaneous speech recognition

Author

Schultz, T. ; Rogina, I.

Author_Institution

Interactive Syst. Lab., Karlsruhe Univ., Germany

Volume

1

fYear

1995

fDate

9-12 May 1995

Firstpage

293

Abstract

Several improvements of our speech-to-speech translation system JANUS on spontaneous human-to-human dialogs are presented. Common phenomena in spontaneous speech are described, followed by a classification of different types of noise. To handle the variety of spontaneous effects in human-to-human dialogs, special noise models are introduced representing both human and nonhuman noise, as well as word fragments. It is shown that both the acoustic and the language modeling of the noise increase the recognition performance significantly. In the experiments, a clustering of the noise classes is performed and the resulting cluster variants are compared, thus allowing one to determine the best tradeoff between the sensitivity and trainability of the models

Keywords

acoustic signal processing; interactive systems; language translation; natural languages; speech processing; speech recognition; JANUS; acoustic modeling; cluster variants; experiments; human noise; human-to-human dialogs; human-to-human spontaneous speech recognition; language modeling; noise classes clustering; noise classification; noise models; nonhuman noise; recognition performance; sensitivity; speech-to-speech translation system; trainability; word fragments; Acoustic noise; Acoustic testing; Databases; Error analysis; Hidden Markov models; Humans; Interactive systems; Natural languages; Speech recognition; Speech synthesis;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location

Detroit, MI

ISSN

1520-6149

Print_ISBN

0-7803-2431-5

Type

conf

DOI

10.1109/ICASSP.1995.479531

Filename

479531