Title :
The use of acoustically detected filled and silent pauses in spontaneous speech recognition
Author :
Ogata, Jun ; Goto, Masataka ; Itou, Katunobu
Author_Institution :
Nat. Inst. of Adv. Ind. Sci. & Technol. (AIST), Ibaraki
Abstract :
In recognizing spontaneous speech, the performance of typical speech recognizers tends to be degraded by filled and silent pauses, which are hesitation phenomena frequently occurred in such speech. In this paper, we present a method for improving the performance of a speech recognizer by detecting and handling both filled pauses (lengthened vowels) and silent (unfilled) pauses. Our method automatically detects these pauses by using a bottom-up acoustical analysis in parallel with a typical speech decoding process, and then incorporates the detected results into the decoding process. From the results of experiments conducted using the CIAIR spontaneous speech corpus, the effectiveness of the proposed method was confirmed.
Keywords :
acoustic signal detection; decoding; speech coding; speech recognition; CIAIR spontaneous speech corpus; acoustic filled pause detection; acoustic silent pause detection; bottom-up acoustical analysis; hesitation phenomena; speech decoding process; spontaneous speech recognition; Acoustic signal detection; Automatic speech recognition; Decoding; Detectors; Natural languages; Predictive models; Speech analysis; Speech processing; Speech recognition; Vocabulary; acoustic model; filled pause; language model; silent pause; spontaneous speech;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960581