DocumentCode
3245070
Title
A feature-based filled pause detection system for Dutch
Author
Stouten, Frederik ; Martens, Jean-Pierre
Author_Institution
ELIS, Ghent Univ., Belgium
fYear
2003
fDate
30 Nov.-3 Dec. 2003
Firstpage
309
Lastpage
314
Abstract
Nowadays, automatic speech recognizers have become quite good at recognizing well-prepared, fluent speech (e.g. news readings). However, the recognition of unprepared or spontaneous speech is still problematic. Some important reasons for this are that spontaneous speech is less articulated, exhibits a high speaking rate, and usually contains many disfluencies. The latter occur when the speaker needs time to think about the continuation of the discourse, or needs to change or correct the last utterance. Although there are different types of disfluencies (interruptions, corrections, repetitions, etc.), the most common ones are filled pauses. They can take the form of an interjection like /uh/ or /uhm/, or an abnormal lengthening of one syllable of a word. In this paper we propose a new method for detecting such fillers prior to speech recognition. Tests show that it is possible to improve the recognition accuracy just by removing the detected filled pauses from the recognizer input.
Keywords
feature extraction; speech processing; speech recognition; Dutch language; abnormal syllable lengthening; automatic speech recognizers; disfluencies; feature-based filled pause detection; interjection; recognition accuracy; spontaneous speech; Automatic speech recognition; Computer vision; Databases; Detection algorithms; Detectors; Indexing; Search engines; Testing
fLanguage
English
Publisher
IEEE
Conference_Titel
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
Print_ISBN
0-7803-7980-2
Type
conf
DOI
10.1109/ASRU.2003.1318459
Filename
1318459