Title :
Identification and modeling of word fragments in spontaneous speech
Author :
Tsvetkov, Yulia ; Sheikh, Zaid ; Metze, Florian
Author_Institution :
Language Technol. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
This paper presents a novel approach to handling disfluencies, word fragments and self-interruption points in Cantonese conversational speech. We train a classifier that exploits lexical and acoustic information to automatically identify disfluencies during training of a speech recognition system on conversational speech, and then use this classifier to augment reference annotations used for acoustic model training. We experiment with approaches to modeling disfluencies in the pronunciation dictionary, and their effect on the polyphonic decision tree clustering. We achieve automatic detection of disfluencies with 88% accuracy, which leads to a reduction in character error rate of 1.9% absolute. While the high baseline error rates are due to the task we are currently working on, we demonstrate that this approach works well on the Switchboard corpus, for which the conversational nature of speech is also a major problem.
Keywords :
acoustic signal processing; decision trees; dictionaries; natural language processing; pattern clustering; signal classification; speech recognition; Cantonese conversational speech; Switchboard corpus; acoustic model training; character error rate reduction; classifier training; disfluencies handling; disfluencies modeling; lexical information; polyphonic decision tree clustering; pronunciation dictionary; reference annotations; self-interruption points; speech recognition system; spontaneous speech; word fragment identification; word fragment modeling; word fragments; Accuracy; Acoustics; Dictionaries; Feature extraction; Speech; Speech recognition; Training; conversational speech; disfluency modeling; reference annotation; speech recognition; word fragments identification;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639146