Title :
Flexible transcription alignment
Author :
Finke, Michael ; Waibel, Alex
Author_Institution :
Interactive Syst. Labs., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
Presents a set of techniques that we employed in our Janus Recognition Toolkit (JRTk) Switchboard and CallHome recognizer to deal with imperfections in the transcriptions: inconsistent transcription of pronunciations and contractions, as well as errors in utterance segmentation. These techniques consist of a dynamic, speaking-mode-dependent pronunciation model and a flexible utterance alignment procedure based on speaker-adapted models (label boosting). The idea is (a) to automatically retranscribe the training corpus using these models and procedures, (b) to train a recognizer on the resulting flexible transcription graphs, and (c) to decode with a dynamic, speaking-mode-dependent dictionary. Applying the framework significantly improves the performance of our state-of-the-art JRTk Switchboard recognizer.
Keywords :
decoding; glossaries; speech coding; speech recognition; JRTk CallHome recognizer; JRTk Switchboard recognizer; Janus Recognition Toolkit; automatic retranscription; contractions; conversational speech recognition; decoding; dynamic speaking-mode-dependent dictionary; flexible transcription alignment; flexible transcription graphs; inconsistent transcription; label boosting; performance; pronunciations; speaker-adapted models; speaking-mode-dependent pronunciation model; training corpus; transcription imperfections; utterance alignment procedure; utterance segmentation errors; Automatic speech recognition; Boosting; Decoding; Dictionaries; Error analysis; Interactive systems; Laboratories; Speech recognition; Telephony; Vocabulary
Conference_Title :
1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-7803-3698-4
DOI :
10.1109/ASRU.1997.658974