Title :
Experiments for an approach to language identification with conversational telephone speech
Author :
Yan, Yonghong ; Barnard, Etienne
Author_Institution :
Center for Spoken Language Understanding, Oregon Graduate Inst. of Sci. & Technol., Portland, OR, USA
Abstract :
This paper presents work on language identification research using conversational speech (the LDC Conversational Telephone Speech Database). The baseline system used in this study is based on language-dependent phone recognition and phonotactic constraints. The system was trained using monologue data and obtained an error rate of around 9% on a commonly used nine-language monologue test set. While the system was used to process conversational speech from the same nine-language task, dramatic performance degradation (with an error rate of 40%) was observed. Based on our analysis of conversational speech, two methods: (1) pre-processing and, (2) post-processing, were proposed. Without the presence of training data from conversational speech database, the final system (the baseline system enhanced by the two proposed methods) obtained an error rate of 24%, a substantial improvement (with 41% error reduction) compared with the baseline system
Keywords :
error statistics; natural languages; speech recognition; LDC Conversational Telephone Speech Database; conversational speech; conversational telephone speech; error rate; language identification; nine-language monologue test set; performance degradation; phonotactic constraints; post-processing; pre-processing; Databases; Degradation; Error analysis; Natural languages; Speech analysis; Speech enhancement; Speech processing; System testing; Telephony; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Print_ISBN :
0-7803-3192-3
DOI :
10.1109/ICASSP.1996.543239