DocumentCode :
417185
Title :
Unsupervised and active learning in automatic speech recognition for call classification
Author :
Hakkani-Tur, Dilek ; Tur, Gokhan ; Rahim, Mazin ; Riccardi, Giuseppe
Author_Institution :
AT&T Labs.-Res., USA
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
A key challenge in rapidly building spoken natural language dialog applications is minimizing the manual effort required in transcribing and labeling speech data. This task is not only expensive but also time consuming. We present a novel approach that aims at reducing the amount of manually transcribed in-domain data required for building automatic speech recognition (ASR) models in spoken language dialog systems. Our method is based on mining relevant text from various conversational systems and Web sites. An iterative process is employed where the performance of the models can be improved through both unsupervised and active learning of the ASR models. We have evaluated the robustness of our approach on a call classification task that has been selected from AT&T VoiceToneSM customer care. Our results indicate that with unsupervised learning it is possible to achieve a call classification performance that is only 1.5% lower than the upper bound set when using all available in-domain transcribed data.
Keywords :
data mining; interactive systems; iterative methods; natural language interfaces; natural languages; speech recognition; speech-based user interfaces; unsupervised learning; AT&T VoiceTone; Web sites; active learning; automatic speech recognition; call classification; conversational systems; data mining; iterative process; labeling speech data; spoken language dialog systems; spoken natural language dialog applications; transcribing speech data; unsupervised learning; Automatic speech recognition; Data mining; Humans; Iterative methods; Labeling; Natural languages; Robustness; Routing; Unsupervised learning; Upper bound;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326014
Filename :
1326014
Link To Document :
بازگشت