Title :
Information retrieval methods for automatic speech recognition
Author :
Xiao, Xiaoqiang ; Droppo, Jasha ; Acero, Alex
Author_Institution :
Dept. of Electr. Eng., Pennsylvania State Univ., University Park, PA, USA
Abstract :
In this paper, we use information retrieval (IR) techniques to improve a speech recognition (ASR) system. The potential benefits include improved speed, accuracy, and scalability. Where conventional HMM-based speech recognition systems decode words directly, our IR-based system first decodes subword units. These are then mapped to a target word by the IR system. In this decoupled system, the IR serves as a lightweight, data-driven pronunciation model. Our proposed method is evaluated in the Windows Live Search for Mobile (WLS4M) task, and our best system has 12% fewer errors than a comparable HMM classifier. We show that even using an inexpensive IR weighting scheme (TF-IDF) yields a 3% relative error rate reduction while maintaining all of the advantages of the IR approach.
Keywords :
hidden Markov models; information retrieval; speech recognition; HMM; automatic speech recognition; data-driven pronunciation model; decoupled system; hidden Markov models; information retrieval weighting scheme; windows live search for mobile task; Automatic speech recognition; Background noise; Decoding; Dictionaries; Engines; Feature extraction; Hidden Markov models; Information retrieval; Power system modeling; Speech recognition; Direct Modeling; Information Retrieval; Language Model; Speech Recognition; Vector Space Model;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495229