Title :
Semantic cache model driven speech recognition
Author :
Lecouteux, Benjamin ; Nocera, Pascal ; Linarès, Georges
Author_Institution :
LIA-CERI, Univ. of Avignon, Avignon, France
Abstract :
This paper proposes an improved semantic cache model: the method uses the first pass of the ASR system, together with confidence scores and semantic fields, to drive the second pass. In previous papers, we introduced a Driven Decoding Algorithm (DDA), which combines speech recognition systems by guiding the search algorithm of a primary ASR system with the one-best hypothesis of an auxiliary system. We propose a strategy that uses DDA to drive a semantic cache according to the confidence measures. Combining the semantic cache with DDA optimizes the new decoding process and acts like an unsupervised language model adaptation. Experiments evaluate the proposed method on 8 hours of speech. Results show that semantic-DDA yields significant improvements over the baseline: we obtain a 4% relative word error rate improvement without acoustic adaptation, and 1.9% after adaptation, with a 3xRT ASR system.
Keywords :
search problems; speech coding; speech recognition; ASR system; driven decoding algorithm; search algorithm; semantic cache model; semantic-DDA; Acoustic measurements; Adaptation model; Automatic speech recognition; Context modeling; Decoding; Error analysis; History; Probability; Speech analysis; Latent Semantic Analysis; cache model; driven decoding
Conference_Title :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
ISSN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495642