Title :
An efficient beam pruning with a reward considering the potential to reach various words on a lexical tree
Author :
Kato, Tsuneo ; Fujita, Kengo ; Nishizawa, Nobuyuki
Author_Institution :
User Interface Lab., KDDI R& D Labs. Inc., Fujimino, Japan
Abstract :
This paper presents an efficient frame-synchronous beam pruning for automatic speech recognition. With conventional beam pruning, hypotheses that have a greater potential to reach various words on a lexical tree are likely to be pruned out, since this potential is not taken into account. To make the beam pruning less restrictive for hypotheses with a greater potential and vice versa, the proposed method adds a reward as a monotonically increasing function of the number of reachable words from the node where a hypothesis stays on a lexical tree, to the likelihood of the hypothesis. The reward is designed not to collapse the ASR probabilistic framework. The proposed method reduces the processing time from 30% to 70% for grammar-based tasks. For a language-model-based dictation task, it also causes an additional reduction from the processing time of the beam pruning with the language model look-ahead technique.
Keywords :
grammars; speech processing; speech recognition; text analysis; trees (mathematics); ASR probabilistic framework; automatic speech recognition; dictation task; efficient beam pruning; frame-synchronous beam pruning; grammar-based tasks; language-model; lexical tree; reachable words; Acoustic beams; Acoustic devices; Automatic speech recognition; Hidden Markov models; Histograms; Laboratories; Natural languages; Search engines; User interfaces; Vocabulary; frame synchronous beam search; lexical tree; pruning;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495098