Title :
Optimized large vocabulary WFST speech recognition system
Author :
Guo, Yuhong ; Li, Ta ; Si, Yujing ; Pan, Jielin ; Yan, Yonghong
Author_Institution :
Key Lab. of Speech Acoust. & Content Understanding, Inst. of Acoust., Beijing, China
Abstract :
Speech recognition decoder is an important part of large vocabulary speech recognition application. The speed and the accuracy is the main concern of its application. Recently, weighted finite state transducers (WFST) has become the dominant description of decoding network. However, the large memory and time cost of constructing the final WFST decoding network is the bottleneck of this technique. The goal of this article is to construct a tight, flexible WFST decoding network as well as a fast, scalable decoder. A tight representation of silence in speech is proposed and the decoding algorithm with improved pruning strategies is also suggested. The experimental results show that the proposed network presentation will cut off 37% memory cost and 19% time cost of constructing the final decoding network. And with the decoding strategies of WFST feature specified beams the proposed decoder´s efficiency and accuracy are also significantly improved.
Keywords :
finite state machines; speech coding; speech recognition; vocabulary; WFST decoding network; decoding algorithm; memory cost; optimized large vocabulary WFST speech recognition system; pruning strategy; scalable decoder; speech recognition decoder; time cost; weighted finite state transducer; Accuracy; Decoding; Hidden Markov models; Speech; Speech recognition; Structural beams; Transducers; optimization; speech recognition; weighted finite state transducer;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2012 9th International Conference on
Conference_Location :
Sichuan
Print_ISBN :
978-1-4673-0025-4
DOI :
10.1109/FSKD.2012.6234200