مرکز منطقه ای اطلاع رساني علوم و فناوري - Optimized large vocabulary WFST speech recognition system

DocumentCode :

2550206

Title :

Optimized large vocabulary WFST speech recognition system

Author :

Guo, Yuhong ; Li, Ta ; Si, Yujing ; Pan, Jielin ; Yan, Yonghong

Author_Institution :

Key Lab. of Speech Acoust. & Content Understanding, Inst. of Acoust., Beijing, China

fYear :

2012

fDate :

29-31 May 2012

Firstpage :

1243

Lastpage :

1247

Abstract :

Speech recognition decoder is an important part of large vocabulary speech recognition application. The speed and the accuracy is the main concern of its application. Recently, weighted finite state transducers (WFST) has become the dominant description of decoding network. However, the large memory and time cost of constructing the final WFST decoding network is the bottleneck of this technique. The goal of this article is to construct a tight, flexible WFST decoding network as well as a fast, scalable decoder. A tight representation of silence in speech is proposed and the decoding algorithm with improved pruning strategies is also suggested. The experimental results show that the proposed network presentation will cut off 37% memory cost and 19% time cost of constructing the final decoding network. And with the decoding strategies of WFST feature specified beams the proposed decoder´s efficiency and accuracy are also significantly improved.

Keywords :

finite state machines; speech coding; speech recognition; vocabulary; WFST decoding network; decoding algorithm; memory cost; optimized large vocabulary WFST speech recognition system; pruning strategy; scalable decoder; speech recognition decoder; time cost; weighted finite state transducer; Accuracy; Decoding; Hidden Markov models; Speech; Speech recognition; Structural beams; Transducers; optimization; speech recognition; weighted finite state transducer;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Fuzzy Systems and Knowledge Discovery (FSKD), 2012 9th International Conference on

Conference_Location :

Sichuan

Print_ISBN :

978-1-4673-0025-4

Type :

conf

DOI :

10.1109/FSKD.2012.6234200

Filename :

6234200

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2550206