Title :
A comparison of dynamic WFST decoding approaches
Author :
Dixon, Paul R. ; Hori, Chiori ; Kashioka, Hideki
Author_Institution :
Nat. Inst. of Inf. & Commun. Technol., Kyoto, Japan
Abstract :
In this paper we perform a comparison of lookahead composition and on-the-fly hypothesis rescoring using a common decoder. The results on a large vocabulary speech recognition task illustrate the differences in the behaviour of these algorithms in terms of error rate, real time factor, memory usage and internal statistics of the decoder. The evaluations were performed when the decoder was operated at either the state or arc level. The results show the dynamic approaches also work well at the state level even though there is greater dynamic construction cost.
Keywords :
error statistics; speech coding; speech recognition; arc level; decoder; dynamic WFST decoding; dynamic construction cost; error rate; internal statistics; large vocabulary speech recognition task; lookahead composition; memory usage; on-the-fly hypothesis rescoring; real time factor; state level; weighted finite state transducer; Acoustic beams; Acoustics; Decoding; Heuristic algorithms; Speech recognition; Transducers; Vocabulary; Speech recognition; WFST; on-the-fly composition;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288847