Title :
Lattice generation with accurate word boundary in WFST framework
Author :
Yuhong Guo ; Yujing Si ; Yong Liu ; Jielin Pan ; Yonghong Yan
Author_Institution :
Key Lab. of Speech Acoust. & Content Understanding, Inst. of Acoust., Beijing, China
Abstract :
This paper presents an algorithm for generating speech recognition lattices with accurate word boundaries in the weighted finite-state transducer (WFST) decoding framework. In traditional WFST lattice generation algorithms, the transformation from a context-dependent phone lattice to a word lattice does not yield accurate time boundaries between words. Moreover, the resulting lattice is neither in Standard Lattice Format nor compatible with existing toolkits. A lattice without word boundaries can be used only in applications that do not need them. We propose a lexicon matching algorithm based on token passing that transforms the phone lattice into a word lattice. This algorithm generates standard lattices with accurate word boundaries. Experiments show that the proposed lattice generation algorithm yields good lattice quality and runs efficiently.
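The abstract does not give the details of the lexicon matching step, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes a simplified phone lattice with context-independent phone labels (the paper works on context-dependent phones), a time stamp on every lattice node, and a small hypothetical lexicon. Tokens start at every node where a word may begin, advance along phone arcs while following a pronunciation prefix tree, and emit a word arc carrying the start and end times whenever a complete pronunciation is matched. Acoustic and language model scores are omitted. All names (`build_trie`, `phone_to_word_lattice`, the example data) are illustrative assumptions.

```python
from collections import defaultdict, deque


def build_trie(lexicon):
    """Build a pronunciation prefix tree from {word: [phone, ...]}."""
    root = {"next": {}, "words": []}
    for word, phones in lexicon.items():
        node = root
        for phone in phones:
            node = node["next"].setdefault(phone, {"next": {}, "words": []})
        node["words"].append(word)
    return root


def phone_to_word_lattice(arcs, node_time, start, lexicon):
    """Turn phone arcs (src, dst, phone) into word arcs with time boundaries.

    Returns a sorted list of (src, dst, word, start_time, end_time).
    Scores are ignored in this sketch.
    """
    outgoing = defaultdict(list)
    for src, dst, phone in arcs:
        outgoing[src].append((dst, phone))

    root = build_trie(lexicon)
    word_arcs = set()
    seen_starts = {start}      # lattice nodes where a word may begin
    pending = deque([start])

    while pending:
        word_start = pending.popleft()
        # A token is (current lattice node, current prefix-tree node).
        tokens = [(word_start, root)]
        while tokens:
            node, trie = tokens.pop()
            for dst, phone in outgoing[node]:
                nxt = trie["next"].get(phone)
                if nxt is None:
                    continue   # no pronunciation continues with this phone
                for word in nxt["words"]:
                    # Complete pronunciation matched: emit a timed word arc.
                    word_arcs.add((word_start, dst, word,
                                   node_time[word_start], node_time[dst]))
                    if dst not in seen_starts:
                        seen_starts.add(dst)
                        pending.append(dst)
                tokens.append((dst, nxt))
    return sorted(word_arcs)


# Hypothetical example data (not from the paper).
lexicon = {"a": ["ah"], "cat": ["k", "ae", "t"], "cab": ["k", "ae", "b"]}
arcs = [(0, 1, "ah"), (1, 2, "k"), (2, 3, "ae"), (3, 4, "t"), (3, 5, "b")]
node_time = {0: 0.00, 1: 0.10, 2: 0.18, 3: 0.30, 4: 0.42, 5: 0.40}

word_lattice = phone_to_word_lattice(arcs, node_time, 0, lexicon)
for arc in word_lattice:
    print(arc)
```

Because each word arc is created at the moment a pronunciation is fully matched, its start and end times are read directly from the phone lattice nodes, which is the property the abstract refers to as an accurate word boundary.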
Keywords :
speech recognition; string matching; transducers; word processing; WFST lattice generation algorithms; context-dependent phone lattice transform; lattice quality; speech recognition lattice; time boundaries; token passing-based lexicon matching algorithm; weighted finite-state transducer decoding framework; word boundary; Acoustics; Decoding; Hidden Markov models; Lattices; Signal processing algorithms; lattice generation; weighted finite-state transducers
Conference_Title :
Image and Signal Processing (CISP), 2012 5th International Congress on
Conference_Location :
Chongqing
Print_ISBN :
978-1-4673-0965-3
DOI :
10.1109/CISP.2012.6469905