DocumentCode
598987
Title
Lattice generation with accurate word boundary in WFST framework
Author
Yuhong Guo ; Yujing Si ; Yong Liu ; Jielin Pan ; Yonghong Yan
Author_Institution
Key Lab. of Speech Acoust. & Content Understanding, Inst. of Acoust., Beijing, China
fYear
2012
fDate
16-18 Oct. 2012
Firstpage
1592
Lastpage
1595
Abstract
This paper presents an algorithm for generating speech recognition lattices with accurate word boundaries in the weighted finite-state transducer (WFST) decoding framework. In traditional WFST lattice generation algorithms, the transformation from the context-dependent phone lattice to the word lattice does not yield accurate time boundaries between words. Moreover, the resulting lattice is neither in the Standard Lattice Format nor compatible with existing toolkits. A lattice without word boundaries can only be used in applications where word boundaries are not needed. In this paper, we propose a lexicon matching algorithm based on token passing to transform the phone lattice into the word lattice. This algorithm generates standard lattices with accurate word boundaries. Experiments show that the proposed lattice generation algorithm achieves good lattice quality and good efficiency.
Keywords
speech recognition; string matching; transducers; word processing; WFST lattice generation algorithms; context-dependent phone lattice transform; lattice quality; speech recognition lattice; time boundaries; token passing-based lexicon matching algorithm; weighted finite-state transducer decoding framework; word boundary; Acoustics; Decoding; Hidden Markov models; Lattices; Signal processing algorithms; Speech recognition; Transducers; lattice generation; weighted finite-state transducers
fLanguage
English
Publisher
IEEE
Conference_Titel
Image and Signal Processing (CISP), 2012 5th International Congress on
Conference_Location
Chongqing
Print_ISBN
978-1-4673-0965-3
Type
conf
DOI
10.1109/CISP.2012.6469905
Filename
6469905