Lattice generation with accurate word boundary in WFST framework

Author

Yuhong Guo ; Yujing Si ; Yong Liu ; Jielin Pan ; Yonghong Yan

Author_Institution

Key Lab. of Speech Acoust. & Content Understanding, Inst. of Acoust., Beijing, China

fYear

2012

fDate

16-18 Oct. 2012

Firstpage

1592

Lastpage

1595

Abstract

This paper presents an algorithm for generating speech recognition lattices with accurate word boundaries in the weighted finite-state transducer (WFST) decoding framework. In traditional WFST lattice generation algorithms, the transformation from a context-dependent phone lattice to a word lattice does not yield accurate time boundaries between words. Moreover, the resulting lattice is not in Standard Lattice Format, nor is it compatible with existing toolkits. A lattice without word boundaries can be used only in applications that do not need them. In this paper, we propose a lexicon matching algorithm based on token passing to transform the phone lattice into a word lattice. This algorithm generates standard lattices with accurate word boundaries. Experiments show that the proposed lattice generation algorithm yields good lattice quality and runs efficiently.
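The abstract does not spell out the matching procedure, so the following is only a minimal sketch of the general idea of token-passing lexicon matching, not the authors' algorithm. It assumes context-independent phone labels, an acyclic phone lattice whose arcs carry start and end times, and no acoustic or language model scores. The names `PhoneArc`, `WordArc`, and `phone_to_word_lattice` are hypothetical and introduced here for illustration.

```python
from collections import namedtuple

# Hypothetical arc types: lattice nodes are integers, times are in seconds.
PhoneArc = namedtuple("PhoneArc", "src dst phone start end")
WordArc = namedtuple("WordArc", "src dst word start end")


def phone_to_word_lattice(arcs, start_node, lexicon):
    """Turn phone arcs into word arcs by passing tokens through the lattice.

    lexicon maps each word to its phone sequence. A token records the
    lattice node it sits on, the phones matched so far, and where and
    when the current word began.
    """
    # Index every pronunciation and all of its non-empty prefixes.
    prefixes = set()
    words_by_pron = {}
    for word, phones in lexicon.items():
        phones = tuple(phones)
        words_by_pron.setdefault(phones, []).append(word)
        for i in range(1, len(phones) + 1):
            prefixes.add(phones[:i])

    outgoing = {}
    for arc in arcs:
        outgoing.setdefault(arc.src, []).append(arc)

    word_arcs = set()
    seen = set()
    stack = [(start_node, (), start_node, None)]
    while stack:
        token = stack.pop()
        if token in seen:
            continue
        seen.add(token)
        node, prefix, word_src, word_start = token
        for arc in outgoing.get(node, []):
            new_prefix = prefix + (arc.phone,)
            if new_prefix not in prefixes:
                continue  # no pronunciation starts this way; drop the token
            # The first phone of a word fixes the word's start time.
            start = word_start if prefix else arc.start
            # Keep matching a longer pronunciation along this path.
            stack.append((arc.dst, new_prefix, word_src, start))
            for word in words_by_pron.get(new_prefix, []):
                # A full pronunciation matched: emit a word arc whose
                # boundaries come from the phone arcs it spans.
                word_arcs.add(WordArc(word_src, arc.dst, word, start, arc.end))
                # Start a fresh token at the word boundary.
                stack.append((arc.dst, (), arc.dst, None))
    return sorted(word_arcs)
```

Because each word arc takes its start time from its first phone arc and its end time from its last, the word boundaries in this sketch are inherited directly from the phone-level timing.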

Keywords

speech recognition; string matching; transducers; word processing; WFST lattice generation algorithms; context-dependent phone lattice transform; lattice quality; speech recognition lattice; time boundaries; token passing-based lexicon matching algorithm; weighted finite-state transducer decoding framework; word boundary; Acoustics; Decoding; Hidden Markov models; Lattices; Signal processing algorithms; Speech recognition; Transducers; lattice generation; speech recognition; weighted finite-state transducers

fLanguage

English

Publisher

IEEE

Conference_Titel

Image and Signal Processing (CISP), 2012 5th International Congress on

Conference_Location

Chongqing

Print_ISBN

978-1-4673-0965-3

Type

conf

DOI

10.1109/CISP.2012.6469905

Filename

6469905