Title :
Improvements on transducing syllable lattice to word lattice for keyword search
Author :
Hang Su ; Van Tung Pham ; Yanzhang He ; Hieronymus, James
Author_Institution :
Int. Comput. Sci. Inst., Berkeley, CA, USA
Abstract :
This paper investigates a weighted finite state transducer (WFST) based syllable decoding and transduction method for keyword search (KWS), and compares it with sub-word search and phone confusion methods in detail. Acoustic context dependent phone models are trained from word forced alignments and then used for syllable decoding and lattice generation. Out-of-vocabulary (OOV) keyword pronunciations are produced using a grapheme-to-syllable (G2S) system and then used to construct a lexical transducer. The lexical transducer is then composed with a keyword-boosted language model (LM) to transduce the syllable lattices to word lattices for final KWS. Word Error Rates (WER) and KWS results are reported for 5 different languages. It is shown that the syllable transduction method gives comparable KWS results to the syllable search and phone confusion methods. Combination of these three methods further improves OOV KWS performance.
Keywords :
lattice theory; speech recognition; KWS; WFST; grapheme-to-syllable system; keyword search; keyword-boosted language model; multilingual speech; out-of-vocabulary keyword pronunciations; speech recognition; syllable decoding; syllable lattice; transduction method; weighted finite state transducer; word error rates; word lattice; Acoustics; Decoding; Keyword search; Lattices; Speech recognition; Training; Transducers; Keyword Search; OOV Keywords; Speech Recognition; Syllable Transduction; WFST;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178868