Title :
Syllable based keyword search: Transducing syllable lattices to word lattices
Author :
Hang Su ; Hieronymus, James ; Yanzhang He ; Fosler-Lussier, Eric ; Wegmann, Steven
Author_Institution :
Int. Comput. Sci. Inst., Berkeley, CA, USA
Abstract :
This paper presents a weighted finite state transducer (WFST) based syllable decoding and transduction framework for keyword search (KWS). Context-dependent acoustic phone models are trained from word-level forced alignments. Syllable decoding is then performed, generating lattices with a syllable lexicon and language model (LM). To handle out-of-vocabulary (OOV) keywords, their pronunciations are produced using a grapheme-to-syllable (G2S) system. A syllable-to-word lexical transducer containing both in-vocabulary (IV) and OOV keywords is then constructed and composed with a keyword-boosted LM transducer. The composed transducer is used to transduce syllable lattices into word lattices for the final KWS. We show that our method performs KWS effectively on both IV and OOV keywords, and improves Actual Term-Weighted Value (ATWV) by up to 0.03 over searching for keywords directly in subword lattices. Word Error Rates (WER) and KWS results are reported for three different languages.
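The lattice transduction step summarized in the abstract can be illustrated with a small sketch. This is not the authors' WFST implementation: it is a plain-Python stand-in for composing a syllable lattice with a syllable-to-word lexicon, and every name in it (`transduce`, `syl_arcs`, `lexicon`, `boost`) is hypothetical. It assumes costs are additive (lower is better), that every pronunciation is non-empty, and that keyword boosting can be modeled as a fixed cost reduction per word, which may differ from how the paper's keyword-boosted LM works.

```python
from collections import defaultdict


def transduce(syl_arcs, lexicon, boost=None):
    """Map syllable arcs (src, dst, syllable, cost) to word arcs (src, dst, word, cost).

    lexicon maps each word to a list of pronunciations, each a tuple of syllables.
    boost maps a word to a cost reduction (an assumed stand-in for LM boosting).
    """
    boost = boost or {}
    # Index outgoing syllable arcs by source state.
    out = defaultdict(list)
    for src, dst, syl, cost in syl_arcs:
        out[src].append((dst, syl, cost))

    word_arcs = set()
    for word, prons in lexicon.items():
        for pron in prons:
            for start in list(out):
                # Frontier holds (state, accumulated cost) after matching a prefix of pron.
                frontier = [(start, 0.0)]
                for syl in pron:
                    frontier = [
                        (dst, acc + cost)
                        for state, acc in frontier
                        for dst, label, cost in out.get(state, [])
                        if label == syl
                    ]
                # Each surviving path becomes one word arc spanning start..end.
                for end, acc in frontier:
                    word_arcs.add((start, end, word, acc - boost.get(word, 0.0)))
    return sorted(word_arcs)


# Toy syllable lattice with two competing syllables between states 1 and 2.
syl_arcs = [(0, 1, "ki", 1.0), (1, 2, "ta", 0.5), (1, 2, "da", 0.75), (2, 3, "bu", 0.25)]
# Toy lexicon; "kida" plays the role of a boosted keyword.
lexicon = {"kita": [("ki", "ta")], "kida": [("ki", "da")], "bu": [("bu",)]}
word_arcs = transduce(syl_arcs, lexicon, boost={"kida": 0.5})
```

In this toy case the boosted keyword "kida" ends up cheaper than the competing word "kita" over the same span, so a keyword search over the word arcs would rank it first. A real system would typically express the lexicon and LM as WFSTs and rely on a toolkit's composition operation instead of this explicit path matching.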
Keywords :
acoustic signal processing; finite state machines; information retrieval; natural language processing; speech coding; vocabulary; ATWV; G2S system; IV keywords; KWS; LM; OOV keywords; WFST; acoustic context dependent phone models; actual term-weighted value; grapheme-to-syllable system; in-vocabulary keywords; keyword-boosted LM transducer; language model; out-of-vocabulary keywords; syllable based keyword search; syllable decoding; syllable lattices-word lattices transduction; syllable lexicon; syllable-word lexical transducer; weighted finite state transducer based syllable decoding; word error rates; word forced alignments; Acoustics; Decoding; Hidden Markov models; Lattices; Training; Training data; Transducers; Keyword Search; Lattice Transduction; OOV Keywords; Speech Recognition; Syllable Decoding; WFST;
Conference_Title :
Spoken Language Technology Workshop (SLT), 2014 IEEE
DOI :
10.1109/SLT.2014.7078623