Regional Information Center for Science and Technology - Syllable based keyword search: Transducing syllable lattices to word lattices

DocumentCode :

3585076

Title :

Syllable based keyword search: Transducing syllable lattices to word lattices

Author :

Su, Hang ; Hieronymus, James ; He, Yanzhang ; Fosler-Lussier, Eric ; Wegmann, Steven

Author_Institution :

Int. Comput. Sci. Inst., Berkeley, CA, USA

Year :

2014

Firstpage :

489

Lastpage :

494

Abstract :

This paper presents a weighted finite state transducer (WFST) based syllable decoding and transduction framework for keyword search (KWS). Context-dependent acoustic phone models are trained from word-level forced alignments. Syllable decoding is then performed, with lattices generated using a syllable lexicon and a syllable language model (LM). To handle out-of-vocabulary (OOV) keywords, pronunciations are produced using a grapheme-to-syllable (G2S) system. A syllable-to-word lexical transducer containing both in-vocabulary (IV) and OOV keywords is then constructed and composed with a keyword-boosted LM transducer. The composed transducer is used to transduce syllable lattices to word lattices for the final KWS. We show that our method can effectively perform KWS on both IV and OOV keywords, and that it yields an improvement of up to 0.03 in Actual Term-Weighted Value (ATWV) over searching for keywords directly in subword lattices. Word Error Rates (WER) and KWS results are reported for three different languages.
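The transduction step described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation and does not use a WFST toolkit: it is a plain-Python approximation that assumes the syllable lattice is an acyclic list of arcs, that the syllable-to-word lexicon is a dictionary from syllable tuples to words, and that the keyword boost is a simple cost discount. The function name `transduce`, the arc layout, and all example data are hypothetical.

```python
from collections import defaultdict


def transduce(syl_arcs, lexicon, keyword_boost=None):
    """Map syllable-lattice arcs to word-lattice arcs (toy sketch).

    syl_arcs: list of (src, dst, syllable, cost) over an acyclic lattice.
    lexicon: dict mapping a tuple of syllables to a word (assumed format).
    keyword_boost: dict mapping a word to a cost discount (assumed format).
    """
    keyword_boost = keyword_boost or {}
    out = defaultdict(list)
    for src, dst, syl, cost in syl_arcs:
        out[src].append((dst, syl, cost))

    max_len = max(len(pron) for pron in lexicon)
    word_arcs = set()

    def walk(start, state, syls, cost):
        # Emit a word arc whenever the syllables read so far spell a word.
        if syls in lexicon:
            word = lexicon[syls]
            boosted = cost - keyword_boost.get(word, 0.0)
            word_arcs.add((start, state, word, boosted))
        # No pronunciation is longer than max_len syllables, so stop here.
        if len(syls) == max_len:
            return
        for dst, syl, arc_cost in out[state]:
            walk(start, dst, syls + (syl,), cost + arc_cost)

    for start in list(out):
        walk(start, start, (), 0.0)
    return sorted(word_arcs)


# Hypothetical syllable lattice with two competing arcs between states 0 and 1.
syl_arcs = [
    (0, 1, "key", 1.0),
    (0, 1, "kee", 1.5),
    (1, 2, "word", 0.5),
    (2, 3, "search", 1.0),
]
# Hypothetical lexicon; in the paper, OOV entries would come from a G2S system.
lexicon = {
    ("key",): "key",
    ("word",): "word",
    ("key", "word"): "keyword",
    ("search",): "search",
}
word_lattice = transduce(syl_arcs, lexicon, keyword_boost={"keyword": 0.5})
for arc in word_lattice:
    print(arc)
```

In a real system the lexicon and the keyword-boosted LM would be compiled into transducers and composed with the lattice, so that LM scores and alternative segmentations are handled by the composition itself rather than by the explicit path walk used here.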

Keywords :

acoustic signal processing; finite state machines; information retrieval; natural language processing; speech coding; vocabulary; ATWV; G2S system; IV keywords; KWS; LM; OOV keywords; WFST; acoustic context dependent phone models; actual term-weighted value; grapheme-to-syllable system; in-vocabulary keywords; keyword-boosted LM transducer; language model; out-of-vocabulary keywords; syllable based keyword search; syllable decoding; syllable lattices-word lattices transduction; syllable lexicon; syllable-word lexical transducer; weighted finite state transducer based syllable decoding; word error rates; word forced alignments; Acoustics; Decoding; Hidden Markov models; Lattices; Training; Training data; Transducers; Keyword Search; Lattice Transduction; OOV Keywords; Speech Recognition; Syllable Decoding; WFST;

Language :

English

Publisher :

IEEE

Conference_Title :

Spoken Language Technology Workshop (SLT), 2014 IEEE

Type :

conf

DOI :

10.1109/SLT.2014.7078623

Filename :

7078623

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3585076