DocumentCode :
3629823
Title :
Efficient combination of n-gram language models and recognition grammars in real-time LVCSR decoder
Author :
Ales Prazak;Pavel Ircing;Jan Svec;Josef Psutka
Author_Institution :
SpeechTech s.r.o., Plze?, Czech Republic
fYear :
2008
Firstpage :
587
Lastpage :
591
Abstract :
The paper presents a method for incorporation of regular grammars into n-gram language models. Such composite model then benefits from both language modeling formalisms - a grammar yields robust probability estimates for well-defined phrases with fixed structure whereas the n-gram provides better coverage of casual speech. Moreover, the grammar allows adding new words to the phrase pattern while taking advantage of the existing structural (context) information. The proposed method for grammar incorporation allows the use of combined models in our in-house real-time decoder which is designed to work only with standard n-gram language model. The performance of the combined model was tested in the dictation task where a simple grammar was designed for date entries. A statistically significant improvement of WER was achieved.
Keywords :
"Decoding","Natural languages","Training data","Robustness","Testing","Vocabulary","Labeling","Cybernetics","Yield estimation","Speech"
Publisher :
ieee
Conference_Titel :
Signal Processing, 2008. ICSP 2008. 9th International Conference on
ISSN :
2164-5221
Print_ISBN :
978-1-4244-2178-7
Electronic_ISBN :
2164-523X
Type :
conf
DOI :
10.1109/ICOSP.2008.4697201
Filename :
4697201
Link To Document :
بازگشت