DocumentCode :
730788
Title :
Improved strategies for a zero oov rate LVCSR system
Author :
Shaik, M. Ali Basha ; Mousa, Amr El-Desoky ; Hahn, Stefan ; Schluter, Ralf ; Ney, Hermann
Author_Institution :
Human Language Technol. & Pattern Recognition - Comput. Sci. Dept., RWTH Aachen Univ., Aachen, Germany
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
5048
Lastpage :
5052
Abstract :
In this work, multiple hierarchical language modeling strategies for a zero OOV rate large vocabulary continuous speech recognition system are investigated. In our previously proposed hierarchical approach, a full-word language model and a context independent character-level LM (CLM) are directly used during search. The novelty of this work is to jointly model the character-level prior and the pronunciation probabilities, to introduce across-word context into the characterlevel LM, and to properly normalize the character-level LM using prefix-tree based normalization for the hierarchical approach. Significant reductions in-terms of word error rates (WER) on the best full-word Quaero Polish LVCSR system are reported.
Keywords :
hierarchical systems; speech recognition; vocabulary; context independent character-level LM; hierarchical language modeling; pronunciation probabilities; vocabulary continuous speech recognition system; word error rates; zero OOV rate LVCSR system; Acoustics; Character recognition; Computational modeling; Context; Context modeling; Speech recognition; Vocabulary; LVCSR; OOV; hierarchical; prefix-tree;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178932
Filename :
7178932
Link To Document :
بازگشت