DocumentCode :
3568173
Title :
Russian sub-word based speech recognition using pocketsphinx engine
Author :
Zablotskiy, Sergey ; Sidorov, Maxim
Author_Institution :
Institute of Communications Engineering, Ulm University, Germany
Volume :
2
fYear :
2014
Firstpage :
840
Lastpage :
844
Abstract :
Russian is a synthetic language with a large morpheme-per-word ratio and highly inflective nature. These two peculiarities increase the lexicon size for Russian automatic speech recognition (ASR) by tens of times in comparison to that for English covering the same out-of-vocabulary (OOV) rate. The employment of sub-word units is a widely spread state-of-the-art approach to reduce the abundant lexicon and lower the perplexity (PP) of the language model. The choice of sub-word units affects the accuracy of the entire speech recognition system, its performance as well as the complexity of the spoken phrase synthesis. Here, different recognition units are investigated using pocketsphinx-engine while recognizing the vocabulary of several million word forms. A designed text normalization approach is also briefly presented. This rule-based algorithm allows keeping diverse Russian abbreviations and numerals in the language model (LM) and avoiding the statistics distortion. The approach is directly applicable and useful for Russian text-to-speech translation as well.
Keywords :
Acoustics; Dictionaries; Educational institutions; Smoothing methods; Speech; Speech recognition; Vocabulary; LVCSR; Russian; Sub-words;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Informatics in Control, Automation and Robotics (ICINCO), 2014 11th International Conference on
Type :
conf
Filename :
7049704
Link To Document :
بازگشت