Title :
Phoneme set selection for Russian speech recognition
Author :
Vazhenina, Daria ; Markov, Konstantin
Author_Institution :
Human Interface Lab., Univ. of Aizu, Aizu, Japan
Abstract :
In this paper, we describe a method for phoneme set selection that combines phonological and statistical information, and we apply it to Russian speech recognition. For Russian, the phoneme sets currently in use are mostly rule-based or heuristically derived from the standard SAMPA or IPA phonetic alphabets. For some other languages, however, statistical methods have proved useful for phoneme set optimization. In Russian, almost all phonemes come in pairs: consonants can be hard or soft, and vowels stressed or unstressed. We start with a large phoneme set and gradually reduce it by merging phoneme pairs. The decision on which pair to merge is based on phonetic pronunciation rules and on statistics obtained from the confusion matrix of phoneme recognition experiments. Applying this approach to the IPA Russian phonetic set, we first reduced it to 47 phonemes, which served as the initial set for subsequent speech model training. Based on the phoneme confusion results, we then derived several further phoneme sets of different sizes, down to 27 phonemes. Speech recognition experiments using these sets showed that the reduced phoneme sets are better than the initial set for phoneme recognition and as good for word-level speech recognition.
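The pair-merging step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not state the exact merging criterion, so the symmetric confusion rate used here, the function names, the phoneme labels, and the toy counts are all assumptions made for illustration. Candidate pairs stand in for the phonological constraint (hard/soft consonants, stressed/unstressed vowels), and the confusion matrix supplies the statistical ranking.

```python
from typing import Dict, List, Tuple

Confusion = Dict[str, Dict[str, int]]  # conf[reference][recognized] = count


def symmetric_confusion(conf: Confusion, a: str, b: str) -> float:
    """Share of tokens of a or b that were recognized as the other one.

    Assumed measure; the abstract does not specify the statistic used.
    """
    total = sum(conf[a].values()) + sum(conf[b].values())
    if total == 0:
        return 0.0
    return (conf[a].get(b, 0) + conf[b].get(a, 0)) / total


def select_merges(conf: Confusion,
                  candidate_pairs: List[Tuple[str, str]],
                  n_merges: int) -> List[Tuple[str, str]]:
    """Pick the n_merges most-confused pairs among the allowed candidates."""
    ranked = sorted(candidate_pairs,
                    key=lambda pair: symmetric_confusion(conf, *pair),
                    reverse=True)
    return ranked[:n_merges]


def merge_map(phonemes: List[str],
              merges: List[Tuple[str, str]]) -> Dict[str, str]:
    """Map every phoneme to its label in the reduced set (second -> first)."""
    mapping = {p: p for p in phonemes}
    for keep, drop in merges:
        mapping[drop] = keep
    return mapping


# Hypothetical labels and counts: "t'" is a soft consonant, "a!" a stressed vowel.
phonemes = ["t", "t'", "a", "a!"]
conf = {
    "t":  {"t": 80, "t'": 20},
    "t'": {"t": 30, "t'": 70},
    "a":  {"a": 95, "a!": 5},
    "a!": {"a": 5, "a!": 95},
}
# Only phonologically paired phonemes are merge candidates.
pairs = [("t", "t'"), ("a", "a!")]

merges = select_merges(conf, pairs, n_merges=1)
reduced = merge_map(phonemes, merges)
```

In this toy setting the hard/soft consonant pair is confused far more often than the vowel pair, so it is merged first. Repeating the selection with a larger `n_merges` would yield progressively smaller sets, in the spirit of the reduction from 47 to 27 phonemes reported in the abstract.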
Keywords :
set theory; speech recognition; statistical analysis; Russian language; Russian speech recognition; phoneme set selection; phonetic pronunciation rule; phonological information; statistical information; statistical method; word level speech recognition; Computational modeling; Electronic publishing; Encyclopedias; Internet; Speech; Phoneme set
Conference_Title :
Natural Language Processing and Knowledge Engineering (NLP-KE), 2011 7th International Conference on
Conference_Location :
Tokushima
Print_ISBN :
978-1-61284-729-0
DOI :
10.1109/NLPKE.2011.6138246