Title :
Efficient adaptations of the SphinxTrain procedure for building a robust ASR system in Slovak
Author :
Kacur, J. ; Vojtko, J.
Author_Institution :
Dept. of Telecommun., Slovak Univ. of Technol., Bratislava
Abstract :
This article discusses the practical and theoretical aspects of building a Slovak ASR system using the SPHINX system and its SphinxTrain adaptation procedure for context-independent (CI) and context-dependent (CD) HMM models. The issues addressed range from the optimal number of states per model, through the number of tied states for context-dependent HMMs and the number of Gaussian mixtures, to training scenarios involving spelled recordings and background models. All experiments were carried out on the MOBILDAT-SK speech database, which contains 32500 recordings from 1100 speakers. The resulting CI and CD HMM models achieved a WER of around 5% for application words, which qualifies them for use in practical applications. Furthermore, the proposed and implemented modifications to the classical SphinxTrain procedure for the given database and the Slovak language also improved the overall results.
Keywords :
Gaussian processes; hidden Markov models; natural language processing; speech recognition; Gaussian mixtures; HMM model; SPHINX system; Slovak ASR system; SphinxTrain procedure; context dependent HMM; robust ASR system; Automatic speech recognition; CD recording; Context modeling; Databases; Decision trees; Information technology; Robustness; Speech processing; HMM; MobilDat; SPHINX; SphinxTrain
Conference_Title :
Systems, Signals and Image Processing, 2008. IWSSIP 2008. 15th International Conference on
Conference_Location :
Bratislava
Print_ISBN :
978-80-227-2856-0
Electronic_ISBN :
978-80-227-2880-5
DOI :
10.1109/IWSSIP.2008.4604352