DocumentCode :
695050
Title :
Towards Building an Intelligent Voice System for Kazakh: Acoustic Database and System Design
Author :
Yessenbayev, Zhandos ; Karabalayeva, Muslima ; Shamayeva, Firuza
Author_Institution :
Dept. of Comput. Sci., Nazarbayev Univ., Astana, Kazakhstan
fYear :
2013
fDate :
10-13 Sept. 2013
Firstpage :
393
Lastpage :
397
Abstract :
In this paper we describe our initiative to build an intelligent voice system for Kazakh that operates over telephone lines. In particular, we collected the first acoustic database of Kazakh telephone speech, containing common words and phrases uttered by 169 native speakers, to train an acoustic model. The database contains more than 17 hours of speech and is balanced by gender, region, and age group. Training was performed with the CMU Sphinx toolkits, using context-dependent tied-state continuous hidden Markov models with 8 Gaussian mixtures per state. Experiments show that the best word error rate (WER) on test data, 4.1%, is obtained with 2000 senones and 23-dimensional feature vectors. This model was then used in the system's implementation. In designing the system, we focused on a friendly graphical user interface and all-in-one functionality. The system is intended to support easy and fast deployment of speech-enabled applications in industry, government, and educational institutions.
Keywords :
Gaussian processes; acoustic signal processing; graphical user interfaces; mixture models; speech recognition; vectors; CMU Sphinx Toolkits; Gaussian mixtures; Kazakh telephone speech; acoustic database; all-in-one functionality; context-dependent tied-state continuous hidden Markov models; feature vectors; graphical user interface; intelligent voice system; speech-enabled applications; system design; telephone lines; Acoustics; Databases; Educational institutions; Graphical user interfaces; Hidden Markov models; Speech; Speech recognition; acoustic database; acoustic model for Kazakh; intelligent voice system; telephone speech recognition;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Modelling and Simulation (EUROSIM), 2013 8th EUROSIM Congress on
Conference_Location :
Cardiff
Type :
conf
DOI :
10.1109/EUROSIM.2013.75
Filename :
7004975