DocumentCode :
3398157
Title :
Automatic speech recognition for Bangla digits
Author :
Muhammad, Ghulam ; Alotaibi, Yousef A. ; Huda, Mohammad Nurul
Author_Institution :
Coll. of Comput. & Inf. Sci., King Saud Univ., Riyadh, Saudi Arabia
fYear :
2009
fDate :
21-23 Dec. 2009
Firstpage :
379
Lastpage :
383
Abstract :
In this paper, we introduce a system for Bangla digit automatic speech recognition (ASR). Though Bangla is one of the largely spoken languages in the world, only a few works on Bangla ASR can be found in the literature, especially on Bangladeshi accented Bangla. In this work, the corpus is collected from natives in Bangladesh. Mel-frequency cepstral coefficients (MFCCs) based features and hidden Markov model (HMM) based classifiers are used for recognition. Experimental results show comparatively high recognition performance (more than 95%) for first six digits (0 - 5) and low performance (less than 90%) for the next four digits (6 - 9). We notice two confused pairs of digits: one with (6) and (9), and the other with (7) and (8), in the experiments. We also find that different dialects in Bangladesh have a greater role on this confusion.
Keywords :
hidden Markov models; natural language processing; speech recognition; Bangla digits; Bangladesh; Mel-frequency cepstral coefficients; automatic speech recognition; hidden Markov model; recognition performance; Artificial neural networks; Automatic speech recognition; Cepstral analysis; Databases; Educational institutions; Hidden Markov models; Information technology; Natural languages; Speech recognition; Speech synthesis; Automatic speech recognition; Bangla digit; Bangla phoneme; hidden Markov model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computers and Information Technology, 2009. ICCIT '09. 12th International Conference on
Conference_Location :
Dhaka
Print_ISBN :
978-1-4244-6281-0
Type :
conf
DOI :
10.1109/ICCIT.2009.5407267
Filename :
5407267
Link To Document :
بازگشت