Title :
Evaluation of Bangla word recognition performance using acoustic features
Author :
Hossain, Md Shahadat ; Lisa, Nusrat Jahan ; Islam, Gazi Md Moshfiqul ; Hassan, Foyzul ; Hasan, Mohammad Mahedi ; Rahman, Sharif Mohammad Musfiqur ; Kotwal, Mohammed Rokibul Alam ; Huda, Mohammad Nurul
Author_Institution :
United Int. Univ., Dhaka, Bangladesh
Abstract :
In this paper, we prepare a medium-sized Bangla speech corpus and compare the performance of different acoustic features for Bangla word recognition. Most Bangla automatic speech recognition (ASR) systems use a small number of speakers, but this work involves 40 speakers selected from a wide area of Bangladesh, where Bangla is spoken as a native language. In the experiments, mel-frequency cepstral coefficients (MFCCs) and local features (LFs) are fed as input to hidden Markov model (HMM) based classifiers to obtain word recognition performance. The experiments show that the 39-dimensional MFCC-based method (MFCC39) provides a higher word correct rate (WCR) than the other methods investigated. Moreover, the MFCC39-based method achieves a higher WCR with fewer mixture components in the HMM.
Keywords :
acoustic signal processing; hidden Markov models; natural language processing; signal classification; speech recognition; ASR system; Bangla speech corpus; Bangla word recognition performance; Bangladesh; HMM based classifier; MFCC; acoustic feature; automatic speech recognition; hidden Markov model; local feature; mel-frequency cepstral coefficients; word correct rate; Computer applications; Conferences; Industrial electronics
Conference_Title :
Computer Applications and Industrial Electronics (ICCAIE), 2010 International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4244-9054-7
DOI :
10.1109/ICCAIE.2010.5735130