Title :
Hybridization of two stage Multilayer Neural Networks based Bangla ASR incorporating dynamic parameters
Author :
Kotwal, Mohammed Rokibul Alam ; Razzaque, Md Abdur ; Hossen, Arif ; Huda, Mohammad Nurul
Author_Institution :
Dept. of Comput. Sci. & Eng., United Int. Univ., Dhaka, Bangladesh
Abstract :
This paper presents a hybridization of Multilayer Neural Network-based Bangla phoneme recognition method for Automatic Speech Recognition (ASR) incorporating dynamic parameters. The method consists of four stages: at first stage, a multilayer neural network (MLN) converts acoustic features, mel frequency cepstral coefficients (MFCCs), into phoneme probabilities. Phoneme probabilities from the first stage are inserted into second stage MLN for obtaining more accurate phoneme probabilities with reduced context where the third stage computes dynamic (velocity (A) and acceleration (AA)) parameters from the phoneme probabilities by using three point linear regressions (LRs). Finally, the phoneme probabilities, dynamic parameters, A and AA, and the input MFCCs, combined as hybrid features, are fed into a hidden Markov model (HMM) based classifier to obtain more accurate phoneme strings. From the experiments on Bangla speech corpus prepared by us, it is observed that the proposed method provides higher phoneme recognition performance than the existing method. Moreover, it requires a fewer mixture components in the HMMs.
Keywords :
cepstral analysis; hidden Markov models; natural language processing; neural nets; probability; regression analysis; signal classification; speech recognition; Bangla ASR; Bangla phoneme recognition; HMM based classifier; acoustic features; automatic speech recognition; dynamic parameter; hidden Markov model; hybridization; linear regression; mel frequency cepstral coefficient; phoneme probabilities; two stage multilayer neural network; Accuracy; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Nonhomogeneous media; Speech; automatic speech recognition; dynamic parameters; hidden Markov model; linear regression; mel frequency cepstral coefficients; multilayer neural network;
Conference_Titel :
Hybrid Intelligent Systems (HIS), 2011 11th International Conference on
Conference_Location :
Melacca
Print_ISBN :
978-1-4577-2151-9
DOI :
10.1109/HIS.2011.6122099