DocumentCode :
2656087
Title :
Bangla phoneme recognition for ASR using multilayer neural network
Author :
Kotwal, Mohammed Rokibul Alam ; Banik, Manoj ; Eity, Qamrum Nahar ; Huda, Mohammad Nurul ; Muhammad, Ghulam ; Alotaibi, Yousef Ajami
Author_Institution :
Dept. of CSE, United Int. Univ., Dhaka, Bangladesh
fYear :
2010
fDate :
23-25 Dec. 2010
Firstpage :
103
Lastpage :
107
Abstract :
This paper presents a Bangla phoneme recognition method for Automatic Speech Recognition (ASR). The method consists of two stages: i) a multilayer neural network (MLN), which converts acoustic features, mel frequency cepstral coefficients (MFCCs), into phoneme probabilities and ii) the phoneme probabilities obtained from the first stage and corresponding Δ and ΔΔ parameters calculated by linear regression (LR) are inserted into a hidden Markov model (HMM) based classifier to obtain more accurate phoneme strings. From the experiments on Bangla speech corpus prepared by us, it is observed that the proposed method provides higher phoneme recognition performance than the existing method. Moreover, it requires a fewer mixture components in the HMMs.
Keywords :
hidden Markov models; neural nets; regression analysis; speech recognition; Bangla phoneme recognition; automatic speech recognition; hidden Markov model; linear regression; mel frequency cepstral coefficients; multilayer neural network; Computers; Conferences; Information technology; Acoustic Features; Automatic Speech Recognition; Hidden Markov Models; Multilayer Neural Network; Phoneme Probabilities;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Technology (ICCIT), 2010 13th International Conference on
Conference_Location :
Dhaka
Print_ISBN :
978-1-4244-8496-6
Type :
conf
DOI :
10.1109/ICCITECHN.2010.5723837
Filename :
5723837
Link To Document :
بازگشت