Title :
Which One is Dominant for Neural Network Based Speech Recognition -- Δ or Δ Δ Articulatory Parameters?
Author :
Huda, Mohammad Nurul ; Muhammad, Ghulam ; Hasan, Mohammad Mahedi ; Kotwal, Mohammed Rokibul Alam ; Hassan, Foyzul ; Islam, Gazi Md Moshfiqul ; Hossain, Md Shahadat ; Rahman, Chowdhury Mofizur
Author_Institution :
Dept. of CSE, United Int. Univ., Dhaka, Bangladesh
Abstract :
This paper presents a method that describes the effect of articulatory velocity coefficient (Δ) on neural network based speech recognition. The method consists of three stages: a) two multilayer neural networks (MLNs), where second MLN takes Δ articulatory parameters as input b) Inhibition/Enhancement (In/En) network and c) Gram-Schmidt orthogonalization before connecting with a hidden Markov model (HMM) based classifier. From the experiments on Japanese Newspaper Article Sentences (JNAS), it is shown that velocity coefficient has more effect on the phoneme recognition performance. Moreover, the proposed phoneme recognizer with articulatory velocity coefficient provides higher phoneme accuracy with fewer mixture components in HMMs.
Keywords :
hidden Markov models; neural nets; speech recognition; Gram Schmidt orthogonalization; Japanese newspaper article sentences; articulatory velocity coefficient; hidden Markov model; inhibition-enhancement network; multilayer neural networks; neural network; phoneme recognition performance; speech recognition; Artificial neural networks; Context; Electronic mail; Feature extraction; Hidden Markov models; Speech; Speech recognition; Gram-Schmidt orthogonalization; Inhibition/Enhancement network; articulatory acceleration coefficient; articulatory velocity coefficient; multilayer neural networks;
Conference_Titel :
Intelligent Computing and Cognitive Informatics (ICICCI), 2010 International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4244-6640-5
Electronic_ISBN :
978-1-4244-6641-2
DOI :
10.1109/ICICCI.2010.38