Title :
DPF-based japanese phoneme recognition using tandem MLNs
Author :
Kotwal, Mohammed Rokibul Alam ; Islam, Gazi Md Moshfiqul ; Hassan, Foyzul ; Muhammad, Ghulam ; Banik, Manoj ; Hossain, Md Shahadat ; Hasan, Mohammad Mahedi ; Huda, Mohammad Nurul
Author_Institution :
Dept. of CSE, United Int. Univ., Dhaka, Bangladesh
Abstract :
This paper presents a method for automatic phoneme recognition for Japanese language using tandem MLNs. The method comprises three stages: (i) multilayer neural network (MLN) that converts acoustic features into distinctive phonetic features DPFs, (ii) MLN that combines DPFs and acoustic features as input and generates a 45 dimensional DPF vector with less context effect and (iii) the 45 dimensional feature vector generated by the second MLN are inserted into a hidden Markov model (HMM) based classifier to obtain more accurate phoneme strings from the input speech. From the experiments on Japanese Newspaper Article Sentences (JNAS), it is observed that the proposed method provides a higher phoneme correct rate and improves phoneme accuracy tremendously over the method based on a single MLN. Moreover, it requires fewer mixture components in HMMs.
Keywords :
hidden Markov models; natural language processing; neural nets; speech recognition; DPF based Japanese phoneme recognition; Japanese newspaper article sentences; hidden Markov model; multilayer neural network; tandem MLN; Electronic mail; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Speech recognition; distinctive phonetic features; hidden Markov model; multilayer neural network;
Conference_Titel :
Hybrid Intelligent Systems (HIS), 2010 10th International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4244-7363-2
DOI :
10.1109/HIS.2010.5600078