Title :
A comparison of Gaussian Mixture Modeling (GMM) and Hidden Markov Modeling (HMM) based approaches for Automatic Phoneme Recognition in Kannada
Author :
Kannadaguli, Prashanth ; Bhat, Vidya
Author_Institution :
Dept. of Electron. & Commun. Eng., Manipal Inst. of Technol., Manipal, India
Abstract :
We build and compare phoneme recognition systems based on Gaussian Mixture Modeling (GMM) which is a static modeling scheme and Hidden Markov Modeling (HMM) which is a Dynamic modeling scheme. Both models were built by using Stochastic pattern recognition and Acoustic phonetic schemes to recognise phonemes. Since our native language is Kannada, a rich South Indian Language, we have used 15 Kannada phonemes to train and test these models. Since Mel - Frequency Cepstral Coefficients (MFCC) are well known Acoustic features of speech, we have used the same in speech feature extraction. Finally performance analysis of both models in terms of Phoneme Error Rate (PER) justifies the fact that Dynamic modeling yields better results over Static modeling and can be used in developing Automatic Speech Recognition systems.
Keywords :
Gaussian processes; cepstral analysis; feature extraction; hidden Markov models; natural language processing; speech recognition; GMM; Gaussian mixture modeling; Kannada phonemes; MFCC; PER; South Indian Language; acoustic phonetic schemes; acoustic speech feature extraction; automatic phoneme recognition; automatic speech recognition systems; dynamic modeling scheme; hidden Markov modeling; mel-frequency cepstral coefficients; phoneme error rate; stochastic pattern recognition; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Speech recognition; Testing; Training; GMM; HMM; Kannada; MFCC; PER; Pattern Recognition; Phoneme Modeling;
Conference_Titel :
Signal Processing and Communication (ICSC), 2015 International Conference on
Conference_Location :
Noida
Print_ISBN :
978-1-4799-6760-5
DOI :
10.1109/ICSPCom.2015.7150658