DocumentCode :
1646915
Title :
Continuous density Hidden Markov Model for context dependent Hindi speech recognition
Author :
Sinha, S. ; Agrawal, S.S. ; Jain, Abhishek
Author_Institution :
Dept. of IT, Birla Inst. of Technol., Ranchi, India
fYear :
2013
Firstpage :
1953
Lastpage :
1958
Abstract :
With the advancement in technology and the inherent advantage of voice based communication due to its variability, speed and convenience has driven attention towards mechanical recognition of speech. Literature survey of research in this area shows that almost every system uses Gaussian Mixture Hidden Markov model for optimal performance in recognition of speech. Evaluation of Gaussian likelihood dominates the total computational load in using this statistical approach. The appropriate selection of Gaussian mixture is very important. Current choice of mixture component is arbitrary with little justification. Also the standard set for European languages can not be used in Hindi speech recognition due to mismatch in database size of the languages. Parameter estimation with too many or few component may inappropriately estimate the mixture model. Therefore, number of mixture is important for expectation maximization process. In this research work, the authors estimate number of Gaussian mixture component for Hindi database based upon the size of vocabulary. MFCC and PLP features along with its extended version has been used as speech feature. HLDA is applied for feature reduction while using extended features.
Keywords :
Gaussian processes; audio databases; expectation-maximisation algorithm; feature extraction; hidden Markov models; natural language processing; speech recognition; voice communication; Gaussian likelihood speech; Gaussian mixture component; Gaussian mixture hidden Markov model; Gaussian mixture selection; HLDA; Hindi database; MFCC feature; Mel frequency cepstral coefficients; PLP feature; context dependent Hindi speech recognition; continuous density hidden Markov model; database size mismatch; expectation maximization process; feature reduction; heteroscedastic discriminant analysis; mechanical speech evaluation; perceptual linear prediction coefficients; speech feature; statistical approach; vocabulary size; voice based communication; Databases; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Speech recognition; Vectors; GMM; HLDA; Hindi speech Recognition; MFCC; PLP;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Computing, Communications and Informatics (ICACCI), 2013 International Conference on
Conference_Location :
Mysore
Print_ISBN :
978-1-4799-2432-5
Type :
conf
DOI :
10.1109/ICACCI.2013.6637481
Filename :
6637481
Link To Document :
بازگشت