مرکز منطقه ای اطلاع رساني علوم و فناوري - Continuous density Hidden Markov Model for context dependent Hindi speech recognition

DocumentCode :

1646915

Title :

Continuous density Hidden Markov Model for context dependent Hindi speech recognition

Author :

Sinha, S. ; Agrawal, S.S. ; Jain, Abhishek

Author_Institution :

Dept. of IT, Birla Inst. of Technol., Ranchi, India

fYear :

2013

Firstpage :

1953

Lastpage :

1958

Abstract :

With the advancement in technology and the inherent advantage of voice based communication due to its variability, speed and convenience has driven attention towards mechanical recognition of speech. Literature survey of research in this area shows that almost every system uses Gaussian Mixture Hidden Markov model for optimal performance in recognition of speech. Evaluation of Gaussian likelihood dominates the total computational load in using this statistical approach. The appropriate selection of Gaussian mixture is very important. Current choice of mixture component is arbitrary with little justification. Also the standard set for European languages can not be used in Hindi speech recognition due to mismatch in database size of the languages. Parameter estimation with too many or few component may inappropriately estimate the mixture model. Therefore, number of mixture is important for expectation maximization process. In this research work, the authors estimate number of Gaussian mixture component for Hindi database based upon the size of vocabulary. MFCC and PLP features along with its extended version has been used as speech feature. HLDA is applied for feature reduction while using extended features.

Keywords :

Gaussian processes; audio databases; expectation-maximisation algorithm; feature extraction; hidden Markov models; natural language processing; speech recognition; voice communication; Gaussian likelihood speech; Gaussian mixture component; Gaussian mixture hidden Markov model; Gaussian mixture selection; HLDA; Hindi database; MFCC feature; Mel frequency cepstral coefficients; PLP feature; context dependent Hindi speech recognition; continuous density hidden Markov model; database size mismatch; expectation maximization process; feature reduction; heteroscedastic discriminant analysis; mechanical speech evaluation; perceptual linear prediction coefficients; speech feature; statistical approach; vocabulary size; voice based communication; Databases; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Speech recognition; Vectors; GMM; HLDA; Hindi speech Recognition; MFCC; PLP;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Advances in Computing, Communications and Informatics (ICACCI), 2013 International Conference on

Conference_Location :

Mysore

Print_ISBN :

978-1-4799-2432-5

Type :

conf

DOI :

10.1109/ICACCI.2013.6637481

Filename :

6637481

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1646915