DocumentCode :
2369504
Title :
On HMM Speech Recognition Based on Complex Speech Analysis
Author :
Kinjo, Tatsuhiko ; Funaki, Keiichi
Author_Institution :
Graduate Sch. of Sci. & Eng., Ryukyus Univ., Okinawa
fYear :
2006
fDate :
6-10 Nov. 2006
Firstpage :
3477
Lastpage :
3480
Abstract :
In speech recognition, LPC cepstral coefficients (LPCCs) derived from linear prediction and MFCCs derived from a Mel-frequency filter bank are widely used as features, and the choice of feature extraction determines recognition performance. However, neither is regarded as the best possible feature extraction. In this paper, we introduce complex speech analysis of the analytic speech signal into HMM speech recognition. Complex speech analysis can estimate the speech spectrum more accurately at low frequencies; as a result, it is expected to perform well as a feature extractor for speech recognition. MMSE-based time-varying complex AR speech analysis is adopted, and the estimated complex parameters are converted to LPCCs and MFCCs, which serve as feature vectors for HTK (HMM tool kit) in order to realize HMM speech recognition. Continuous speech recognition experiments with the converted LPCCs and MFCCs showed that the complex speech analysis method did not perform better than the real one.
Keywords :
autoregressive processes; feature extraction; filtering theory; hidden Markov models; least mean squares methods; parameter estimation; speech recognition; HMM; HMM tool kit; LPC cepstrum; MFCC; MMSE; Mel-frequency filter bank; analytic speech signal; complex parameters estimation; complex speech analysis; feature extraction; speech recognition; speech spectrum; time-varying complex AR speech analysis; Cepstral analysis; Cepstrum; Feature extraction; Filter bank; Frequency estimation; Hidden Markov models; Linear predictive coding; Mel frequency cepstral coefficient; Speech analysis; Speech recognition;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
IEEE Industrial Electronics, IECON 2006 - 32nd Annual Conference on
Conference_Location :
Paris
ISSN :
1553-572X
Print_ISBN :
1-4244-0390-1
Type :
conf
DOI :
10.1109/IECON.2006.347837
Filename :
4153275