DocumentCode :
3546858
Title :
Wavelet transform to hybrid support vector machine and hidden Markov model for speech recognition
Author :
Shao, Yu ; Chang, Chip-Hong
Author_Institution :
Center for Integrated Circuits & Syst., Nanyang Technol. Univ., Singapore
fYear :
2005
fDate :
23-26 May 2005
Firstpage :
3833
Abstract :
Recent work in machine learning has focused on models, such as the support vector machine (SVM), that automatically control generalization and parameterization as part of the overall optimization process. In this paper we construct a wavelet transform to hybrid support vector machine and hidden Markov models (WSVM/HMM) speech recognition system to deal with arbitrary nonlinear functions. The proposed hybrid system has overcome the problem of acoustic modeling in state-of-the-art speech recognition systems that usually relies on HMM with Gaussian emission densities. HMM suffer from intrinsic limitations, mainly due to their arbitrary parametric assumption. Artificial neural networks appear to be a promising alternative, but they have historically failed as a general solution to the acoustic modeling problem. The proposed system has successfully exploited the benefits of the wavelet technique, HMM and SVM in a unified framework. In this hybrid system, the WSVM is trained to estimate the emission probabilities of the states of the HMM. Simulations on benchmark databases show that speech recognition systems built around the hybrid WSVM/HMM provide excellent word recognition ratio and have performance superior to many other systems.
Keywords :
generalisation (artificial intelligence); hidden Markov models; learning (artificial intelligence); nonlinear functions; speech recognition; support vector machines; wavelet transforms; SVM; WSVM/HMM; acoustic modeling; arbitrary nonlinear functions; emission probabilities; generalization; hidden Markov models; hybrid support vector machine; machine learning; parameterization; performance; speech recognition; speech recognition system; training; wavelet transform; word recognition ratio; Acoustic emission; Artificial neural networks; Automatic control; Databases; Hidden Markov models; Machine learning; Speech recognition; State estimation; Support vector machines; Wavelet transforms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems, 2005. ISCAS 2005. IEEE International Symposium on
Print_ISBN :
0-7803-8834-8
Type :
conf
DOI :
10.1109/ISCAS.2005.1465466
Filename :
1465466
Link To Document :
بازگشت