DocumentCode
621977
Title
Hybrid SVM/HMM model for the recognition of Arabic triphones-based continuous speech
Author
Zarrouk, Elyes ; Benayed, Yassine ; Gargouri, Faiez
Author_Institution
MIRACL, Multimedia Inf. Syst. & Adv. Comput. Lab., Univ. of Sfax, Sfax, Tunisia
fYear
2013
fDate
18-21 March 2013
Firstpage
1
Lastpage
7
Abstract
Even if the progress of Hidden Markov Models (HMM) is huge, those models lack a discriminatory ability especially on speech recognition. In order to ameliorate the results of recognition systems, we apply Support Vectors Machine (SVM) as an estimator of posterior probabilities since they are characterized by a high predictive power and discrimination. Moreover, they are based on a structural risk minimization (SRM) where the aim is to set up a classifier that minimizes a bound on the expected risk, rather than the empirical risk. In this paper, we describe the use of the hybrid model SVM/HMM for Arabic triphones-based continuous speech. Furthermore, our work incorporates the stage of preparing language models. It consists in a novel approach for automatic labeling with respect to syntax and grammar rules of the Arabic language. The best results are obtained with the proposed system SVM/HMM when we achieve 76.96% as the best recognition rate of a tested speaker. The speech recognizer was evaluated with ARABIC_DB corpus and performs at 11.42% WER as compared to 13.32% with triphones mixture-Gaussian HMM system.
Keywords
grammars; hidden Markov models; minimisation; natural language processing; pattern classification; probability; risk analysis; speech recognition; support vector machines; ARABIC_DB corpus; Arabic language; Arabic triphones-based continuous speech recognition; SRM; automatic labeling; classifier; expected risk bound minimization; grammar rules; hidden Markov models; hybrid SVM-HMM model; posterior probability estimator; predictive power; structural risk minimization; support vector machine; syntax; Acoustics; Hidden Markov models; Kernel; Labeling; Speech; Speech recognition; Support vector machines; Automatic speech recognition; Support Vector machine; automatic labeling; hybrid system; triphones-based continuous speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Signals & Devices (SSD), 2013 10th International Multi-Conference on
Conference_Location
Hammamet
Print_ISBN
978-1-4673-6459-1
Electronic_ISBN
978-1-4673-6458-4
Type
conf
DOI
10.1109/SSD.2013.6564036
Filename
6564036
Link To Document