Hybrid SVM/HMM model for the recognition of Arabic triphones-based continuous speech

Author

Zarrouk, Elyes ; Benayed, Yassine ; Gargouri, Faiez

Author_Institution

MIRACL, Multimedia Inf. Syst. & Adv. Comput. Lab., Univ. of Sfax, Sfax, Tunisia

fYear

2013

fDate

18-21 March 2013

Firstpage

1

Lastpage

7

Abstract

Even if the progress of Hidden Markov Models (HMM) is huge, those models lack a discriminatory ability especially on speech recognition. In order to ameliorate the results of recognition systems, we apply Support Vectors Machine (SVM) as an estimator of posterior probabilities since they are characterized by a high predictive power and discrimination. Moreover, they are based on a structural risk minimization (SRM) where the aim is to set up a classifier that minimizes a bound on the expected risk, rather than the empirical risk. In this paper, we describe the use of the hybrid model SVM/HMM for Arabic triphones-based continuous speech. Furthermore, our work incorporates the stage of preparing language models. It consists in a novel approach for automatic labeling with respect to syntax and grammar rules of the Arabic language. The best results are obtained with the proposed system SVM/HMM when we achieve 76.96% as the best recognition rate of a tested speaker. The speech recognizer was evaluated with ARABIC_DB corpus and performs at 11.42% WER as compared to 13.32% with triphones mixture-Gaussian HMM system.

Keywords

grammars; hidden Markov models; minimisation; natural language processing; pattern classification; probability; risk analysis; speech recognition; support vector machines; ARABIC_DB corpus; Arabic language; Arabic triphones-based continuous speech recognition; SRM; automatic labeling; classifier; expected risk bound minimization; grammar rules; hidden Markov models; hybrid SVM-HMM model; posterior probability estimator; predictive power; structural risk minimization; support vector machine; syntax; Acoustics; Hidden Markov models; Kernel; Labeling; Speech; Speech recognition; Support vector machines; Automatic speech recognition; Support Vector machine; automatic labeling; hybrid system; triphones-based continuous speech;

fLanguage

English

Publisher

ieee

Conference_Titel

Systems, Signals & Devices (SSD), 2013 10th International Multi-Conference on

Conference_Location

Hammamet

Print_ISBN

978-1-4673-6459-1

Electronic_ISBN

978-1-4673-6458-4

Type

conf

DOI

10.1109/SSD.2013.6564036

Filename

6564036