DocumentCode :
1611260
Title :
Arabic Speech Recognition by Bionic Wavelet Transform and MFCC using a Multi Layer Perceptron
Author :
Ben Nasr, Mounir ; Talbi, Mohamed ; Cherif, Adnane
Author_Institution :
Dept. of Electron., Fac. of Sci. of Tunis, Tunis, Tunisia
fYear :
2012
Firstpage :
803
Lastpage :
808
Abstract :
In this paper, we have proposed a new technique of Arabic Speech Recognition (ASR) with monolocutor and a reduced vocabulary. This technique consists at first step in using our proper speech database containing Arabic speech words which are recorded by a mono-locutor. The second step consists in features extracting from those recorded words. The third step is to classify those extracted features. This extraction is performed by computing at first step, the Mel Frequency Cepstral Coefficients (MFCCs) from each recorded word, then the Bionic Wavelet Transform (BWT) is applied to the vector obtained from the concatenation of the computed MFCCs. The obtained bionic wavelet coefficients are then concatenated to construct one input of a Multi-Layer Perceptual (MLP) used for features classification. In the MLP learning and test phases, we have used eleven Arabic words and each of them is repeated twenty five times by the same locutor. A simulation program is performed to test the performance of the proposed technique and shows a classification rate equals to 99.39%. We have also introduced a module of denoising as a phase of preprocessing. In this denoising module, we have treated the case of white noise and we have used the Wiener filtering. In case of SNR=5dB, the obtained recognition rate is equals to 78.7% and in case of SNR=10dB, it is equals to 93.9%.
Keywords :
Wiener filters; multilayer perceptrons; natural language processing; signal classification; signal denoising; speech processing; vectors; wavelet transforms; white noise; ASR; Arabic speech recognition; BWT; MFCC; MLP; Mel frequency cepstral coefficients; Wiener filtering; bionic wavelet transform; denoising; features classification; multilayer perceptron; speech database; vector; white noise; Feature extraction; Mel frequency cepstral coefficient; Noise reduction; Speech; Speech recognition; Wavelet transforms; Bionic Wavelet Transforms (BWT); Feature Extraction; Mel-Frequency Cepstral Coefficients (MFCCs); Multi-Layer Perceptron (MLP); Speech Recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Sciences of Electronics, Technologies of Information and Telecommunications (SETIT), 2012 6th International Conference on
Conference_Location :
Sousse
Print_ISBN :
978-1-4673-1657-6
Type :
conf
DOI :
10.1109/SETIT.2012.6482017
Filename :
6482017
Link To Document :
بازگشت