DocumentCode
1611260
Title
Arabic Speech Recognition by Bionic Wavelet Transform and MFCC using a Multi Layer Perceptron
Author
Ben Nasr, Mounir ; Talbi, Mohamed ; Cherif, Adnane
Author_Institution
Dept. of Electron., Fac. of Sci. of Tunis, Tunis, Tunisia
fYear
2012
Firstpage
803
Lastpage
808
Abstract
In this paper, we have proposed a new technique of Arabic Speech Recognition (ASR) with monolocutor and a reduced vocabulary. This technique consists at first step in using our proper speech database containing Arabic speech words which are recorded by a mono-locutor. The second step consists in features extracting from those recorded words. The third step is to classify those extracted features. This extraction is performed by computing at first step, the Mel Frequency Cepstral Coefficients (MFCCs) from each recorded word, then the Bionic Wavelet Transform (BWT) is applied to the vector obtained from the concatenation of the computed MFCCs. The obtained bionic wavelet coefficients are then concatenated to construct one input of a Multi-Layer Perceptual (MLP) used for features classification. In the MLP learning and test phases, we have used eleven Arabic words and each of them is repeated twenty five times by the same locutor. A simulation program is performed to test the performance of the proposed technique and shows a classification rate equals to 99.39%. We have also introduced a module of denoising as a phase of preprocessing. In this denoising module, we have treated the case of white noise and we have used the Wiener filtering. In case of SNR=5dB, the obtained recognition rate is equals to 78.7% and in case of SNR=10dB, it is equals to 93.9%.
Keywords
Wiener filters; multilayer perceptrons; natural language processing; signal classification; signal denoising; speech processing; vectors; wavelet transforms; white noise; ASR; Arabic speech recognition; BWT; MFCC; MLP; Mel frequency cepstral coefficients; Wiener filtering; bionic wavelet transform; denoising; features classification; multilayer perceptron; speech database; vector; white noise; Feature extraction; Mel frequency cepstral coefficient; Noise reduction; Speech; Speech recognition; Wavelet transforms; Bionic Wavelet Transforms (BWT); Feature Extraction; Mel-Frequency Cepstral Coefficients (MFCCs); Multi-Layer Perceptron (MLP); Speech Recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Sciences of Electronics, Technologies of Information and Telecommunications (SETIT), 2012 6th International Conference on
Conference_Location
Sousse
Print_ISBN
978-1-4673-1657-6
Type
conf
DOI
10.1109/SETIT.2012.6482017
Filename
6482017
Link To Document