• DocumentCode
    1611260
  • Title

    Arabic Speech Recognition by Bionic Wavelet Transform and MFCC using a Multi Layer Perceptron

  • Author

    Ben Nasr, Mounir ; Talbi, Mohamed ; Cherif, Adnane

  • Author_Institution
    Dept. of Electron., Fac. of Sci. of Tunis, Tunis, Tunisia
  • fYear
    2012
  • Firstpage
    803
  • Lastpage
    808
  • Abstract
    In this paper, we have proposed a new technique of Arabic Speech Recognition (ASR) with monolocutor and a reduced vocabulary. This technique consists at first step in using our proper speech database containing Arabic speech words which are recorded by a mono-locutor. The second step consists in features extracting from those recorded words. The third step is to classify those extracted features. This extraction is performed by computing at first step, the Mel Frequency Cepstral Coefficients (MFCCs) from each recorded word, then the Bionic Wavelet Transform (BWT) is applied to the vector obtained from the concatenation of the computed MFCCs. The obtained bionic wavelet coefficients are then concatenated to construct one input of a Multi-Layer Perceptual (MLP) used for features classification. In the MLP learning and test phases, we have used eleven Arabic words and each of them is repeated twenty five times by the same locutor. A simulation program is performed to test the performance of the proposed technique and shows a classification rate equals to 99.39%. We have also introduced a module of denoising as a phase of preprocessing. In this denoising module, we have treated the case of white noise and we have used the Wiener filtering. In case of SNR=5dB, the obtained recognition rate is equals to 78.7% and in case of SNR=10dB, it is equals to 93.9%.
  • Keywords
    Wiener filters; multilayer perceptrons; natural language processing; signal classification; signal denoising; speech processing; vectors; wavelet transforms; white noise; ASR; Arabic speech recognition; BWT; MFCC; MLP; Mel frequency cepstral coefficients; Wiener filtering; bionic wavelet transform; denoising; features classification; multilayer perceptron; speech database; vector; white noise; Feature extraction; Mel frequency cepstral coefficient; Noise reduction; Speech; Speech recognition; Wavelet transforms; Bionic Wavelet Transforms (BWT); Feature Extraction; Mel-Frequency Cepstral Coefficients (MFCCs); Multi-Layer Perceptron (MLP); Speech Recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Sciences of Electronics, Technologies of Information and Telecommunications (SETIT), 2012 6th International Conference on
  • Conference_Location
    Sousse
  • Print_ISBN
    978-1-4673-1657-6
  • Type

    conf

  • DOI
    10.1109/SETIT.2012.6482017
  • Filename
    6482017