Title :
Multi-band based recognition of spoken Arabic numerals using wavelet transform
Author :
Alkhaldi, Weaam ; Fakhr, Waleed ; Hamdy, Nadder
Author_Institution :
Arab Acad. for Sci. & Technol., Alexandria, Egypt
Abstract :
Automatic speech recognition (ASR) using multi-band decomposition provides high recognition rates especially in noisy environments. The discrete wavelet transform (DWT) is known to be an efficient tool for decomposing signals into frequency sub-bands. The concept of feature recombination (FC) as applied to the recognition of spoken Arabic numerals is suggested. Utterances are decomposed using DWT before cepstral coefficients of the resulting sub-bands are calculated. The obtained coefficients are concatenated to form a single feature vector that is used as an input to the speech classifier, e.g. a hidden Markov model (HMM), to compute the likelihood. Simulation results have demonstrated that the achieved correct recognition rates using the suggested method are comparable with the full-band ASR (conventional) system.
Keywords :
cepstral analysis; discrete wavelet transforms; feature extraction; hidden Markov models; natural languages; speech recognition; DWT; HMM; automatic speech recognition; cepstral coefficients; discrete wavelet transform; feature recombination; feature vector; frequency sub-bands; full-band ASR system; hidden Markov model; multi-band based recognition; noisy environments; recognition rates; signal decomposition; simulation results; speech classifier; spoken Arabic numerals; Automatic speech recognition; Cepstral analysis; Continuous wavelet transforms; Discrete wavelet transforms; Frequency; Hidden Markov models; Speech analysis; Speech recognition; Wavelet analysis; Wavelet transforms;
Conference_Titel :
Radio Science Conference, 2002. (NRSC 2002). Proceedings of the Nineteenth National
Print_ISBN :
977-5031-72-9
DOI :
10.1109/NRSC.2002.1022626