Title :
New low rate wavelet models for the recognition of single spoken digits
Author :
Karam, J.R. ; Phillips, W.J. ; Robertson, W.
Author_Institution :
Dept. of Eng. Math., Dalhousie Univ., Halifax, NS, Canada
Abstract :
This paper describes three models obtained by applying various wavelet analysis techniques to subwords for the purpose of speaker-independent single-digit recognition. We emphasize the parameterization of the subwords according to a Mel scale in the cases of the sampled continuous wavelet transform (SCWT) and the wavelet packet decomposition (WPD). When the discrete wavelet transform (DWT) is used, a logarithmic segmentation is obtained, and with it comes a very low parameter representation, a reduction of 3:1 compared with the other two introduced models and with the Mel scale model. The DWT model has an advantage over the other two because of its simplicity and fast implementation. Previous work by Phillips, Tosuner and Robertson (1995), based on preprocessing with the traditional Fourier transform (FT) followed by a radial basis function artificial neural network (RBF-ANN), yielded recognition in the 90% range. Our results show that these new models outperformed the Mel scale model.
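Illustration :
The logarithmic segmentation mentioned in the abstract follows from the dyadic structure of the DWT: each decomposition level halves the remaining frequency band. The sketch below is not the authors' implementation. It assumes a Haar wavelet, five decomposition levels, log subband energies as features, and an 8 kHz sampling rate, none of which are stated in the abstract; the function names `haar_dwt_features` and `band_edges` are hypothetical.

```python
import numpy as np


def haar_dwt_features(signal, levels=5):
    """Return the log energy of each Haar DWT subband, lowest band first.

    Assumptions (not from the paper): Haar wavelet, log-energy features,
    and a signal length that is a multiple of 2**levels.
    """
    approx = np.asarray(signal, dtype=float)
    if len(approx) % (2 ** levels) != 0:
        raise ValueError("signal length must be a multiple of 2**levels")
    features = []
    for _ in range(levels):
        even, odd = approx[0::2], approx[1::2]
        # Orthonormal Haar step: detail holds the upper half of the band,
        # approx holds the lower half and is split again on the next pass.
        detail = (even - odd) / np.sqrt(2.0)
        approx = (even + odd) / np.sqrt(2.0)
        features.append(np.log(np.sum(detail ** 2) + 1e-12))
    features.append(np.log(np.sum(approx ** 2) + 1e-12))
    return np.array(features[::-1])


def band_edges(sample_rate, levels=5):
    """Nominal subband edges in Hz; each band is half as wide as the next."""
    nyquist = sample_rate / 2.0
    return [0.0] + [nyquist / 2 ** k for k in range(levels, -1, -1)]


if __name__ == "__main__":
    frame = np.sin(2 * np.pi * np.arange(256) / 16.0)  # synthetic test frame
    print(haar_dwt_features(frame, levels=5))
    print(band_edges(8000, levels=5))
```

With five levels the frame is summarized by six numbers, one per octave-wide band, which is the sense in which a dyadic decomposition gives a compact, logarithmically spaced parameterization.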
Keywords :
discrete wavelet transforms; speech recognition; RBF-ANN; DWT; Fourier transform; Mel scale model; discrete wavelet transform; feature vectors; logarithmic segmentation; low rate wavelet models; parameter representation; preprocessing; radial basis functions artificial neural network; sampled continuous wavelet transform; single spoken digits recognition; speaker independent single digit recognition; subwords; wavelet analysis techniques; wavelet packet decomposition; Artificial neural networks; Continuous wavelet transforms; Discrete wavelet transforms; Fourier transforms; Frequency; Mathematics; Sampling methods; Wavelet analysis; Wavelet packets; Wavelet transforms
Conference_Title :
Electrical and Computer Engineering, 2000 Canadian Conference on
Conference_Location :
Halifax, NS
Print_ISBN :
0-7803-5957-7
DOI :
10.1109/CCECE.2000.849724