DocumentCode :
3396955
Title :
Comparison between wavelet packet transform, Bark Wavelet & MFCC for robust speech recognition tasks
Author :
Tohidypour, Hamid Reza ; Seyyedsalehi, Seyyed Ali ; Behbood, Hossein
Author_Institution :
Dept. of Biomed. Eng., Amirkabir Univ., Tehran, Iran
Volume :
2
fYear :
2010
fDate :
30-31 May 2010
Firstpage :
329
Lastpage :
332
Abstract :
Although the wavelet transform has multi-resolution properties, it is not optimized for speech recognition tasks. There are two major approaches to adapting it: the first uses the wavelet packet transform to select frequency bands similar to those of a perceptual auditory scale, and the second exploits the frequency-domain properties of the continuous wavelet, leading to the Bark wavelet. This paper shows that because the ordinary wavelet packet transform is time variant, its filter bank overlaps heavily, and it cannot reproduce the exact auditory bandwidths of the Bark scale, it performs worse than perceptual representations in speech recognition, especially under noisy conditions. Mel Frequency Cepstral Coefficients (MFCC) are among the best-known features used for speech recognition and are optimized for that task. This paper also shows that because the time-frequency localization capability of the Bark wavelet transform, together with its multi-resolution property, makes it more suitable than the discrete cosine transform, the Bark wavelet performs better than both MFCC and the wavelet packet transform in noisy conditions.
Keywords :
discrete cosine transforms; discrete wavelet transforms; speech recognition; Bark wavelet; Mel frequency cepstral coefficients; discrete cosine transform; speech recognition; wavelet packet transform; Continuous wavelet transforms; Discrete wavelet transforms; Filter bank; Mel frequency cepstral coefficient; Optimization methods; Robustness; Speech recognition; Time frequency analysis; Wavelet packets; Wavelet transforms; Bark wavelet; FARSDAT Database; Mel Frequency Cepstral Coefficient (MFCC); Robust Speech Recognition; Time Delay Neural Network (TDNN); Wavelet Packet Transform;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Industrial Mechatronics and Automation (ICIMA), 2010 2nd International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-7653-4
Type :
conf
DOI :
10.1109/ICINDMA.2010.5538304
Filename :
5538304