Title :
Auditory Perception Based Admissible Wavelet Packet Trees For Speech Recognition
Author :
Nehe, N.S. ; Holambe, R.S.
Author_Institution :
Dept. of Instrum. Eng., S.G.G.S. Inst. of Eng. & Technol., Nanded
Abstract :
This paper presents the use of auditory perception based admissible wavelet packet tree (WPT) for partitioning of speech frequencies into different bands based on the Mel scale or the Bark Scale. The proposed WPTs selected using root mean square error (RMSE) criterion mimic the Mel scale or the bark scale more accurately and hence the human auditory system. Performance of the features obtained from the proposed WPTs is compared with Mel frequency cepstral coefficients (MFCC). The algorithms are evaluated using NIST TI-46 isolated-word database using hidden Markov model (HMM) as a classifier. Experimental results show that the performance of proposed features is better than MFCC and other wavelet features for isolated word recognition (IWR).
Keywords :
feature extraction; hearing; mean square error methods; speech recognition; trees (mathematics); wavelet transforms; Bark scale; Mel frequency cepstral coefficient; feature obtained; hidden Markov model; human auditory perception; root mean square error; speech feature extraction; speech recognition; wavelet packet tree; Auditory system; Discrete wavelet transforms; Frequency conversion; Humans; Information systems; Instruments; Mel frequency cepstral coefficient; Region 10; Speech recognition; Wavelet packets; Human Auditory Perception; Isolated Word Recognition; Wavelet Packet Tree;
Conference_Titel :
Industrial and Information Systems, 2008. ICIIS 2008. IEEE Region 10 and the Third international Conference on
Conference_Location :
Kharagpur
Print_ISBN :
978-1-4244-2806-9
Electronic_ISBN :
978-1-4244-2806-9
DOI :
10.1109/ICIINFS.2008.4798363