DocumentCode
2951560
Title
Auditory Perception Based Admissible Wavelet Packet Trees For Speech Recognition
Author
Nehe, N.S. ; Holambe, R.S.
Author_Institution
Dept. of Instrum. Eng., S.G.G.S. Inst. of Eng. & Technol., Nanded
fYear
2008
fDate
8-10 Dec. 2008
Firstpage
1
Lastpage
5
Abstract
This paper presents the use of auditory perception based admissible wavelet packet tree (WPT) for partitioning of speech frequencies into different bands based on the Mel scale or the Bark Scale. The proposed WPTs selected using root mean square error (RMSE) criterion mimic the Mel scale or the bark scale more accurately and hence the human auditory system. Performance of the features obtained from the proposed WPTs is compared with Mel frequency cepstral coefficients (MFCC). The algorithms are evaluated using NIST TI-46 isolated-word database using hidden Markov model (HMM) as a classifier. Experimental results show that the performance of proposed features is better than MFCC and other wavelet features for isolated word recognition (IWR).
Keywords
feature extraction; hearing; mean square error methods; speech recognition; trees (mathematics); wavelet transforms; Bark scale; Mel frequency cepstral coefficient; feature obtained; hidden Markov model; human auditory perception; root mean square error; speech feature extraction; speech recognition; wavelet packet tree; Auditory system; Discrete wavelet transforms; Frequency conversion; Humans; Information systems; Instruments; Mel frequency cepstral coefficient; Region 10; Speech recognition; Wavelet packets; Human Auditory Perception; Isolated Word Recognition; Wavelet Packet Tree;
fLanguage
English
Publisher
ieee
Conference_Titel
Industrial and Information Systems, 2008. ICIIS 2008. IEEE Region 10 and the Third international Conference on
Conference_Location
Kharagpur
Print_ISBN
978-1-4244-2806-9
Electronic_ISBN
978-1-4244-2806-9
Type
conf
DOI
10.1109/ICIINFS.2008.4798363
Filename
4798363
Link To Document