• DocumentCode
    2951560
  • Title

    Auditory Perception Based Admissible Wavelet Packet Trees For Speech Recognition

  • Author

    Nehe, N.S. ; Holambe, R.S.

  • Author_Institution
    Dept. of Instrum. Eng., S.G.G.S. Inst. of Eng. & Technol., Nanded
  • fYear
    2008
  • fDate
    8-10 Dec. 2008
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    This paper presents the use of auditory perception based admissible wavelet packet tree (WPT) for partitioning of speech frequencies into different bands based on the Mel scale or the Bark Scale. The proposed WPTs selected using root mean square error (RMSE) criterion mimic the Mel scale or the bark scale more accurately and hence the human auditory system. Performance of the features obtained from the proposed WPTs is compared with Mel frequency cepstral coefficients (MFCC). The algorithms are evaluated using NIST TI-46 isolated-word database using hidden Markov model (HMM) as a classifier. Experimental results show that the performance of proposed features is better than MFCC and other wavelet features for isolated word recognition (IWR).
  • Keywords
    feature extraction; hearing; mean square error methods; speech recognition; trees (mathematics); wavelet transforms; Bark scale; Mel frequency cepstral coefficient; feature obtained; hidden Markov model; human auditory perception; root mean square error; speech feature extraction; speech recognition; wavelet packet tree; Auditory system; Discrete wavelet transforms; Frequency conversion; Humans; Information systems; Instruments; Mel frequency cepstral coefficient; Region 10; Speech recognition; Wavelet packets; Human Auditory Perception; Isolated Word Recognition; Wavelet Packet Tree;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Industrial and Information Systems, 2008. ICIIS 2008. IEEE Region 10 and the Third international Conference on
  • Conference_Location
    Kharagpur
  • Print_ISBN
    978-1-4244-2806-9
  • Electronic_ISBN
    978-1-4244-2806-9
  • Type

    conf

  • DOI
    10.1109/ICIINFS.2008.4798363
  • Filename
    4798363