• DocumentCode
    1454200
  • Title

    Auditory processing of speech signals for robust speech recognition in real-world noisy environments

  • Author

    Kim, Doh-Suk ; Lee, Soo-Young ; Kil, Rhee M.

  • Author_Institution
    Lucent Technol., AT&T Bell Labs., Murray Hill, NJ, USA
  • Volume
    7
  • Issue
    1
  • fYear
    1999
  • fDate
    1/1/1999 12:00:00 AM
  • Firstpage
    55
  • Lastpage
    69
  • Abstract
    This paper presents a new approach to an auditory model for robust speech recognition in noisy environments. The proposed model consists of cochlear bandpass filters and nonlinear operations in which frequency information of the signal is obtained by zero-crossing intervals. Intensity information is also incorporated by a peak detector and a compressive nonlinearity. The robustness of the zero-crossings in spectral estimation is verified by analyzing the variance of the level-crossing intervals as a function of the crossing level values. Compared with other auditory models, the proposed auditory model is computationally efficient, free from many unknown parameters, and able to serve as a robust front-end for speech recognition in noisy environments. Experimental results of speech recognition demonstrate the robustness of the proposed method in various types of noisy environments
  • Keywords
    band-pass filters; ear; feature extraction; filtering theory; hearing; noise; spectral analysis; speech processing; speech recognition; auditory model; auditory processing; cochlear bandpass filters; compressive nonlinearity; computationally efficient model; experimental results; feature extraction; frequency information; intensity information; level-crossing intervals variance; nonlinear operations; peak detector; real-world noisy environments; robust front-end; robust speech recognition; spectral estimation; speech signals; zero-crossing intervals; Analysis of variance; Band pass filters; Computational modeling; Detectors; Frequency; Robustness; Signal processing; Speech processing; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.736331
  • Filename
    736331