• DocumentCode
    302083
  • Title

    Feature extraction based on zero-crossings with peak amplitudes for robust speech recognition in noisy environments

  • Author

    Kim, Doh-Suk ; Jeong, Jae-Hoon ; Kim, Jae Weon ; Lee, Soo Young

  • Author_Institution
    Dept. of Electr. Eng., Korea Adv. Inst. of Sci. & Technol., Taejon, South Korea
  • Volume
    1
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    61
  • Abstract
    The ensemble interval histogram (EIH) is an auditory model which can be used as a robust “front-end” for speech recognition systems. The utilization of multiple level-crossing detectors in the EIH provides frequency and intensity information, which may be useful for speech processing. Proper determination of the number of levels and the level values is very important for reliable performance of the system. An analytic relationship is developed for the variance and SNR of the level-crossing intervals as a function of the crossing level value, and a new feature extraction method based on zero-crossings with peak amplitudes is proposed for robust speech recognition in noisy environments. The proposed method not only can preserve intensity information, but also is robust to noise in estimating the frequency information without the need to determine the level values and the number of levels. Experimental results show the robustness of the proposed method
  • Keywords
    feature extraction; frequency estimation; hearing; noise; signal detection; speech processing; speech recognition; SNR; auditory model; ensemble interval histogram; experimental results; feature extraction; frequency information estimation; intensity information; level crossing intervals; multiple level crossing detectors; noisy environments; peak amplitudes; robust front-end; robust speech recognition; speech processing; speech recognition systems; system performance; variance; zero-crossings; Analysis of variance; Detectors; Feature extraction; Frequency estimation; Histograms; Noise level; Noise robustness; Speech processing; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.540290
  • Filename
    540290