• DocumentCode
    2018377
  • Title

    Auditory front-ends for noise-robust automatic speech recognition

  • Author

    Yeh, Ja-Zang ; Chen, Chia-Ping

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Nat. Sun Yat-Sen Univ., Kaohsiung, Taiwan
  • fYear
    2010
  • fDate
    Nov. 29 2010-Dec. 3 2010
  • Firstpage
    205
  • Lastpage
    208
  • Abstract
    In this paper we investigate a noise-robust feature extraction method, which is based on the auditory masking effect, for automatic speech recognition systems. We physically model the basilar membrane as a cascade system of simple harmonic oscillators, and mathematically analyze the motion of the basilar membrane due to speech signals. Based on the analysis, we can identify a correlational factor for the coupled motion of the oscillators, which can be used to partially explain the masking effect. Accordingly, we insert an auditory module in the speech feature extraction process. The proposed methodology is evaluated on the Aurora 2.0 noisy-digit speech database, and it achieves significant improvements.
  • Keywords
    cascade systems; feature extraction; speech intelligibility; speech recognition; auditory front end; auditory masking effect; basilar membrane; cascade system; correlational factor; coupled motion; noise robust automatic speech recognition; noise robust feature extraction method; noisy digit speech database; simple harmonic oscillator; speech signal; Biomembranes; Feature extraction; Harmonic analysis; Noise measurement; Oscillators; Speech; Speech recognition; auditory model; frequency masking; noise robustness; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
  • Conference_Location
    Tainan
  • Print_ISBN
    978-1-4244-6244-5
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2010.5684910
  • Filename
    5684910