Auditory front-ends for noise-robust automatic speech recognition

Author

Yeh, Ja-Zang ; Chen, Chia-Ping

Author_Institution

Dept. of Comput. Sci. & Eng., Nat. Sun Yat-Sen Univ., Kaohsiung, Taiwan

fYear

2010

fDate

Nov. 29 2010-Dec. 3 2010

Firstpage

205

Lastpage

208

Abstract

In this paper, we investigate a noise-robust feature extraction method for automatic speech recognition systems, based on the auditory masking effect. We physically model the basilar membrane as a cascade system of simple harmonic oscillators and mathematically analyze its motion in response to speech signals. From this analysis, we identify a correlational factor for the coupled motion of the oscillators, which partially explains the masking effect. Accordingly, we insert an auditory module into the speech feature extraction process. The proposed method is evaluated on the Aurora 2.0 noisy-digit speech database, where it achieves significant improvements.

Keywords

cascade systems; feature extraction; speech intelligibility; speech recognition; auditory front end; auditory masking effect; basilar membrane; cascade system; correlational factor; coupled motion; noise robust automatic speech recognition; noise robust feature extraction method; noisy digit speech database; simple harmonic oscillator; speech signal; Biomembranes; Feature extraction; Harmonic analysis; Noise measurement; Oscillators; Speech; Speech recognition; auditory model; frequency masking; noise robustness; speech recognition

fLanguage

English

Publisher

IEEE

Conference_Titel

Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on

Conference_Location

Tainan

Print_ISBN

978-1-4244-6244-5

Type

conf

DOI

10.1109/ISCSLP.2010.5684910

Filename

5684910