DocumentCode
2018377
Title
Auditory front-ends for noise-robust automatic speech recognition
Author
Yeh, Ja-Zang ; Chen, Chia-Ping
Author_Institution
Dept. of Comput. Sci. & Eng., Nat. Sun Yat-Sen Univ., Kaohsiung, Taiwan
fYear
2010
fDate
Nov. 29 2010-Dec. 3 2010
Firstpage
205
Lastpage
208
Abstract
In this paper we investigate a noise-robust feature extraction method, which is based on the auditory masking effect, for automatic speech recognition systems. We physically model the basilar membrane as a cascade system of simple harmonic oscillators, and mathematically analyze the motion of the basilar membrane due to speech signals. Based on the analysis, we can identify a correlational factor for the coupled motion of the oscillators, which can be used to partially explain the masking effect. Accordingly, we insert an auditory module in the speech feature extraction process. The proposed methodology is evaluated on the Aurora 2.0 noisy-digit speech database, and it achieves significant improvements.
Keywords
cascade systems; feature extraction; speech intelligibility; speech recognition; auditory front end; auditory masking effect; basilar membrane; cascade system; correlational factor; coupled motion; noise robust automatic speech recognition; noise robust feature extraction method; noisy digit speech database; simple harmonic oscillator; speech signal; Biomembranes; Feature extraction; Harmonic analysis; Noise measurement; Oscillators; Speech; Speech recognition; auditory model; frequency masking; noise robustness; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location
Tainan
Print_ISBN
978-1-4244-6244-5
Type
conf
DOI
10.1109/ISCSLP.2010.5684910
Filename
5684910
Link To Document