Title :
Wavelet statistical model of speech for feature extraction and denoising
Author :
Yu, Shao ; Tong, Y.C. ; Chao, Wang
Author_Institution :
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
Abstract :
To improve the performance of automatic speech recognition (ASR) systems, a new method is introduced to extract features and reduce noise. Robust features are obtained from a wavelet packet transform (WPT)/local discriminant bases (LDB) stage, which can efficiently classify different speech utterances. The enhancement is performed by means of a feed-forward subsystem linked to the WPT/LDB stage, an ARMA/wavelet based discriminant function minimum (DFM) working as a blind adaptive filter (BAF), and an unvoiced speech enhancement stage. It also makes use of minimum description length (MDL) to reduce the number of wavelet coefficients for automatic recognition and re-synthesis. Simulation results showed that this new system is an efficient classifier and improves the robustness of ASR systems in various adverse noisy conditions.
Keywords :
adaptive filters; feature extraction; feedforward; signal denoising; speech enhancement; speech recognition; wavelet transforms; ARMA; automatic speech recognition systems; blind adaptive filter; discriminant function minimum; feature extraction; feed-forward subsystem; local discriminant bases; noise reduction; signal denoising; speech utterances; unvoiced speech enhancement; wavelet coefficients; wavelet packet transform; wavelet statistical model; Adaptive filters; Automatic speech recognition; Design for manufacture; Feature extraction; Feedforward systems; Noise reduction; Noise robustness; Speech enhancement; Wavelet packets; Wavelet transforms;
Conference_Titel :
Circuits and Systems, 2004. Proceedings. The 2004 IEEE Asia-Pacific Conference on
Print_ISBN :
0-7803-8660-4
DOI :
10.1109/APCCAS.2004.1412724