• DocumentCode
    430669
  • Title

    Wavelet statistical model of speech for feature extraction and denoising

  • Author

    Yu, Shao ; Tong, Y.C. ; Chao, Wang

  • Author_Institution
    Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • Volume
    1
  • fYear
    2004
  • fDate
    6-9 Dec. 2004
  • Firstpage
    189
  • Abstract
    To improve the performance of automatic speech recognition (ASR) systems, a new method is introduced to extract features and reduce noise. Robust features are obtained from a wavelet packet transform (WPT)/local discriminant bases (LDB) stage, which can efficiently classify different speech utterances. The enhancement is performed by means of a feed-forward subsystem linked to the WPT/LDB stage, an ARMA/wavelet based discriminant function minimum (DFM) working as a blind adaptive filter (BAF), and an unvoiced speech enhancement stage. It also makes use of minimum description length (MDL) to reduce the number of wavelet coefficients for automatic recognition and re-synthesis. Simulation results showed that this new system is an efficient classifier and improves the robustness of ASR systems in various adverse noisy conditions.
  • Keywords
    adaptive filters; feature extraction; feedforward; signal denoising; speech enhancement; speech recognition; wavelet transforms; ARMA; automatic speech recognition systems; blind adaptive filter; discriminant function minimum; feature extraction; feed-forward subsystem; local discriminant bases; noise reduction; signal denoising; speech utterances; unvoiced speech enhancement; wavelet coefficients; wavelet packet transform; wavelet statistical model; Adaptive filters; Automatic speech recognition; Design for manufacture; Feature extraction; Feedforward systems; Noise reduction; Noise robustness; Speech enhancement; Wavelet packets; Wavelet transforms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits and Systems, 2004. Proceedings. The 2004 IEEE Asia-Pacific Conference on
  • Print_ISBN
    0-7803-8660-4
  • Type

    conf

  • DOI
    10.1109/APCCAS.2004.1412724
  • Filename
    1412724