• DocumentCode
    2402276
  • Title

    Speech enhancement in noisy environment using voice activity detection and wavelet thresholding

  • Author

    Borisagar, Komal R. ; Kamdar, Dipesh G. ; Sedani, Bhavin S. ; Kulkarni, G.R.

  • Author_Institution
    Dept. of Electron. & Commun., AITS, Rajkot, India
  • fYear
    2010
  • fDate
    28-29 Dec. 2010
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Voice activity detection (VAD) is an outstanding problem for speech transmission, enhancement and recognition. The variety and the varying nature of speech and background noise make it especially challenging. In the past years, many features emphasizing the differences between speech and noise have been proposed for their robustness. However an important problem in many areas of speech processing is the determination of presence of speech periods in a given signal. This task can be identified as a statistical hypothesis problem and its purpose is the determination to which category or class a given signal belongs. Also the classification task is often not as trivial as it appears since the increasing level of background noise leading to numerous detection errors. The selection of an adequate feature vector for signal detection and a robust decision rule is a challenging problem that affects the performance of VADs. Most algorithms are effective in numerous applications but often cause detection errors mainly due to the loss of discriminating power of the decision rule at lower SNRs. In this paper, it has been tried to extract the characteristics of noise by the VAD algorithm which can be used to smooth out the signal in silence part from the noisy environment. For further noise reduction signal then filtered in the wavelet domain using thresholding.
  • Keywords
    acoustic signal detection; signal denoising; smoothing methods; speech enhancement; speech recognition; statistical analysis; voice communication; wavelet transforms; background noise; classification task; feature vector; noisy environment; signal detection errors; signal smoothing; speech enhancement; speech periods; speech processing; speech recognition; speech transmission; statistical hypothesis problem; voice activity detection; wavelet domain; wavelet thresholding; Noise; Noise measurement; Noise reduction; Speech; Speech enhancement; Wavelet transforms; Discrete wavelet transform; Multiresolution approximation; Voice activity detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Computing Research (ICCIC), 2010 IEEE International Conference on
  • Conference_Location
    Coimbatore
  • Print_ISBN
    978-1-4244-5965-0
  • Electronic_ISBN
    978-1-4244-5967-4
  • Type

    conf

  • DOI
    10.1109/ICCIC.2010.5705834
  • Filename
    5705834