DocumentCode :
2402276
Title :
Speech enhancement in noisy environment using voice activity detection and wavelet thresholding
Author :
Borisagar, Komal R. ; Kamdar, Dipesh G. ; Sedani, Bhavin S. ; Kulkarni, G.R.
Author_Institution :
Dept. of Electron. & Commun., AITS, Rajkot, India
fYear :
2010
fDate :
28-29 Dec. 2010
Firstpage :
1
Lastpage :
5
Abstract :
Voice activity detection (VAD) is an outstanding problem for speech transmission, enhancement and recognition. The variety and the varying nature of speech and background noise make it especially challenging. In the past years, many features emphasizing the differences between speech and noise have been proposed for their robustness. However an important problem in many areas of speech processing is the determination of presence of speech periods in a given signal. This task can be identified as a statistical hypothesis problem and its purpose is the determination to which category or class a given signal belongs. Also the classification task is often not as trivial as it appears since the increasing level of background noise leading to numerous detection errors. The selection of an adequate feature vector for signal detection and a robust decision rule is a challenging problem that affects the performance of VADs. Most algorithms are effective in numerous applications but often cause detection errors mainly due to the loss of discriminating power of the decision rule at lower SNRs. In this paper, it has been tried to extract the characteristics of noise by the VAD algorithm which can be used to smooth out the signal in silence part from the noisy environment. For further noise reduction signal then filtered in the wavelet domain using thresholding.
Keywords :
acoustic signal detection; signal denoising; smoothing methods; speech enhancement; speech recognition; statistical analysis; voice communication; wavelet transforms; background noise; classification task; feature vector; noisy environment; signal detection errors; signal smoothing; speech enhancement; speech periods; speech processing; speech recognition; speech transmission; statistical hypothesis problem; voice activity detection; wavelet domain; wavelet thresholding; Noise; Noise measurement; Noise reduction; Speech; Speech enhancement; Wavelet transforms; Discrete wavelet transform; Multiresolution approximation; Voice activity detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Computing Research (ICCIC), 2010 IEEE International Conference on
Conference_Location :
Coimbatore
Print_ISBN :
978-1-4244-5965-0
Electronic_ISBN :
978-1-4244-5967-4
Type :
conf
DOI :
10.1109/ICCIC.2010.5705834
Filename :
5705834
Link To Document :
بازگشت