DocumentCode :
1736900
Title :
Performance analysis of wavelet subband based voice activity detection in cocktail party environment
Author :
Pham, Tuan V. ; Stark, Michael ; Rank, Erhard
Author_Institution :
Electron. & Telecommun. Eng. Dept., Danang Univ. of Technol., Danang, Vietnam
fYear :
2010
Firstpage :
85
Lastpage :
88
Abstract :
In this paper, we analyze the performance of wavelet-based voice activity detection (VAD) algorithms with respect to the detection of target speech. In addition, the state-of-the-art VAD standardized for the G. 729 B, the ETSI AFE ES 202 050 are evaluated extensively. Experimental results on a self-built cocktail party corpus including different target-interference speech activity conditions are provided. Results show that: (i) the wavelet-based VAD algorithms are superior to other VADmethods in terms of classification measures; (ii) the robustness of the wavelet feature still holds in a completely mismatched environment.
Keywords :
interference (wave); speech synthesis; ETSI AFE ES 202 050; VAD; cocktail party environment; performance analysis; target-interference speech; voice activity detection; wavelet subband; neural network; percentile filter; voice activity detection; wavelet subband;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Technologies for Communications (ATC), 2010 International Conference on
Conference_Location :
Ho Chi Minh City
Print_ISBN :
978-1-4244-8875-9
Type :
conf
DOI :
10.1109/ATC.2010.5672718
Filename :
5672718
Link To Document :
بازگشت