DocumentCode :
699995
Title :
Improving the detection efficiency of the VMR-WB VAD algorithm on music signals
Author :
Malenovsky, Vladimir ; Jelinek, Milan
Author_Institution :
Speech & Audio Res. Group, Univ. of Sherbrooke, Sherbrooke, QC, Canada
fYear :
2008
fDate :
25-29 Aug. 2008
Firstpage :
1
Lastpage :
5
Abstract :
Speech codecs are usually equipped with a voice activity detection (VAD) algorithm to enable efficient coding of inactive frames and the discontinuous transmission (DTX) mode. High VAD efficiency for speech in noisy environments is often traded off against robustness for music. This is also the case for the VMR-WB codec recently standardized by 3GPP2: its VAD fails to detect portions of some critical music samples. In this contribution we propose a method to improve the performance of the VMR-WB VAD on music signals. The idea is to measure the stability of tones in the spectral domain by means of per-tone correlation analysis. With this approach, the music detection accuracy increases to ~99% and the problem of misclassification is significantly reduced. The proposed method has been implemented in the G.718 codec currently being standardized by the ITU-T.
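The abstract gives only the idea of per-tone correlation analysis, not its details. The following is a minimal illustrative sketch, not the authors' algorithm or the G.718 implementation. It assumes that a "tone" is the span of a magnitude spectrum between two neighbouring local minima and that stability is the mean Pearson correlation of each such span between the current and the previous frame. The function names and the segmentation rule are hypothetical.

```python
import math


def find_minima(spec):
    """Indices of local minima; both end bins are added as boundaries."""
    inner = [i for i in range(1, len(spec) - 1)
             if spec[i] < spec[i - 1] and spec[i] <= spec[i + 1]]
    return [0] + inner + [len(spec) - 1]


def tone_correlation(cur, prev, lo, hi):
    """Pearson correlation of two spectra over bins lo..hi (inclusive)."""
    a = cur[lo:hi + 1]
    b = prev[lo:hi + 1]
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(sum((x - mean_a) ** 2 for x in a)
                    * sum((y - mean_b) ** 2 for y in b))
    # A flat segment has no shape to correlate; treat it as unstable (assumption).
    return num / den if den > 0.0 else 0.0


def tonal_stability(cur, prev):
    """Mean per-tone correlation between the current and previous spectrum."""
    minima = find_minima(cur)
    scores = [tone_correlation(cur, prev, lo, hi)
              for lo, hi in zip(minima, minima[1:])
              if hi - lo >= 2]  # need at least three bins to form a peak
    return sum(scores) / len(scores) if scores else 0.0


if __name__ == "__main__":
    steady = [0.0, 1.0, 4.0, 1.0, 0.0, 1.0, 5.0, 1.0, 0.0]
    erratic = [0.0, 4.0, 1.0, 0.0, 5.0, 0.0, 1.0, 5.0, 1.0]
    print(tonal_stability(steady, steady))   # peaks stay in place
    print(tonal_stability(steady, erratic))  # peaks move between frames
```

In a sketch like this, a stability value that stays high over several frames would suggest sustained tones, as in music, while a low or fluctuating value would suggest noise. Any decision threshold would have to be tuned and is not given in the abstract.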
Keywords :
correlation methods; music; signal detection; speech codecs; speech coding; 3GPP2; DTX; G.718 codec; VAD algorithm; VMR-WB VAD algorithm; discontinuous transmission mode; inactive frames; music detection accuracy; music detection efficiency; music signals; noisy environments; per-tone correlation analysis; spectral domain; speech codecs; tone stability; voice activity detection algorithm; Accuracy; Multiple signal classification; Noise; Signal processing algorithms; Speech; Stability analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Signal Processing Conference, 2008 16th European
Conference_Location :
Lausanne
ISSN :
2219-5491
Type :
conf
Filename :
7080527