Title :
Improving the detection efficiency of the VMR-WB VAD algorithm on music signals
Author :
Malenovsky, Vladimir ; Jelinek, Milan
Author_Institution :
Speech & Audio Res. Group, Univ. of Sherbrooke, Sherbrooke, QC, Canada
Abstract :
Speech codecs are usually equipped with a voice activity detection (VAD) algorithm to enable efficient coding of inactive frames and the discontinuous transmission (DTX) mode. High VAD efficiency for speech in noisy environments is often traded off against robustness for music. This is also the case for the VMR-WB codec recently standardized by 3GPP2: its VAD fails to detect portions of some critical music samples. In this contribution, we propose a method to improve the performance of the VMR-WB VAD on music signals. The idea is to measure the stability of tones in the spectral domain by means of per-tone correlation analysis. With this approach, the music detection accuracy is increased to ~99% and the problem of misclassification is significantly reduced. The proposed method has been implemented in the G.718 codec currently being standardized by the ITU-T.
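The per-tone correlation idea can be illustrated with a minimal sketch. This is not the authors' algorithm or the G.718 implementation: the segmentation of the spectrum into tones at local minima, the use of a Pearson correlation, the averaging over tones, and the names `local_minima`, `pearson`, `tone_stability`, and `min_width` are all assumptions made for illustration. The sketch compares two consecutive magnitude spectra and returns a score close to 1 when the tonal peaks stay in place from frame to frame.

```python
import math


def local_minima(spec):
    """Indices of local minima of a magnitude spectrum, plus both endpoints."""
    idx = [0]
    for i in range(1, len(spec) - 1):
        if spec[i] <= spec[i - 1] and spec[i] < spec[i + 1]:
            idx.append(i)
    idx.append(len(spec) - 1)
    return idx


def pearson(a, b):
    """Pearson correlation of two equal-length sequences (0.0 if either is flat)."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(
        sum((x - mean_a) ** 2 for x in a) * sum((y - mean_b) ** 2 for y in b)
    )
    return num / den if den > 0 else 0.0


def tone_stability(prev_spec, cur_spec, min_width=3):
    """Mean correlation between two frames, computed separately per tone.

    Each "tone" is assumed to be the span between neighbouring local minima
    of the current spectrum (a hypothetical segmentation rule).
    """
    bounds = local_minima(cur_spec)
    scores = []
    for lo, hi in zip(bounds, bounds[1:]):
        if hi - lo + 1 < min_width:
            continue  # too few bins to correlate meaningfully
        scores.append(pearson(prev_spec[lo:hi + 1], cur_spec[lo:hi + 1]))
    return sum(scores) / len(scores) if scores else 0.0


# Two spectral peaks that stay in place versus peaks that have moved.
steady = [0, 1, 4, 1, 0, 2, 6, 2, 0]
moved = [4, 1, 0, 1, 4, 1, 0, 2, 6]
stable_score = tone_stability(steady, steady)
unstable_score = tone_stability(moved, steady)
```

In a detector built along these lines, a score that stays high over several frames would suggest stable tones, as in music, and could be used to keep the VAD decision active; any threshold or smoothing would have to be tuned and is not specified here.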
Keywords :
correlation methods; music; signal detection; speech codecs; speech coding; 3GPP2; DTX; G.718 codec; VAD algorithm; VMR-WB VAD algorithm; discontinuous transmission mode; inactive frames; music detection accuracy; music detection efficiency; music signals; noisy environments; per-tone correlation analysis; spectral domain; tone stability; voice activity detection algorithm; Accuracy; Multiple signal classification; Noise; Signal processing algorithms; Speech; Stability analysis
Conference_Title :
Signal Processing Conference, 2008 16th European
Conference_Location :
Lausanne