DocumentCode
699995
Title
Improving the detection efficiency of the VMR-WB VAD algorithm on music signals
Author
Malenovsky, Vladimir ; Jelinek, Milan
Author_Institution
Speech & audio Res. group, Univ. of Sherbrooke, Sherbrooke, QC, Canada
fYear
2008
fDate
25-29 Aug. 2008
Firstpage
1
Lastpage
5
Abstract
Speech codecs are usually equipped with voice activity detection (VAD) algorithm to enable efficient coding of inactive frames and the discontinuous transmission mode (DTX). High VAD efficiency for speech in noisy environments is often traded off against its robustness for music. This is also the case of the VMR-WB codec recently standardized by 3GPP2. Its VAD fails to detect portions of some critical music samples. In this contribution we propose a method to improve the performance of the VMR-WB VAD on music signals. The idea is to measure the stability of tones in the spectral domain by means of per-tone correlation analysis. By using this approach, the music detection accuracy is increased to ~99% and the problem of misclassification is significantly reduced. The proposed method has been implemented in the G.718 codec being currently standardized by the ITU-T.
Keywords
correlation methods; music; signal detection; speech codecs; speech coding; 3GPP2; DTX; G.718 codec; VAD algorithm; VMR-WB VAD algorithm; discontinuous transmission mode; inactive frames; music detection accuracy; music detection efficiency; music signals; noisy environments; per-tone correlation analysis; spectral domain; speech codecs; tone stability; voice activity detection algorithm; Accuracy; Multiple signal classification; Noise; Signal processing algorithms; Speech; Stability analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2008 16th European
Conference_Location
Lausanne
ISSN
2219-5491
Type
conf
Filename
7080527
Link To Document