Title :
Maximum Margin Clustering Based Statistical VAD With Multiple Observation Compound Feature
Author :
Wu, Ji ; Zhang, Xiao-Lei
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fDate :
5/1/2011 12:00:00 AM
Abstract :
In this letter, we propose a new robust feature and an unsupervised learning approach for statistical voice activity detection (VAD). Maximum margin clustering (MMC), as an unsupervised classifier, can improve the robustness of support vector machine (SVM) based VAD while requiring no data labeling for model training. In the MMC framework, the multiple observation compound feature (MO-CF) is proposed to improve accuracy. MO-CF is composed of two subfeatures-multiple observation signal-to-noise ratio (MO-SNR) and multiple observation maximum probability (MO-MP). The contributions of the two subfeatures are balanced by a factor which is chosen to yield the largest area under the ROC curve (AUC) of the performance. The proposed approach obtains improved performance over seven commonly used VAD techniques in the experiments covering various noisy scenarios with low SNRs.
Keywords :
pattern clustering; probability; speech processing; statistical analysis; support vector machines; unsupervised learning; MMC; ROC curve; VAD; maximum margin clustering; maximum probability; multiple observation compound feature; signal to noise ratio; statistical analysis; support vector machine; unsupervised learning; voice activity detection; Compounds; Feature extraction; Kernel; Labeling; Speech; Support vector machines; Training; Maximum margin clustering; multiple observation compound feature; support vector machine; unsupervised learning; voice activity detection;
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2011.2119482