DocumentCode :
1459849
Title :
Maximum Margin Clustering Based Statistical VAD With Multiple Observation Compound Feature
Author :
Wu, Ji ; Zhang, Xiao-Lei
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Volume :
18
Issue :
5
fYear :
2011
fDate :
5/1/2011 12:00:00 AM
Firstpage :
283
Lastpage :
286
Abstract :
In this letter, we propose a new robust feature and an unsupervised learning approach for statistical voice activity detection (VAD). Maximum margin clustering (MMC), as an unsupervised classifier, can improve the robustness of support vector machine (SVM) based VAD while requiring no data labeling for model training. In the MMC framework, the multiple observation compound feature (MO-CF) is proposed to improve accuracy. MO-CF is composed of two subfeatures-multiple observation signal-to-noise ratio (MO-SNR) and multiple observation maximum probability (MO-MP). The contributions of the two subfeatures are balanced by a factor which is chosen to yield the largest area under the ROC curve (AUC) of the performance. The proposed approach obtains improved performance over seven commonly used VAD techniques in the experiments covering various noisy scenarios with low SNRs.
Keywords :
pattern clustering; probability; speech processing; statistical analysis; support vector machines; unsupervised learning; MMC; ROC curve; VAD; maximum margin clustering; maximum probability; multiple observation compound feature; signal to noise ratio; statistical analysis; support vector machine; unsupervised learning; voice activity detection; Compounds; Feature extraction; Kernel; Labeling; Speech; Support vector machines; Training; Maximum margin clustering; multiple observation compound feature; support vector machine; unsupervised learning; voice activity detection;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/LSP.2011.2119482
Filename :
5720516
Link To Document :
بازگشت