Title : 
Voice Activity Detection Via Noise Reducing Using Non-Negative Sparse Coding
         
        
            Author : 
Teng, Peng ; Jia, Yunde
         
        
            Author_Institution : 
Beijing Lab of Intelligent Information Technology, and the School of Computer Science, Beijing Institute of Technology, Beijing, China
         
        
        
        
        
        
        
        
            Abstract : 
This letter presents a voice activity detection (VAD) approach using non-negative sparse coding to improve the detection performance in low signal-to-noise ratio (SNR) conditions. The basic idea is to use features extracted from a noise-reduced representation of original audio signals. We decompose the magnitude spectrum of an audio signal on a speech dictionary learned from clean speech and a noise dictionary learned from noise samples. Only coefficients corresponding to the speech dictionary are considered and used as the noise-reduced representation of the signal for feature extraction. A conditional random field (CRF) is used to model the correlation between feature sequences and voice activity labels along audio signals. Then, we assign the voice activity labels for a given audio by decoding the CRF. Experimental results demonstrate that our VAD approach has a good performance in low SNR conditions.
         
        
            Keywords : 
Dictionaries; Encoding; Feature extraction; Signal to noise ratio; Speech; Vectors; Conditional random fields; noise reducing; non-negative sparse coding; voice activity detection;
         
        
        
            Journal_Title : 
Signal Processing Letters, IEEE
         
        
        
        
        
            DOI : 
10.1109/LSP.2013.2252615