Title : 
Gaussian process regression for voice activity detection and speech enhancement
         
        
            Author : 
Park, Sunho ; Choi, Seungjin
         
        
            Author_Institution : 
Dept. of Comput. Sci., Pohang Univ. of Sci. & Technol., Pohang
         
        
        
        
        
        
            Abstract : 
Gaussian process (GP) model is a flexible nonparametric Bayesian method that is widely used in regression and classification. In this paper we present a probabilistic method where we solve voice activity detection (VAD) and speech enhancement in a single framework of GP regression, modeling clean speech by a GP smoother. Optimized hyperparameters in GP models lead us to a novel VAD method since learned length-scale parameters in covariance functions are much different between voiced and unvoiced frames. Clean speech is estimated by posterior means in GP models. Numerical experiments confirm the validity of our method.
         
        
            Keywords : 
Bayes methods; Gaussian processes; nonparametric statistics; probability; regression analysis; signal detection; speech enhancement; Gaussian process regression; covariance functions; nonparametric Bayesian method; optimized hyperparameters; probabilistic method; signal classification; speech enhancement; voice activity detection; Bayesian methods; Gaussian noise; Gaussian processes; Kernel; Random processes; Signal processing; Speech enhancement; Speech processing; Wiener filter; Working environment noise;
         
        
        
        
            Conference_Titel : 
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
         
        
            Conference_Location : 
Hong Kong
         
        
        
            Print_ISBN : 
978-1-4244-1820-6
         
        
            Electronic_ISBN : 
1098-7576
         
        
        
            DOI : 
10.1109/IJCNN.2008.4634203