Title : 
Learning to maximize signal-to-noise ratio for reverberant speech segregation
         
        
            Author : 
Jin, Zhaozhang ; Wang, DeLiang
         
        
            Author_Institution : 
Dept. of Comput. Sci., Ohio State Univ., Columbus, OH
         
        
        
        
        
        
            Abstract : 
Monaural speech segregation in reverberant environments is a very difficult problem. We develop a supervised learning approach by proposing an objective function that directly relates to the computational goal of maximizing signal-to-noise ratio. The model trained using this new objective function yields significantly better results for time-frequency unit labeling. In our segregation system, a segmentation and grouping framework is utilized to form reliable segments under reverberant conditions and organize them into streams. Systematic evaluations show very promising results.
         
        
            Keywords : 
learning (artificial intelligence); speech processing; grouping framework; monaural speech segregation; objective function; reverberant speech segregation; segmentation framework; signal-to-noise ratio; supervised learning approach; time-frequency unit labeling; Filtering; Image analysis; Labeling; Power harmonic filters; Reverberation; Robustness; Signal to noise ratio; Speech; Supervised learning; Time frequency analysis; Computational auditory scene analysis; monaural speech segregation; objective function; room reverberation; supervised learning;
         
        
        
        
            Conference_Titel : 
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
         
        
            Conference_Location : 
Taipei
         
        
        
            Print_ISBN : 
978-1-4244-2353-8
         
        
            Electronic_ISBN : 
1520-6149
         
        
        
            DOI : 
10.1109/ICASSP.2009.4960677