مرکز منطقه ای اطلاع رساني علوم و فناوري - A Supervised Learning Approach to Monaural Segregation of Reverberant Speech

DocumentCode :

1196223

Title :

A Supervised Learning Approach to Monaural Segregation of Reverberant Speech

Author :

Jin, Zhaozhang ; Wang, DeLiang

Author_Institution :

Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH

Volume :

Issue :

fYear :

2009

fDate :

5/1/2009 12:00:00 AM

Firstpage :

625

Lastpage :

638

Abstract :

A major source of signal degradation in real environments is room reverberation. Monaural speech segregation in reverberant environments is a particularly challenging problem. Although inverse filtering has been proposed to partially restore the harmonicity of reverberant speech before segregation, this approach is sensitive to specific source/receiver and room configurations. This paper proposes a supervised learning approach to monaural segregation of reverberant voiced speech, which learns to map from a set of pitch-based auditory features to a grouping cue encoding the posterior probability of a time-frequency (T-F) unit being target dominant given observed features. We devise a novel objective function for the learning process, which directly relates to the goal of maximizing signal-to-noise ratio. The models trained using this objective function yield significantly better T-F unit labeling. A segmentation and grouping framework is utilized to form reliable segments under reverberant conditions and organize them into streams. Systematic evaluations show that our approach produces very promising results under various reverberant conditions and generalizes well to new utterances and new speakers.

Keywords :

learning (artificial intelligence); reverberation; speech processing; inverse filtering; monaural speech segregation; reverberant speech; signal degradation; supervised learning approach; Degradation; Encoding; Filtering; Labeling; Reverberation; Signal restoration; Signal to noise ratio; Speech; Supervised learning; Time frequency analysis; Computational auditory scene analysis (CASA); monaural segregation; room reverberation; speech separation; supervised learning;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2008.2010633

Filename :

4802176

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1196223