مرکز منطقه ای اطلاع رساني علوم و فناوري - An improved BLUES with adaptive threshold of condition number for separating underdetermined speech mixtures

DocumentCode :

582927

Title :

An improved BLUES with adaptive threshold of condition number for separating underdetermined speech mixtures

Author :

Guo, Tiying ; Lin, Qiuhua ; Gong, Xiaofeng

Author_Institution :

Sch. of Inf. & Commun. Eng., Dalian Univ. of Technol., Dalian, China

fYear :

2012

fDate :

15-17 July 2012

Firstpage :

694

Lastpage :

698

Abstract :

Speech separation has been studied for decades, to which one challenge is the underdetermined problem, where there are more sources than microphones. To solve this problem, Pedersen et al. proposed recently an effective algorithm called BLUES (BLind Underdetermined Extraction of Sources) by combining ICA and time-frequency masking, and it works well on instantaneous/convolutive mixtures of both speech and music. One key ingredient to BLUES is the stopping criterion of the separation process, where the condition number of the outputs is compared with a fixed threshold in the original version. However, as audio recordings are always varying in speech sources and their number, using a fixed threshold would not fit in with these changes, and then deteriorate the overall performance. As such, we propose a threshold update strategy to improve BLUES by adapting the threshold with an increasing rate to find the most suitable condition number. A new criterion based on detection of the number of the sources is then presented to stop the algorithm. The experiments are carried out by using the synthetic and real recorded underdetermined mixtures. The results show that our approach obtains improved performance compared to the original BLUES when the number of the speeches included in the underdetermined mixtures is increased.

Keywords :

audio recording; blind source separation; convolution; independent component analysis; music; performance evaluation; speech processing; time-frequency analysis; BLUES; ICA; adaptive condition number threshold; audio recordings; blind underdetermined extraction of sources; convolutive mixtures; instantaneous mixtures; music; performance improvement; separation process stopping criterion; source detection; speech sources; threshold update strategy; time-frequency masking; underdetermined speech mixture separation; Blind source separation; Independent component analysis; Microphones; Speech; Speech processing; Time frequency analysis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Control and Information Processing (ICICIP), 2012 Third International Conference on

Conference_Location :

Dalian

Print_ISBN :

978-1-4577-2144-1

Type :

conf

DOI :

10.1109/ICICIP.2012.6391505

Filename :

6391505

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=582927