مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

542184

Title :

Co-channel speaker segment separation

Author :

Smolenski, Brett Y ; Yantorno, Robert E. ; Benincasa, Daniel S. ; Wenndt, Stanley J.

Author_Institution :

Temple University/ECE Dept. 12th & Norris Streets, Philadelphia, Pa 19122-6077, USA

Volume :

fYear :

2002

fDate :

13-17 May 2002

Abstract :

A novel approach to co-channel speaker separation is presented here. The technique uses the statistical properties of combinations of high Target-to-Interferer Ratio (TIR) speech segments, which were extracted from a 0 dB overall TIR co-channel utterance. The problem is broken down into making three simpler decisions. First, closed-set speaker identification technology is used on combinations of high TIR speech segments to determine which speakers are generating the co-channel speech. Next, the proportion of segments belonging to each speaker is estimated using a bimodal model. Lastly, a maximum likelihood decision is made as to which two combinations of segments best represent the two speakers. Using this approach at least one of the speakers could readily be identified when the speaker contributed a segment that was 160 ms or more in length. Once the speakers were determined, greater than 90% reliable speaker separation was obtained.

Keywords :

Artificial intelligence; Atmospheric modeling; Databases; Force; Laboratories; Speech; Speech processing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on

Conference_Location :

Orlando, FL, USA

ISSN :

1520-6149

Print_ISBN :

0-7803-7402-9

Type :

conf

DOI :

10.1109/ICASSP.2002.5743670

Filename :

5743670

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=542184