Co-channel speaker separation

Author

Morgan, David P. ; George, E.B. ; Lee, Leonard T. ; Kay, Stephen M.

Author_Institution

Signal Process. Center of Technol., Lockheed Sanders Inc., Nashua, NH, USA

Volume

1

fYear

1995

fDate

9-12 May 1995

Firstpage

828

Abstract

This paper describes a system for the automatic separation of two-talker co-channel speech. This system is based on a frame-by-frame speaker separation algorithm that exploits a pitch estimate of the stronger talker derived from the co-channel signal. The concept underlying this approach is to recover the stronger talker's speech by enhancing harmonic frequencies and formants given a multi-resolution pitch estimate. The weaker talker's speech is obtained from the residual signal created when the harmonics and formants of the stronger talker are suppressed. A maximum likelihood speaker assignment algorithm is used to place the recovered frames from the target and interfering talkers in separate channels. The system has been tested at target-to-interferer ratios (TIRs) from -18 to 18 dB with human listening tests, and with machine-based tests employing a keyword spotting system on the Switchboard Corpus for target talkers at 6, 12, and 18 dB TIR.
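The harmonic-enhancement and residual idea in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes the stronger talker's pitch `f0` is already known for the frame, uses a simple binary mask around pitch harmonics, and omits formant enhancement, multi-resolution pitch estimation, and maximum likelihood speaker assignment. The function names (`tir_db`, `scale_to_tir`, `harmonic_split`) and the `half_width_hz` parameter are hypothetical, introduced only for illustration.

```python
import numpy as np


def tir_db(target, interferer):
    """Target-to-interferer ratio in dB (10*log10 of the energy ratio)."""
    return 10.0 * np.log10(np.sum(target ** 2) / np.sum(interferer ** 2))


def scale_to_tir(target, interferer, tir):
    """Rescale the interferer so that target/interferer sits at `tir` dB."""
    # Scaling the interferer by g lowers the TIR by 20*log10(g).
    gain = 10.0 ** ((tir_db(target, interferer) - tir) / 20.0)
    return interferer * gain


def harmonic_split(frame, f0, fs, half_width_hz=20.0):
    """Split one frame into a harmonic part and a residual.

    Assumption: f0 (Hz) is the stronger talker's pitch for this frame.
    Bins within half_width_hz of a positive multiple of f0 are kept as
    the stronger talker; everything else is treated as the residual.
    """
    spectrum = np.fft.rfft(frame)
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / fs)
    nearest = np.round(freqs / f0) * f0          # nearest harmonic per bin
    mask = (np.abs(freqs - nearest) <= half_width_hz) & (nearest > 0)
    stronger = np.fft.irfft(spectrum * mask, n=len(frame))
    residual = frame - stronger                  # harmonics suppressed
    return stronger, residual


if __name__ == "__main__":
    fs = 8000
    t = np.arange(800) / fs                      # one 100 ms frame
    target = sum(np.sin(2 * np.pi * 100 * k * t) for k in (1, 2, 3))
    interferer = np.sin(2 * np.pi * 150 * t) + np.sin(2 * np.pi * 250 * t)
    weaker = scale_to_tir(target, interferer, 6.0)
    stronger_est, weaker_est = harmonic_split(target + weaker, 100.0, fs)
    print("TIR of mixture (dB):", tir_db(target, weaker))
    print("max error, stronger talker:", np.max(np.abs(stronger_est - target)))
```

The synthetic tones in the demo fall exactly on FFT bins, so the split is nearly perfect; real speech has time-varying pitch and overlapping harmonics, which is where the paper's pitch tracking, formant handling, and frame assignment would matter.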

Keywords

cochannel interference; harmonics; interference suppression; maximum likelihood estimation; speech enhancement; Switchboard Corpus; automatic separation; co-channel speaker separation; formants; frame-by-frame speaker separation algorithm; harmonic frequency enhancement; human listening tests; keyword spotting system; machine-based tests; maximum likelihood speaker assignment algorithm; multi-resolution pitch estimate; target-to-interferer ratios; two-talker co-channel speech; Frequency estimation; Humans; Maximum likelihood detection; Maximum likelihood estimation; Power harmonic filters; Signal processing; Signal processing algorithms; Speech analysis; Speech enhancement; Speech processing; System testing

fLanguage

English

Publisher

IEEE

Conference_Title

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location

Detroit, MI

ISSN

1520-6149

Print_ISBN

0-7803-2431-5

Type

conf

DOI

10.1109/ICASSP.1995.479822

Filename

479822