• DocumentCode
    294683
  • Title
    Co-channel speaker separation
  • Author
    Morgan, David P. ; George, E.B. ; Lee, Leonard T. ; Kay, Stephen M.
  • Author_Institution
    Signal Process. Center of Technol., Lockheed Sanders Inc., Nashua, NH, USA
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    828
  • Abstract
    This paper describes a system for the automatic separation of two-talker co-channel speech. This system is based on a frame-by-frame speaker separation algorithm that exploits a pitch estimate of the stronger talker derived from the co-channel signal. The concept underlying this approach is to recover the stronger talker's speech by enhancing harmonic frequencies and formants given a multi-resolution pitch estimate. The weaker talker's speech is obtained from the residual signal created when the harmonics and formants of the stronger talker are suppressed. A maximum likelihood speaker assignment algorithm is used to place the recovered frames from the target and interfering talkers in separate channels. The system has been tested at target-to-interferer ratios (TIRs) from -18 to 18 dB with human listening tests, and with machine-based tests employing a keyword spotting system on the Switchboard Corpus for target talkers at 6, 12, and 18 dB TIR.
  • Keywords
    cochannel interference; harmonics; interference suppression; maximum likelihood estimation; speech enhancement; Switchboard Corpus; automatic separation; co-channel speaker separation; formants; frame-by-frame speaker separation algorithm; harmonic frequency enhancement; human listening tests; keyword spotting system; machine-based tests; maximum likelihood speaker assignment algorithm; multi-resolution pitch estimate; target-to-interferer ratios; two-talker co-channel speech; Frequency estimation; Humans; Maximum likelihood detection; Maximum likelihood estimation; Power harmonic filters; Signal processing; Signal processing algorithms; Speech analysis; Speech enhancement; Speech processing; System testing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type
    conf
  • DOI
    10.1109/ICASSP.1995.479822
  • Filename
    479822
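
The harmonic-enhancement idea in the abstract can be illustrated with a toy single-frame sketch. This is not the authors' system: it assumes the stronger talker's pitch is already known (the paper estimates it from the co-channel signal), it ignores formants, the multi-resolution pitch estimate, and the maximum likelihood speaker assignment, and it uses synthetic harmonic tones rather than speech. The sample rate, frame length, pitch values, tolerance, and all function names (`harmonic_tone`, `split_frame`, `tir_db`, and so on) are hypothetical choices made for illustration only.

```python
import cmath
import math

FS = 8000   # assumed sample rate in Hz
N = 200     # assumed frame length (25 ms), giving 40 Hz DFT bin spacing


def harmonic_tone(f0, amp, n_harmonics=3):
    """Toy stand-in for a voiced frame: equal-amplitude harmonics of f0."""
    return [
        sum(amp * math.sin(2 * math.pi * h * f0 * n / FS)
            for h in range(1, n_harmonics + 1))
        for n in range(N)
    ]


def dft(x):
    """Naive O(n^2) discrete Fourier transform (fine for one short frame)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spec):
    """Inverse DFT; returns the real part since the masks used are symmetric."""
    n = len(spec)
    return [(sum(spec[k] * cmath.exp(2j * math.pi * k * t / n)
                 for k in range(n)) / n).real
            for t in range(n)]


def is_harmonic_bin(k, n_bins, f0, tol_hz=10.0):
    """True if bin k lies within tol_hz of a non-zero multiple of f0."""
    freq = min(k, n_bins - k) * FS / n_bins   # fold negative-frequency bins
    h = round(freq / f0)
    return h >= 1 and abs(freq - h * f0) <= tol_hz


def split_frame(frame, strong_f0):
    """Keep bins at the stronger talker's harmonics; the rest is the residual."""
    spec = dft(frame)
    keep = [is_harmonic_bin(k, len(spec), strong_f0) for k in range(len(spec))]
    strong = idft([s if m else 0 for s, m in zip(spec, keep)])
    residual = idft([0 if m else s for s, m in zip(spec, keep)])
    return strong, residual


def tir_db(target, interferer):
    """Target-to-interferer ratio in dB, as a ratio of signal energies."""
    p_t = sum(v * v for v in target)
    p_i = sum(v * v for v in interferer)
    return 10 * math.log10(p_t / p_i)


# Two synthetic "talkers" with different pitch, mixed into one channel.
strong_src = harmonic_tone(200.0, 1.0)
weak_src = harmonic_tone(280.0, 0.5)
mix = [a + b for a, b in zip(strong_src, weak_src)]

strong_est, weak_est = split_frame(mix, strong_f0=200.0)
print("TIR of the mixture (dB):", round(tir_db(strong_src, weak_src), 2))
```

The 2:1 amplitude ratio between the two tones corresponds to a TIR of roughly 6 dB, the lowest of the machine-test conditions listed in the abstract. The separation is exact here only because every harmonic falls on its own DFT bin. Real speech has pitch variation, spectral leakage, and overlapping harmonics, which is presumably why the paper's system adds formant handling and a speaker assignment stage.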