• DocumentCode
    2574802
  • Title
    Source enumeration of speech mixtures using pitch harmonics
  • Author
    Gilbert, Keith D.; Payton, Karen L.
  • Author_Institution
    Electr. & Comput. Eng. Dept., Univ. of Massachusetts Dartmouth, Dartmouth, MA, USA
  • fYear
    2009
  • fDate
    18-21 Oct. 2009
  • Firstpage
    89
  • Lastpage
    92
  • Abstract
    This paper proposes a method to simultaneously estimate the number, pitches, and relative locations of individual speech sources within instantaneous and non-instantaneous linear mixtures containing additive white Gaussian noise. The algorithm makes no assumptions about the number of sources or the number of sensors, and is therefore applicable to over-, under-, and precisely-determined scenarios. The method is hypothesis-based and employs a power-spectrum-based FIR filter derived from probability distributions of speech pitch harmonics. This harmonic windowing function (HWF) dramatically improves time-difference-of-arrival (TDOA) estimates over standard cross-correlation at low SNR. The pitch estimation component of the algorithm implicitly performs voiced-region detection and does not require prior knowledge about voicing. Cumulative pitch and TDOA estimates from the HWF form the basis for robust source enumeration across a wide range of SNRs.
  • Keywords
    AWGN; FIR filters; correlation methods; harmonics; signal detection; speech processing; statistical distributions; time-of-arrival estimation; SNR; additive white Gaussian noise; cross-correlation; noninstantaneous linear mixture; pitch estimation component; power-spectrum-based FIR filter; probability distribution; speech mixture source enumeration; speech pitch harmonic windowing function; time-difference-of-arrival estimation; voiced-region detection; Acoustical engineering; Application software; Conferences; Frequency estimation; Histograms; Matrix decomposition; Power harmonic filters; Resonance; Speech; USA Councils; Source enumeration; linear mixtures; multi-pitch extraction; pitch harmonics; real-time;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Applications of Signal Processing to Audio and Acoustics, 2009. WASPAA '09. IEEE Workshop on
  • Conference_Location
    New Paltz, NY
  • ISSN
    1931-1168
  • Print_ISBN
    978-1-4244-3678-1
  • Electronic_ISBN
    1931-1168
  • Type
    conf
  • DOI
    10.1109/ASPAA.2009.5346491
  • Filename
    5346491