DocumentCode
2574802
Title
Source enumeration of speech mixtures using pitch harmonics
Author
Gilbert, Keith D. ; Payton, Karen L.
Author_Institution
Electr. & Comput. Eng. Dept., Univ. of Massachusetts Dartmouth, Dartmouth, MA, USA
fYear
2009
fDate
18-21 Oct. 2009
Firstpage
89
Lastpage
92
Abstract
This paper proposes a method to simultaneously estimate the number, pitches, and relative locations of individual speech sources within instantaneous and non-instantaneous linear mixtures containing additive white Gaussian noise. The algorithm makes no assumptions about the number of sources or the number of sensors, and is therefore applicable to over-, under-, and precisely-determined scenarios. The method is hypothesis-based and employs a power-spectrum-based FIR filter derived from probability distributions of speech pitch harmonics. This harmonic windowing function (HWF) dramatically improves time-difference of arrival (TDOA) estimates over standard cross-correlation for low SNR. The pitch estimation component of the algorithm implicitly performs voiced-region detection and does not require prior knowledge about voicing. Cumulative pitch and TDOA estimates from the HWF form the basis for robust source enumeration across a wide range of SNR.
Keywords
AWGN; FIR filters; correlation methods; harmonics; signal detection; speech processing; statistical distributions; time-of-arrival estimation; SNR; additive white Gaussian noise; cross-correlation; noninstantaneous linear mixture; pitch estimation component; power-spectrum-based FIR filter; probability distribution; speech mixture source enumeration; speech pitch harmonic windowing function; time-difference-of-arrival estimation; voiced-region detection; Acoustical engineering; Application software; Conferences; Frequency estimation; Histograms; Matrix decomposition; Power harmonic filters; Resonance; Speech; USA Councils; Source enumeration; linear mixtures; multi-pitch extraction; pitch harmonics; real-time;
fLanguage
English
Publisher
ieee
Conference_Titel
Applications of Signal Processing to Audio and Acoustics, 2009. WASPAA '09. IEEE Workshop on
Conference_Location
New Paltz, NY
ISSN
1931-1168
Print_ISBN
978-1-4244-3678-1
Electronic_ISBN
1931-1168
Type
conf
DOI
10.1109/ASPAA.2009.5346491
Filename
5346491
Link To Document