Title :
Separation of harmonic structures based on tied Gaussian mixture model and information criterion for concurrent sounds
Author :
Katmeoka, H. ; Nishimoto, Takuya ; Sagayama, Shigeki
Author_Institution :
Graduate Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Japan
Abstract :
A method for the separation of harmonic structures of cochannel input concurrent sounds is described. A model for multiple harmonic structures is constructed with a mixture of tied Gaussian mixtures, from which a single harmonic structure is modeled. Our algorithm enables estimation of both the number and the shape of the underlying harmonic structures, based on a maximum likelihood estimation of the model parameters using the EM algorithm and an information criterion. It operates without restriction on the number of mixed sounds and varieties of sound sources, and extracts accurate fundamental frequencies continuously with simple procedures in the spectral domain. Experiments showed high performance of the algorithm for both simultaneous speech and polyphonic music.
Keywords :
Gaussian processes; audio signal processing; harmonic analysis; harmonics; maximum likelihood estimation; music; optimisation; spectral analysis; speech processing; EM algorithm; cochannel input concurrent sounds; fundamental frequencies; harmonic structure separation; information criterion; maximum likelihood estimation; mixed sounds; polyphonic music; simultaneous speech; spectral domain; tied Gaussian mixture model; Acoustic signal processing; Frequency; Information science; Instruments; Maximum likelihood estimation; Music; Robustness; Shape; Signal processing algorithms; Speech analysis;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326822