DocumentCode :
2806839
Title :
Informed source separation of underdetermined instantaneous stereo mixtures using source index embedding
Author :
Parvaix, Mathieu ; Girin, Laurent
Author_Institution :
Grenoble Lab. of Images, Speech, Signal & Autom. (GIPSA-Lab.), Grenoble Inst. of Technol., Grenoble, France
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
245
Lastpage :
248
Abstract :
In this paper, we address the issue of underdetermined source separation of non-stationary audio sources from a stereo (i.e. 2-channel) linear instantaneous mixture. This problem is addressed with a specific coder-decoder configuration. At the coder, source signals are assumed to be available before the mixing is processed. A time-frequency (TF) analysis of each source enables to select the one or two predominant sources (among I>2) in each TF region, and a corresponding source(s) index code is imperceptibly embedded into the mix signals using a watermarking technique. At the decoder level, where the original sources signals are unknown, the extraction of the watermark enables to locally reduce the underdetermined configuration to an (over)determined configuration. Sources signals can then be estimated using a classical (over)determined separation technique. Thereby several instruments or voice signals can be separated from stereo mixtures, enabling separate manipulation of the source signals during restitution (i.e. remastering).
Keywords :
audio signal processing; codecs; source separation; speech processing; time-frequency analysis; watermarking; coder-decoder configuration; index code; informed source separation; nonstationary audio source; source index embedding; time-frequency analysis; underdetermined instantaneous stereo mixture; voice signal separation; watermark extraction; watermarking technique; Codecs; Data mining; Decoding; Instruments; Multiple signal classification; Signal analysis; Signal processing; Source separation; Speech processing; Watermarking; audio processing; remastering; speech processing; underdetermined source separation; watermarking;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5495983
Filename :
5495983
Link To Document :
بازگشت