DocumentCode :
2552438
Title :
Nonnegative CCA for Audiovisual Source Separation
Author :
Sigg, Christian ; Fischer, Bernd ; Ommer, Björn ; Roth, Volker ; Buhmann, Joachim
Author_Institution :
ETH Zurich, Zurich
fYear :
2007
fDate :
27-29 Aug. 2007
Firstpage :
253
Lastpage :
258
Abstract :
We present a method for finding correlated components in audio and video signals. The new technique is applied to the task of identifying sources in video and separating them in audio. The concept of canonical correlation analysis is reformulated such that it incorporates nonnegativity and sparsity constraints on the coefficients of projection directions. Nonnegativity ensures that projections are compatible with an interpretation as energy signals. Sparsity ensures that coefficient weight concentrates on individual sources. By finding multiple conjugate directions we finally obtain a component based decomposition of both data modalities. Experiments effectively demonstrate the potential and benefits of this approach.
Keywords :
audio signal processing; audio-visual systems; correlation methods; iterative methods; source separation; video signal processing; audio signals; audiovisual source separation; canonical correlation analysis; component based decomposition; correlated components; iterated regression; nonnegative CCA; nonnegativity constraints; sparsity constraints; video signals; Face detection; Finite impulse response filter; Frequency; Layout; Microphone arrays; Pixel; Source separation; Speech analysis; Streaming media; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning for Signal Processing, 2007 IEEE Workshop on
Conference_Location :
Thessaloniki
ISSN :
1551-2541
Print_ISBN :
978-1-4244-1566-3
Electronic_ISBN :
1551-2541
Type :
conf
DOI :
10.1109/MLSP.2007.4414315
Filename :
4414315
Link To Document :
بازگشت