Title :
Nonnegative CCA for Audiovisual Source Separation
Author :
Sigg, Christian ; Fischer, Bernd ; Ommer, Björn ; Roth, Volker ; Buhmann, Joachim
Author_Institution :
ETH Zurich, Zurich
Abstract :
We present a method for finding correlated components in audio and video signals. The new technique is applied to the task of identifying sources in video and separating them in audio. The concept of canonical correlation analysis is reformulated such that it incorporates nonnegativity and sparsity constraints on the coefficients of projection directions. Nonnegativity ensures that projections are compatible with an interpretation as energy signals. Sparsity ensures that coefficient weight concentrates on individual sources. By finding multiple conjugate directions we finally obtain a component based decomposition of both data modalities. Experiments effectively demonstrate the potential and benefits of this approach.
Keywords :
audio signal processing; audio-visual systems; correlation methods; iterative methods; source separation; video signal processing; audio signals; audiovisual source separation; canonical correlation analysis; component based decomposition; correlated components; iterated regression; nonnegative CCA; nonnegativity constraints; sparsity constraints; video signals; Face detection; Finite impulse response filter; Frequency; Layout; Microphone arrays; Pixel; Source separation; Speech analysis; Streaming media; Vectors;
Conference_Titel :
Machine Learning for Signal Processing, 2007 IEEE Workshop on
Conference_Location :
Thessaloniki
Print_ISBN :
978-1-4244-1566-3
Electronic_ISBN :
1551-2541
DOI :
10.1109/MLSP.2007.4414315