DocumentCode
2379527
Title
Analysis of multimodal signals using redundant representations
Author
Monaci, Gianluca ; Escoda, Oscar Divorra ; Vandergheynst, Pierre
Author_Institution
Signal Process. Inst., Ecole Polytech. Fed. de Lausanne, Switzerland
Volume
3
fYear
2005
fDate
14-14 Sept. 2005
Firstpage
46
Lastpage
49
Abstract
In this work we explore the potentialities of a framework for the representation of audio-visual signals using decompositions on overcomplete dictionaries. Redundant decompositions may describe audio-visual sequences in a concise fashion, preserving good representation properties thanks to the use of redundant, well designed, dictionaries. We expect that this helps us overcome two typical problems of multimodal fusion algorithms. On one hand, classical representation techniques, like pixel-based measures (for the video) or Fourier-like transforms (for the audio), take into account only marginally the physics of the problem. On the other hand, the input signals have large dimensionality. The results we obtain by making use of sparse decompositions of audio-visual signals over redundant codebooks are encouraging and show the potentialities of the proposed approach to multimodal signal representation.
Keywords
audio signal processing; signal representation; signal resolution; Fourier-like transforms; audio-visual signals; multimodal fusion algorithms; multimodal signals; overcomplete dictionaries; pixel-based measures; redundant representations; Cepstral analysis; Dictionaries; Fourier transforms; Matching pursuit algorithms; Mutual information; Physics; Signal analysis; Signal processing; Signal processing algorithms; Signal representations;
fLanguage
English
Publisher
ieee
Conference_Titel
Image Processing, 2005. ICIP 2005. IEEE International Conference on
Conference_Location
Genova
Print_ISBN
0-7803-9134-9
Type
conf
DOI
10.1109/ICIP.2005.1530349
Filename
1530349
Link To Document