DocumentCode
42732
Title
Learning Joint Intensity-Depth Sparse Representations
Author
Tosic, Ivana ; Drewes, Sarah
Author_Institution
Helen Wills Neurosci. Inst., Berkeley, CA, USA
Volume
23
Issue
5
fYear
2014
fDate
May-14
Firstpage
2122
Lastpage
2132
Abstract
This paper presents a method for learning overcomplete dictionaries of atoms composed of two modalities that describe a 3D scene: 1) image intensity and 2) scene depth. We propose a novel joint basis pursuit (JBP) algorithm that finds related sparse features in two modalities using conic programming and we integrate it into a two-step dictionary learning algorithm. The JBP differs from related convex algorithms because it finds joint sparsity models with different atoms and different coefficient values for intensity and depth. This is crucial for recovering generative models where the same sparse underlying causes (3D features) give rise to different signals (intensity and depth). We give a bound for recovery error of sparse coefficients obtained by JBP, and show numerically that JBP is superior to the group lasso algorithm. When applied to the Middlebury depth-intensity database, our learning algorithm converges to a set of related features, such as pairs of depth and intensity edges or image textures and depth slants. Finally, we show that JBP outperforms state of the art methods on depth inpainting for time-of-flight and Microsoft Kinect 3D data.
Keywords
convex programming; image representation; image texture; learning (artificial intelligence); numerical analysis; 3D scene; JBP algorithm; Microsoft Kinect 3D data; Middlebury depth-intensity database; conic programming; convex algorithms; depth inpainting; generative models; group lasso algorithm; image intensity; image textures; intensity edges; joint basis pursuit algorithm; joint sparsity models; learning joint intensity-depth sparse representations; learning overcomplete dictionaries; recovery error; scene depth; sparse coefficients; time-of-flight; two-step dictionary learning algorithm; Approximation methods; Couplings; Dictionaries; Indexes; Joints; Three-dimensional displays; Vectors; Sparse approximations; dictionary learning; hybrid image-depth sensors;
fLanguage
English
Journal_Title
Image Processing, IEEE Transactions on
Publisher
ieee
ISSN
1057-7149
Type
jour
DOI
10.1109/TIP.2014.2312645
Filename
6775324
Link To Document