• DocumentCode
    42732
  • Title

    Learning Joint Intensity-Depth Sparse Representations

  • Author

    Tosic, Ivana ; Drewes, Sarah

  • Author_Institution
    Helen Wills Neurosci. Inst., Berkeley, CA, USA
  • Volume
    23
  • Issue
    5
  • fYear
    2014
  • fDate
    May-14
  • Firstpage
    2122
  • Lastpage
    2132
  • Abstract
    This paper presents a method for learning overcomplete dictionaries of atoms composed of two modalities that describe a 3D scene: 1) image intensity and 2) scene depth. We propose a novel joint basis pursuit (JBP) algorithm that finds related sparse features in two modalities using conic programming and we integrate it into a two-step dictionary learning algorithm. The JBP differs from related convex algorithms because it finds joint sparsity models with different atoms and different coefficient values for intensity and depth. This is crucial for recovering generative models where the same sparse underlying causes (3D features) give rise to different signals (intensity and depth). We give a bound for recovery error of sparse coefficients obtained by JBP, and show numerically that JBP is superior to the group lasso algorithm. When applied to the Middlebury depth-intensity database, our learning algorithm converges to a set of related features, such as pairs of depth and intensity edges or image textures and depth slants. Finally, we show that JBP outperforms state of the art methods on depth inpainting for time-of-flight and Microsoft Kinect 3D data.
  • Keywords
    convex programming; image representation; image texture; learning (artificial intelligence); numerical analysis; 3D scene; JBP algorithm; Microsoft Kinect 3D data; Middlebury depth-intensity database; conic programming; convex algorithms; depth inpainting; generative models; group lasso algorithm; image intensity; image textures; intensity edges; joint basis pursuit algorithm; joint sparsity models; learning joint intensity-depth sparse representations; learning overcomplete dictionaries; recovery error; scene depth; sparse coefficients; time-of-flight; two-step dictionary learning algorithm; Approximation methods; Couplings; Dictionaries; Indexes; Joints; Three-dimensional displays; Vectors; Sparse approximations; dictionary learning; hybrid image-depth sensors;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/TIP.2014.2312645
  • Filename
    6775324