• DocumentCode
    729796
  • Title

    Multi-cue Normalized Non-Negative Sparse Encoder for image classification

  • Author

    Shizhou Zhang ; Jinjun Wang ; Yudong Liang ; Yihong Gong ; Nanning Zheng

  • Author_Institution
    Inst. of Artificial Intell. & Robot., Xi´an Jiaotong Univ., Xi´an, China
  • fYear
    2015
  • fDate
    June 29 2015-July 3 2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Recently, the sparse coding based image representation has achieved state-of-the-art recognition results on many benchmarks. In this paper, we propose Multi-cue Normalized Non-Negative Sparse Encoder (MN3SE) which enforces both the non-negative constraint and the shift-invariant constraint on top of the traditional sparse coding criteria, and takes multi-cue to further boost the performance. The former constraint reduces information loose by the negative coefficients and improves the coding stability, and the latter allows the sparseness to be self-adaptive to the local feature. The proposed coding scheme is then approximated by an neural network based encoder for speed-up. More importantly, the multi-layer neural network architecture allows us to apply a multi-task learning strategy to fuse information from multi-cue. Specifically, we take one type of descriptor, such as SIFT as the input, and enforce the learned encoder to produce sparse code that can reconstruct not only SIFT but also other types of descriptors such as color moments. In this way, we could achieve not only 10 to 33 times speed up for sparse-coding, the multi-cue enforced learning strategy gives the image feature extracted by MN3SE superior image classification accuracy.
  • Keywords
    feature extraction; image classification; image coding; image reconstruction; learning (artificial intelligence); neural nets; transforms; MN3SE; SIFT; coding stability; descriptor reconstruction; image classification; image feature extraction; multicue enforced learning strategy; multicue normalized nonnegative sparse encoder; multilayer neural network architecture; multitask learning strategy; negative coefficients; nonnegative constraint; shift-invariant constraint; sparse coding criteria; Accuracy; Encoding; Feature extraction; Image coding; Image color analysis; Image reconstruction; Training; Image classification; Multi-cue; Non-Negative constraint; Shift-invariant constraint; Sparse Encoder;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo (ICME), 2015 IEEE International Conference on
  • Conference_Location
    Turin
  • Type

    conf

  • DOI
    10.1109/ICME.2015.7177531
  • Filename
    7177531