Title :
Multi-cue Normalized Non-Negative Sparse Encoder for image classification
Author :
Shizhou Zhang ; Jinjun Wang ; Yudong Liang ; Yihong Gong ; Nanning Zheng
Author_Institution :
Inst. of Artificial Intell. & Robot., Xi´an Jiaotong Univ., Xi´an, China
fDate :
June 29 2015-July 3 2015
Abstract :
Recently, the sparse coding based image representation has achieved state-of-the-art recognition results on many benchmarks. In this paper, we propose Multi-cue Normalized Non-Negative Sparse Encoder (MN3SE) which enforces both the non-negative constraint and the shift-invariant constraint on top of the traditional sparse coding criteria, and takes multi-cue to further boost the performance. The former constraint reduces information loose by the negative coefficients and improves the coding stability, and the latter allows the sparseness to be self-adaptive to the local feature. The proposed coding scheme is then approximated by an neural network based encoder for speed-up. More importantly, the multi-layer neural network architecture allows us to apply a multi-task learning strategy to fuse information from multi-cue. Specifically, we take one type of descriptor, such as SIFT as the input, and enforce the learned encoder to produce sparse code that can reconstruct not only SIFT but also other types of descriptors such as color moments. In this way, we could achieve not only 10 to 33 times speed up for sparse-coding, the multi-cue enforced learning strategy gives the image feature extracted by MN3SE superior image classification accuracy.
Keywords :
feature extraction; image classification; image coding; image reconstruction; learning (artificial intelligence); neural nets; transforms; MN3SE; SIFT; coding stability; descriptor reconstruction; image classification; image feature extraction; multicue enforced learning strategy; multicue normalized nonnegative sparse encoder; multilayer neural network architecture; multitask learning strategy; negative coefficients; nonnegative constraint; shift-invariant constraint; sparse coding criteria; Accuracy; Encoding; Feature extraction; Image coding; Image color analysis; Image reconstruction; Training; Image classification; Multi-cue; Non-Negative constraint; Shift-invariant constraint; Sparse Encoder;
Conference_Titel :
Multimedia and Expo (ICME), 2015 IEEE International Conference on
Conference_Location :
Turin
DOI :
10.1109/ICME.2015.7177531