Title :
Learning Midlevel Image Features for Natural Scene and Texture Classification
Author :
Le Borgne, Hervé ; Guérin-Dugué, Anne ; O´Connor, Noel E.
Author_Institution :
Commissariat A I´´Energie Atornique (CEA-LIST), Paris
fDate :
3/1/2007 12:00:00 AM
Abstract :
This paper deals with coding of natural scenes in order to extract semantic information. We present a new scheme to project natural scenes onto a basis in which each dimension encodes statistically independent information. Basis extraction is performed by independent component analysis (ICA) applied to image patches culled from natural scenes. The study of the resulting coding units (coding filters) extracted from well-chosen categories of images shows that they adapt and respond selectively to discriminant features in natural scenes. Given this basis, we define global and local image signatures relying on the maximal activity of filters on the input image. Locally, the construction of the signature takes into account the spatial distribution of the maximal responses within the image. We propose a criterion to reduce the size of the space of representation for faster computation. The proposed approach is tested in the context of texture classification (111 classes), as well as natural scenes classification (11 categories, 2037 images). Using a common protocol, the other commonly used descriptors have at most 47.7% accuracy on average while our method obtains performances of up to 63.8%. We show that this advantage does not depend on the size of the signature and demonstrate the efficiency of the proposed criterion to select ICA filters and reduce the dimension
Keywords :
feature extraction; filtering theory; image classification; image coding; image texture; independent component analysis; natural scenes; coding filters; image patches; image signatures; independent component analysis; learning midlevel image features; natural scenes coding; semantic information extraction; spatial distribution; texture classification; Content based retrieval; Data mining; Filters; Humans; Image coding; Image databases; Image retrieval; Independent component analysis; Layout; Testing; Gabor approximation; Independent component analysis (ICA); natural scene analysis; sparse coding;
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
DOI :
10.1109/TCSVT.2007.890635