مرکز منطقه ای اطلاع رساني علوم و فناوري - Unbalanced learning in content-based image classification and retrieval

DocumentCode :

2286035

Title :

Unbalanced learning in content-based image classification and retrieval

Author :

Piras, Luca ; Giacinto, Giorgio

Author_Institution :

Dept. of Electr. & Electron. Eng., Univ. of Cagliari, Cagliari, Italy

fYear :

2010

fDate :

19-23 July 2010

Firstpage :

Lastpage :

Abstract :

Nowadays very large archives of digital images can be easily produced thanks to the availability of digital cameras as standalone devices, or embedded into a number of portable devices. Each personal computer is typically a repository for thousands of images, while the Internet can be seen as a very large repository. One of the most severe problems in the classification and retrieval of images from very large repositories is the very limited number of elements belonging to each semantic class compared to the number of images in the repository. As a consequence, an even smaller fraction of images per semantic class can be used as training set in a classification problem, or as a query in a content-based image retrieval problem. In this paper we propose a technique aimed at artificially increasing the number of examples in the training set in order to improve the learning capabilities, reducing the unbalance between the semantic class of interest, and all other images. The proposed approach is tailored to classification and relevance feedback techniques based on the Nearest-Neighbor paradigm. A number of new points in the feature space are created based on the available training patterns, so that they better represent the distribution of the semantic class of interest. These new points are created according to the k-NN paradigm, and take into account both relevant and non-relevant images with respect to the semantic class of interest. The proposed approach allows increasing the generalization capability of NN techniques, and mitigates the risk of classifier over-training on few patterns. Reported experiments show the effectiveness of the proposed technique in Content-Based Image Retrieval tasks, where the Nearest-Neighbor approach is used to exploit user´s relevance feedback. The improvement in precision and recall gained in one feature space allows also to outperform the improvement in performances attained by combining different feature spaces.

Keywords :

content-based retrieval; image classification; learning (artificial intelligence); relevance feedback; Internet; classifier overtraining; content-based image classification; content-based image retrieval problem; digital cameras; k-NN paradigm; nearest-neighbor paradigm; personal computer; relevance feedback techniques; unbalanced learning; Artificial neural networks; Image color analysis; Image retrieval; Noise; Noise measurement; Semantics; Training; Artificial Pattern Injection; Image Classification; Image Retrieval; Small Sample-Size; Unbalanced Learning;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Multimedia and Expo (ICME), 2010 IEEE International Conference on

Conference_Location :

Suntec City

ISSN :

1945-7871

Print_ISBN :

978-1-4244-7491-2

Type :

conf

DOI :

10.1109/ICME.2010.5583045

Filename :

5583045

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2286035