DocumentCode :
2209146
Title :
QMAS: Querying, Mining and Summarization of Multi-modal Databases
Author :
Cordeiro, Robson L F ; Guo, Fan ; Haverkamp, Donna S. ; Horne, James H. ; Hughes, Ellen K. ; Kim, Gunhee ; Traina, Agma J M ; Traina, Caetano ; Faloutsos, Christos
Author_Institution :
Univ. of Sao Paulo, São Carlos, Brazil
fYear :
2010
fDate :
13-17 Dec. 2010
Firstpage :
785
Lastpage :
790
Abstract :
Given a large collection of images, very few of which have labels, how can we guess the labels of the remaining majority, and how can we spot those images that need brand new labels, different from the existing ones? Current automatic labeling techniques usually scale super linearly with the data size, and/or they fail when only a tiny amount of labeled data is provided. In this paper, we propose QMAS (Querying, Mining And Summarization of Multi-modal Databases), a fast solution to the following problems: (i) low-labor labeling (L3) - given a collection of images, very few of which are labeled with keywords, find the most suitable labels for the remaining ones, and (ii) mining and attention routing - in the same setting, find clusters, the top-NO outlier images, and the top-NR representative images. We report experiments on real satellite images, two large sets (1.5 GB and 2.25 GB) of proprietary images and a smaller set (17 MB) of public images. We show that QMAS scales linearly with the data size, being up to 40 times faster than top competitors (GCap), obtaining better or equal accuracy. In contrast to other methods, QMAS does low-labor labeling (L3), that is, it works even with tiny initial label sets. It also solves both presented problems and spots tiles that potentially require new labels.
Keywords :
data mining; image recognition; query processing; visual databases; QMAS; image collection; labeling technique; low labor labeling; multimodal databases mining; multimodal databases querying; multimodal databases summarization; public image; satellite image; Automatic labeling; Clustering; Summarization and Multi-modal databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2010 IEEE 10th International Conference on
Conference_Location :
Sydney, NSW
ISSN :
1550-4786
Print_ISBN :
978-1-4244-9131-5
Electronic_ISBN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2010.150
Filename :
5694039
Link To Document :
بازگشت