مرکز منطقه ای اطلاع رساني علوم و فناوري - Semantic Bag-of-Words Models for Visual Concept Detection and Annotation

DocumentCode :

2729221

Title :

Semantic Bag-of-Words Models for Visual Concept Detection and Annotation

Author :

Yu Zhang ; Bres, Stephane ; Liming Chen

Author_Institution :

LIRIS, Univ. de Lyon, Lyon, France

fYear :

2012

fDate :

25-29 Nov. 2012

Firstpage :

289

Lastpage :

295

Abstract :

This paper presents a novel method for building textual feature defined on semantic distance and describes multi-model approach for Visual Concept Detection and Annotation(VCDA). Nowadays, the tags associated with images have been popularly used in the VCDA task, because they contain valuable information about image content that can hardly be described by low-level visual features. Traditionally the term frequencies model is used to capture this useful text information. However, the shortcoming in the term frequencies model lies that the valuable semantic information can not be captured. To solve this problem, we propose the semantic bag-of-words(BoW) model which use WordNet-based distance to construct the codebook and assign the tags. The advantages of this approach are two-fold: (1) It can capture tags semantic information that is hardly described by the term frequencies model. (2) It solves the high dimensionality issue of the codebook vocabulary construction, reducing the size of the tags representation. Furthermore, we employ a strong Multiple Kernel Learning (MKL) classifier to fuse the visual model and the text model. The experimental results on the Image CLEF 2011 show that our approach effectively improves the recognition accuracy.

Keywords :

image classification; image representation; learning (artificial intelligence); Image CLEF 2011; WordNet-based distance; codebook vocabulary construction; multimodel approach; multiple kernel learning classifier; recognition accuracy; semantic bag-of-words model; semantic distance; tags representation; tags semantic information; text model; textual feature; visual concept annotation; visual concept detection; visual model; Biological system modeling; Dictionaries; Feature extraction; Kernel; Semantics; Vectors; Visualization; Multiple Kernel Learning; Word-Net; multi-model;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal Image Technology and Internet Based Systems (SITIS), 2012 Eighth International Conference on

Conference_Location :

Naples

Print_ISBN :

978-1-4673-5152-2

Type :

conf

DOI :

10.1109/SITIS.2012.50

Filename :

6395108

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2729221