Title of article :
ObjectPatchNet: Towards scalable and semantic image annotation and retrieval
Author/Authors :
Zhang، نويسنده , , Shiliang and Tian، نويسنده , , Qi and Hua، نويسنده , , Gang and Huang، نويسنده , , Qingming and Gao، نويسنده , , Wen، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2014
Pages :
14
From page :
16
To page :
29
Abstract :
The ever increasing Internet image collection densely samples the real world objects, scenes, etc. and is commonly accompanied with multiple metadata such as textual descriptions and user comments. Such image data has potential to serve as a knowledge source for large-scale image applications. Facilitated by such publically available and ever-increasing loosely annotated image data on the Internet, we propose a scalable data-driven solution for annotating and retrieving Web-scale image data. We extrapolate from large-scale loosely annotated images a compact and informative representation, namely ObjectPatchNet. Each vertex in ObjectPatchNet, which is called as an ObjectPatchNode, is defined as a collection of discriminative image patches annotated with object category labels. The edge linking two ObjectPatchNodes models the co-occurrence relationship among different objects in the same image. Therefore, ObjectPatchNet models not only probabilistically labeled image patches, but also the contextual relationship between objects. It is well suited to scalable image annotation task. Besides, we further take ObjectPatchNet as a visual vocabulary with semantic labels, and hence are able to easily develop inverted file indexing for efficient semantic image retrieval. ObjectPatchNet is tested on both large-scale image annotation and large-scale image retrieval applications. Experimental results manifest that ObjectPatchNet is both discriminative and efficient in these applications.
Keywords :
Visual vocabulary , Large-scale image retrieval , Image annotation
Journal title :
Computer Vision and Image Understanding
Serial Year :
2014
Journal title :
Computer Vision and Image Understanding
Record number :
1697087
Link To Document :
بازگشت