DocumentCode
2726398
Title
A Novel Approach to Collect Training Images from WWW for Image Thesaurus Building
Author
Park, Joohyoun ; Nang, Jongho
Author_Institution
Dept. of Comput. Sci. & Eng., Sogang Univ., Seoul
fYear
2007
fDate
1-5 April 2007
Firstpage
301
Lastpage
306
Abstract
This paper introduces a novel approach to change gathered images from WWW into training images to build an image thesaurus. The requirements for being training images are a large number of images and with highly relevant to a given concept. To fulfil these requirements, a system should be able to collect a large number of relevant images to a given concept from WWW by the proposed criterion of relevance to the concept for each image. Then, the irrelevant images would be filtered out by the modified hierarchical clustering method based on the weighted combination of 5 MPEG-7 visual descriptors and the proposed criterion of relevance to the concept for each cluster. Upon experimental results, the precision of the set of images generated by the proposed method is about 18% higher than that of the set of images generated by other methods
Keywords
Internet; content-based retrieval; image coding; image retrieval; information filtering; thesauri; MPEG-7 visual descriptors; World Wide Web; auto image annotation; content based image retrieval; hierarchical clustering; image filtering; image thesaurus building; training image collection; Computational intelligence; Content based retrieval; HTML; Image analysis; Image generation; Image retrieval; MPEG 7 Standard; Signal processing; Thesauri; World Wide Web; Auto Image Annotation; Content Based Image Retrieval;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence in Image and Signal Processing, 2007. CIISP 2007. IEEE Symposium on
Conference_Location
Honolulu, HI
Print_ISBN
1-4244-0707-9
Type
conf
DOI
10.1109/CIISP.2007.369185
Filename
4221435
Link To Document