DocumentCode :
635400
Title :
An evaluation of content-based duplicate image detection methods for web search
Author :
Thomee, Bart ; Huiskes, Mark J. ; Bakker, Erwin M. ; Lew, Michael S.
Author_Institution :
Yahoo! Res., Barcelona, Spain
fYear :
2013
fDate :
15-19 July 2013
Firstpage :
1
Lastpage :
6
Abstract :
The world wide web is filled with billions of images and duplicates of images can frequently be found on many websites. These duplicates can be exact copies or differ slightly in their visual content. In this paper we provide a comparative study on how well content-based duplicate image detection methods are able to detect the duplicates of a query image. We conduct a survey to better understand in which ways such images on the internet differ from each other and use these observations to form a realistic and challenging duplicate image detection scenario. The methods we evaluate in our study are representative techniques from the research literature. In our evaluation, we target the performance of each method in relation to their descriptor size, description time and matching time, to assess their feasibility of application to large image collections (> 1 million).
Keywords :
Internet; image processing; image retrieval; Internet; Web search; Websites; World Wide Web; content-based duplicate image detection methods; large image collections; query image; Accuracy; Discrete wavelet transforms; Image color analysis; Image representation; Internet; Visualization; Content-based duplicate image detection; image redundancy; web search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo (ICME), 2013 IEEE International Conference on
Conference_Location :
San Jose, CA
ISSN :
1945-7871
Type :
conf
DOI :
10.1109/ICME.2013.6607451
Filename :
6607451
Link To Document :
بازگشت