Title :
Density-based similarity measures for content based search
Author :
Porter, Reid ; Ruggiero, Christy ; Hush, Don
Author_Institution :
Los Alamos Nat. Lab., Los Alamos, NM, USA
Abstract :
We consider the query by multiple example problem where the goal is to identify database samples whose content is similar to a collection of query samples. To assess the similarity we use a relative content density which quantifies the relative concentration of the query distribution to the database distribution. If the database distribution is a mixture of the query distribution and a background distribution then it can be shown that database samples whose relative content density is greater than a particular threshold ¿ are more likely to have been generated by the query distribution than the background distribution. We describe an algorithm for predicting samples with relative content density greater than ¿ that is computationally efficient and possesses strong performance guarantees. We also show empirical results for applications in computer network monitoring and image segmentation.
Keywords :
content-based retrieval; database management systems; background distribution; computer network monitoring; content based search; database distribution; density-based similarity measure; image segmentation; query by multiple example; query distribution; relative content density; Application software; Computer networks; Computerized monitoring; Density measurement; Image databases; Image segmentation; Laboratories; Prediction algorithms; Q measurement; Spatial databases;
Conference_Titel :
Signals, Systems and Computers, 2009 Conference Record of the Forty-Third Asilomar Conference on
Conference_Location :
Pacific Grove, CA
Print_ISBN :
978-1-4244-5825-7
DOI :
10.1109/ACSSC.2009.5469837