DocumentCode :
660911
Title :
A Similarity Search System Based on the Hamming Distance of Social Profiles
Author :
da Silva Villaca, R. ; Bernardes de Paula, Luciano ; Pasquini, R. ; Ferreira Magalhaes, Mauricio
Author_Institution :
Sch. of Electr. & Comput. Eng. (FEEC), UNICAMP, Campinas, Brazil
fYear :
2013
fDate :
16-18 Sept. 2013
Firstpage :
90
Lastpage :
93
Abstract :
The goal of a similarity search system is to allow users to retrieve data that presents a required similarity level in a certain dataset. For example, such dataset may be applied in the social media scenario, where huge amounts of data represent users in a social network. This paper uses a Vector Space Model (VSM) to represent users´ profiles and the Random Hyper plane Hashing (RHH) function to create indexes for them. Both VSM and RHH compose an alternative to address the challenge of performing similarity searches over the huge amount of data present in the social media scenario: the Hamming similarity. In order to evaluate the effectiveness of our proposal, this paper brings examples of reference profiles, used for performing queries, and presents results regarding the correlation between cosine and Hamming similarity and the frequency distribution of Hamming distances among identifiers of users´ profiles. In short, the results indicate that Hamming similarity can be useful for the development of similarity search systems for social media.
Keywords :
query formulation; social networking (online); Hamming distances; Hamming similarity; RHH function; VSM; data retrieval; frequency distribution; queries; random hyper plane hashing; reference profiles; similarity level; similarity search systems; similarity searches; social media; social network; social profiles; users profiles; vector space model; Correlation; Databases; Hamming distance; Measurement; Prototypes; Social network services; Vectors; Hamming distance; RHH; Similarity Search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Semantic Computing (ICSC), 2013 IEEE Seventh International Conference on
Conference_Location :
Irvine, CA
Type :
conf
DOI :
10.1109/ICSC.2013.24
Filename :
6693499
Link To Document :
بازگشت