DocumentCode :
3246379
Title :
Efficient Support for Similarity Searches in DHT-Based Peer-to-Peer Systems
Author :
Jun Gao ; Steenkiste, Peter
Author_Institution :
Riverbed Technol. Inc., Fremont
fYear :
2007
fDate :
24-28 June 2007
Firstpage :
1867
Lastpage :
1874
Abstract :
Distributed hash tables (DHTs) provide a scalable and robust building block for content discovery in distributed applications such as Peer-to-Peer (P2P) systems. However, the basic DHT put/get API only supports simple exact queries. In this paper, we present a DHT-based system that efficiently supports similarity queries on multidimensional datasets. Our system embeds a logical kd-tree into the DHT´s identifier space to form a distributed indexing structure, the distributed kd-tree (DKDT). We avoid creating bottlenecks, which are typical in tree- based systems, by relying on fully distributed protocols for tree management and data registrations and queries. We propose tree compressing and node shrinking techniques to efficiently support applications with high dimensionality datasets. Simulation results using both synthetic and real data show the effectiveness of our system.
Keywords :
cryptography; database indexing; peer-to-peer computing; protocols; query processing; tree data structures; application program interface; content discovery; data registration; distributed hash table; distributed kd-tree indexing structure; distributed protocol; multidimensional dataset; node shrinking technique; peer-to-peer system; similarity query search; tree management; Communications Society; Indexing; Large-scale systems; Monitoring; Multidimensional systems; Nearest neighbor searches; Neural networks; Peer to peer computing; Protocols; Robustness;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications, 2007. ICC '07. IEEE International Conference on
Conference_Location :
Glasgow
Print_ISBN :
1-4244-0353-7
Type :
conf
DOI :
10.1109/ICC.2007.311
Filename :
4288982
Link To Document :
بازگشت