Title :
Huffman-DHT: Index Structure Refinement Scheme for P2P Information Retrieval
Author :
Kurasawa, Hisashi ; Takasu, Atsuhiro ; Adachi, Jun
Author_Institution :
Univ. of Tokyo, Tokyo
fDate :
July 28 2008-Aug. 1 2008
Abstract :
Peer-to-peer information retrieval (P2P IR) systems using a distributed index on a distributed hash table (DHT) can make highly precise searches for documents relevant to a query. However, these systems require a heavy index construction cost, and cause unfair index management costs due to the unbalanced term frequency distribution. We propose a new node access scheme for P2P IR that we call Huffman-DHT. Huffman-DHT uses an algorithm similar to Huffman coding, and modifies the DHT structure based on the term distribution. Huffman-DHT distributes the index construction cost among the nodes equally, and achieves load balancing.
Keywords :
Huffman codes; indexing; information retrieval; peer-to-peer computing; table lookup; text analysis; Huffman coding; Huffman-DHT; distributed hash table; distributed index; document search; index management; index structure refinement; load balancing; node access; peer-to-peer information retrieval P2P IR systems; term frequency distribution; Costs; Frequency; Huffman coding; Indexing; Informatics; Information management; Information retrieval; Internet; Peer to peer computing; Telecommunication traffic; Huffman coding; Information Retrieval; Load balancing; Peer-to-Peer;
Conference_Titel :
Applications and the Internet, 2008. SAINT 2008. International Symposium on
Conference_Location :
Turku
Print_ISBN :
978-0-7695-3297-4
DOI :
10.1109/SAINT.2008.26