DocumentCode :
2350304
Title :
Applying Information Retrieval to Distributed Hash Table (DHT) Systems
Author :
Fantar, Sonia Gaied ; Youssef, Habib
Author_Institution :
Res. Unit Prince, Univ. of Sousse, Sousse, Tunisia
fYear :
2011
fDate :
9-13 May 2011
Firstpage :
1
Lastpage :
7
Abstract :
Distributed peer-to-peer systems such as Chord allow peers to perform efficient searches using object identifiers rather than keywords. More specifically, they use a specific structure together with a hashing scheme so that a peer can perform an object lookup and receive in return the address of the node storing the object. A lookup follows a path that moves progressively closer to the destination. These systems have been designed to optimize object retrieval by minimizing the number of messages and hops required to retrieve the object. The disadvantage is that they consider only the problem of searching for keys, and thus cannot capture the relevance of the documents stored in the system. This problem is common to traditional distributed hash tables (DHTs) because they usually ignore information retrieval algorithms and therefore rely on keyword-based searches. In this paper, we first propose to augment the P2P DHT system Chord with mechanisms for locating data using the information retrieval technique LSI, to facilitate content-based full-text search in large distributed information systems. Chord-LSI uses latent semantic indexing (LSI) to guide content placement in Chord such that documents relevant to a query are likely to be co-located on a small number of nodes. During a search, Chord-LSI transmits a small amount of data and searches a small number of nodes. Simulation results show that the Chord-LSI model is 17% more effective than Chord models.
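The identifier-based lookup that the abstract attributes to Chord can be illustrated with a minimal sketch. It is not the authors' Chord-LSI implementation and omits finger tables and the LSI-guided placement entirely. The names `chord_id`, `Ring`, the 16-bit identifier space, and the node and object names are all assumptions made for illustration.

```python
import hashlib
from bisect import bisect_left

M = 16  # identifier bits (assumption; kept small for illustration)


def chord_id(name: str, m: int = M) -> int:
    """Hash a node or object name onto the 2**m identifier circle."""
    digest = hashlib.sha1(name.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % (2 ** m)


class Ring:
    """Hypothetical, centralized view of a Chord-style identifier ring."""

    def __init__(self, node_names):
        # Nodes are kept sorted by identifier so the successor can be
        # found with a binary search.
        self.nodes = sorted((chord_id(n), n) for n in node_names)
        self.ids = [node_id for node_id, _ in self.nodes]

    def successor(self, key_id: int) -> str:
        """Return the first node whose identifier is >= key_id, wrapping around."""
        idx = bisect_left(self.ids, key_id)
        if idx == len(self.ids):
            idx = 0  # wrap past the highest identifier to the first node
        return self.nodes[idx][1]

    def lookup(self, object_name: str) -> str:
        """Map an object name to the node responsible for storing it."""
        return self.successor(chord_id(object_name))


ring = Ring(["node-a", "node-b", "node-c", "node-d"])
owner = ring.lookup("report.pdf")
```

Because placement depends only on the hash of the exact name, two documents with similar content generally land on unrelated nodes. That is the limitation the abstract describes and that LSI-guided placement is meant to address.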
Keywords :
content-based retrieval; file organisation; indexing; peer-to-peer computing; Chord; P2P DHT system; content placement; content-based full-text search; data location; distributed hash table system; distributed peer-to-peer system; document relevance; hashing scheme; hop minimization; information retrieval; key searching; large distributed information system; latent semantic indexing; message minimization; node address; object lookup operation; object retrieval; Indexing; Information retrieval; Large scale integration; Matrix decomposition; Peer to peer computing; Semantics;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
New Technologies of Distributed Systems (NOTERE), 2011 11th Annual International Conference on
Conference_Location :
Paris
ISSN :
2162-1896
Print_ISBN :
978-1-4577-0729-2
Type :
conf
DOI :
10.1109/NOTERE.2011.5957990
Filename :
5957990
Link To Document :