Title :
Proof: A DHT-Based Peer-to-Peer Search Engine
Author :
Yang, Kai-Hsiang ; Ho, Jan-Ming
Author_Institution :
Inst. of Inf. Sci., Acad. Sinica, Taipei
Abstract :
In this paper we focus on building a large scale keyword search service over structured peer-to-peer (P2P) networks. Current state-of-the-art keyword search approaches for structured P2P systems are based on inverted list intersection. However, the biggest challenge in those approaches is that when the indices are distributed over peers, a simple query may cause a large amount of data to be transmitted over the network. We propose a new P2P keyword search scheme, called "Proof", to reduce network traffic for queries. The key idea is storing a content summary for each Web page in the inverted list, so that a query can be processed by only transmitting a small size of candidate results. Our simulation results showed that, compared with previous DHT-based P2P systems, Proof can dramatically reduce network traffic and computation time. It provides 100% precision and 90.09% recall of search results, at an acceptable cost of storage overhead, even when the number of peers and documents increases continually
Keywords :
Internet; file organisation; peer-to-peer computing; query processing; search engines; text analysis; DHT-based peer-to-peer search engine; Proof P2P keyword search service; Web page; distributed hash table; inverted list; query processing; Buildings; Computational modeling; Computer networks; Keyword search; Large-scale systems; Peer to peer computing; Search engines; Telecommunication traffic; Traffic control; Web pages;
Conference_Titel :
Web Intelligence, 2006. WI 2006. IEEE/WIC/ACM International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2747-7
DOI :
10.1109/WI.2006.137