Regional Information Center for Science and Technology - Detecting and Clustering Similar Results of Search Engine by Exploiting Web Page's Contents

DocumentCode :

3471184

Title :

Detecting and Clustering Similar Results of Search Engine by Exploiting Web Page's Contents

Author :

Gao, Kai ; Wu, Hui-cong

Author_Institution :

Sch. of Inf. Sci. & Eng., Hebei Univ. of Sci. & Technol., Shijiazhuang

fYear :

2008

fDate :

12-14 Oct. 2008

Firstpage :

Lastpage :

Abstract :

This paper presents an approach to detecting and clustering similar search engine results by analyzing the pages' URLs and their contents. It uses a novel hash function together with a Chinese key concept extractor module. A similarity measure based on the degree of key concept overlap is proposed for clustering similar retrieval results, which can minimize overlap among the results effectively. The experimental results show the feasibility of the approach. A search engine has been developed on the basis of this work.

Keywords :

Internet; file organisation; search engines; Chinese key concept extractor; Web page contents; hash functions; Clustering algorithms; Educational institutions; Fingerprint recognition; Information science; Mechanical engineering; Parallel robots; Uniform resource locators; Web pages

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Wireless Communications, Networking and Mobile Computing, 2008. WiCOM '08. 4th International Conference on

Conference_Location :

Dalian

Print_ISBN :

978-1-4244-2107-7

Electronic_ISBN :

978-1-4244-2108-4

Type :

conf

DOI :

10.1109/WiCom.2008.2548

Filename :

4680737

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3471184