DocumentCode :
3202679
Title :
Web Page Clustering Based on Searching Keywords
Author :
Li, Taoying ; Chen, Yan
Author_Institution :
Transp. Manage. Coll., Dalian Maritime Univ., Dalian, China
Volume :
3
fYear :
2010
fDate :
11-12 May 2010
Firstpage :
1163
Lastpage :
1166
Abstract :
To improve Web search results and enhance Web crawling, this paper proposes a Web page clustering method based on search keywords. The method first uses the matching degree between Web pages and the search keywords to decide the order in which result pages are shown. A clustering algorithm then groups the result pages according to matching degree. Next, duplicated-page deletion detects and removes duplicated pages that have the same titles and abstracts. Finally, the proposed algorithm is applied in practice, and the results show that it is effective and feasible for addressing the information explosion on the Web.
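The three steps named in the abstract (ranking by matching degree, grouping by matching degree, deleting duplicates with the same title and abstract) can be illustrated with a minimal sketch. The abstract does not define the matching-degree formula or name the clustering algorithm, so everything below is an assumption for illustration only: `Page`, `matching_degree`, `remove_duplicates`, and `rank_group_dedupe` are hypothetical names, the score is a simple keyword-hit fraction, and grouping pages by identical score stands in for the paper's unspecified clustering step.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Page:
    """Hypothetical search-result record; field names are assumptions."""
    url: str
    title: str
    abstract: str


def matching_degree(page, keywords):
    """Assumed score: fraction of keywords found in the title or abstract.

    The paper's actual matching-degree definition is not given in the abstract.
    """
    if not keywords:
        return 0.0
    text = f"{page.title} {page.abstract}".lower()
    hits = sum(1 for kw in keywords if kw.lower() in text)
    return hits / len(keywords)


def remove_duplicates(pages):
    """Keep the first page for each (title, abstract) pair, ignoring case."""
    seen = set()
    unique = []
    for page in pages:
        key = (page.title.strip().lower(), page.abstract.strip().lower())
        if key not in seen:
            seen.add(key)
            unique.append(page)
    return unique


def rank_group_dedupe(pages, keywords):
    """Rank by score, group by equal score, then deduplicate each group.

    Grouping by identical score is a stand-in for a real clustering
    algorithm, which the abstract does not specify.
    """
    ranked = sorted(pages, key=lambda p: matching_degree(p, keywords), reverse=True)
    groups = {}
    for page in ranked:
        score = round(matching_degree(page, keywords), 2)
        groups.setdefault(score, []).append(page)
    # Dict insertion order keeps groups in descending score order.
    return {score: remove_duplicates(members) for score, members in groups.items()}
```

For example, calling `rank_group_dedupe(pages, ["clustering", "keywords"])` on a list of `Page` objects returns a dictionary whose keys are scores in descending order and whose values are the deduplicated pages sharing that score.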
Keywords :
Internet; data mining; pattern clustering; Web crawling operation; Web page clustering; duplicated pages deletion; matching degree; searching keywords; Automation; Clustering algorithms; Couplings; Data mining; Explosions; Partitioning algorithms; Transportation; Web mining; Web pages; Web services; matching degree; searching degree; web clustering; web mining;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Intelligent Computation Technology and Automation (ICICTA), 2010 International Conference on
Conference_Location :
Changsha
Print_ISBN :
978-1-4244-7279-6
Electronic_ISBN :
978-1-4244-7280-2
Type :
conf
DOI :
10.1109/ICICTA.2010.53
Filename :
5523220