Title :
Web Page Clustering Based on Searching Keywords
Author :
Li, Taoying ; Chen, Yan
Author_Institution :
Transp. Manage. Coll., Dalian Maritime Univ., Dalian, China
Abstract :
To improve Web search results and enhance Web crawling, this paper proposes Web page clustering based on search keywords. First, the matching degree between Web pages and the search keywords determines the order in which result pages are shown. Then, a clustering algorithm groups the result pages according to matching degree. Next, duplicated-page deletion detects and removes pages with identical titles and abstracts. Finally, the proposed algorithm is applied in practice, and the results show that it is effective and feasible for coping with information explosion on the Web.
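The three steps named in the abstract (ranking by matching degree, grouping by matching degree, and deleting duplicated pages) can be illustrated with a minimal sketch. The abstract does not give the paper's formulas, so everything concrete here is an assumption made for illustration: the matching degree is taken to be the fraction of search keywords found in a page's title or abstract, the grouping is a simple gap-based split of the ranked list rather than the clustering algorithm the authors chose, and the names `Page`, `matching_degree`, `cluster_by_degree`, `remove_duplicates`, and the `gap` parameter are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Page:
    url: str
    title: str
    abstract: str


def matching_degree(page, keywords):
    """Assumed measure: fraction of keywords found in the title or abstract.

    Uses plain substring matching, so "web" also matches "cobweb".
    """
    if not keywords:
        return 0.0
    text = f"{page.title} {page.abstract}".lower()
    hits = sum(1 for kw in keywords if kw.lower() in text)
    return hits / len(keywords)


def cluster_by_degree(pages, keywords, gap=0.2):
    """Rank pages by matching degree (highest first), then start a new
    group whenever the degree drops by more than `gap` (assumed rule)."""
    ranked = sorted(pages, key=lambda p: matching_degree(p, keywords), reverse=True)
    clusters = []
    previous = None
    for page in ranked:
        degree = matching_degree(page, keywords)
        if previous is None or previous - degree > gap:
            clusters.append([])
        clusters[-1].append(page)
        previous = degree
    return clusters


def remove_duplicates(clusters):
    """Drop pages whose title and abstract both repeat an earlier page."""
    seen = set()
    result = []
    for cluster in clusters:
        kept = []
        for page in cluster:
            key = (page.title.strip().lower(), page.abstract.strip().lower())
            if key not in seen:
                seen.add(key)
                kept.append(page)
        if kept:
            result.append(kept)
    return result


pages = [
    Page("http://a.example", "Web page clustering", "Clustering search results by keywords"),
    Page("http://b.example", "Web page clustering", "Clustering search results by keywords"),
    Page("http://c.example", "Web crawling", "Crawler design"),
    Page("http://d.example", "Cooking", "Recipes"),
]
keywords = ["clustering", "web", "keywords"]

groups = remove_duplicates(cluster_by_degree(pages, keywords))
for group in groups:
    print([page.url for page in group])
```

In this toy input the second page repeats the title and abstract of the first under a different URL, so it is removed after grouping; the remaining pages fall into separate groups because their matching degrees differ by more than the assumed `gap`.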
Keywords :
Internet; data mining; pattern clustering; Web crawling operation; Web page clustering; duplicated pages deletion; matching degree; searching keywords; Automation; Clustering algorithms; Couplings; Explosions; Partitioning algorithms; Transportation; Web mining; Web pages; Web services; searching degree; web clustering
Conference_Title :
Intelligent Computation Technology and Automation (ICICTA), 2010 International Conference on
Conference_Location :
Changsha
Print_ISBN :
978-1-4244-7279-6
Electronic_ISBN :
978-1-4244-7280-2
DOI :
10.1109/ICICTA.2010.53