Title :
The web software mining based on vector space model
Author :
Wang, Feijie ; Bai, Zhongying
Author_Institution :
Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
This article presents the design of a Web software mining system, discusses the techniques used in the system, and proposes solutions to the issues that arise in it. A Web crawler was designed and implemented according to the characteristics of the World Wide Web. Based on the information presented by Web pages, the article improves the feature selection method and the keyword weighting algorithm using Web text mining techniques, achieving precise and efficient information mining. Finally, the article applies the improved mining algorithm to the classification and clustering of Web crawler software.
Keywords :
Internet; data mining; text analysis; Web crawler software; Web pages; Web software mining; Web text mining; World Wide Web; feature selection method; key words weighted algorithm; vector space model; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Crawlers; Data mining; Dictionaries; Software algorithms; Software design; Software systems; Spatial databases; classification; feature selection; web mining; word segmentation
Conference_Title :
Future Information Networks, 2009. ICFIN 2009. First International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-5158-6
Electronic_ISBN :
978-1-4244-5159-3
DOI :
10.1109/ICFIN.2009.5339603