Title :
Web text categorization on GBODSS
Author :
Hu, Mingsheng ; Jia, Zhijuan
Author_Institution :
Inst. of Software Sci., Zhengzhou Teachers´´ Coll., Zhengzhou, China
Abstract :
Grid technology has the potential to improve the accessibility of digital libraries. The participants in Project GBODSS (grid-based open DSS) are in the process of developing a new open decision support system framework based on grid technologies. Automated text categorization has been extensively studied and various techniques for document categorization based on machine learning approaches have been proposed. However, most of these experimental prototypes, for the purpose of evaluating different techniques, have been restricted to the heterogeneous, autonomic, dynamic and distributed Internet environment. In this study, an approach based on support vector machines (SVMs) for Web text mining of large-scale systems on GBODSS is developed to support enterprise decision making. The experiments were conducted on different configurations of grid network and computation time was recorded for each operation. We analyzed our result with various grid configurations and it shows speed up of computation time is almost super linear.
Keywords :
Internet; decision making; decision support systems; digital libraries; grid computing; open systems; support vector machines; text analysis; Web text categorization; Web text mining; digital libraries; distributed Internet environment; document categorization; enterprise decision making; grid technology; grid-based open DSS; machine learning approach; open decision support system; support vector machines; Decision support systems; Grid computing; Internet; Large-scale systems; Machine learning; Prototypes; Software libraries; Support vector machines; Text categorization; Text mining; GBODSS; Grid Technology; Web Text Categorization;
Conference_Titel :
Computer Science & Education, 2009. ICCSE '09. 4th International Conference on
Conference_Location :
Nanning
Print_ISBN :
978-1-4244-3520-3
Electronic_ISBN :
978-1-4244-3521-0
DOI :
10.1109/ICCSE.2009.5228357