Title :
Multi-class SVM with negative data selection for Web page classification
Author :
Chen, Chih-Ming ; Lee, Hahn-Ming ; Kao, Ming-Tyan
Author_Institution :
Graduate Inst. of Learning Technol., Nat. Hualien Teachers Coll., Taiwan
Abstract :
Support vector machine (SVM) has been demonstrated its excellent performance in terms of solving document classification problem. In this paper, SVM with one-against-all structure is applied to solve Web page classification problems with multi-class. However, the main problem of SVM with one-against-all structure is that the negative data might be too huge so that the training time obviously increase. To solve this problem, a negative data selection method is presented to reduce a large amount of negative data for SVM. Experimental results show that the training time is obviously reduced. Moreover, the proposed method also keeps a high accuracy rate for Web page classification.
Keywords :
Web sites; data reduction; document handling; learning (artificial intelligence); pattern classification; problem solving; support vector machines; Web page classification; document classification problem; multiclass SVM; negative data reduction; negative data selection; problem solving; support vector machine; Computer science; Data engineering; Educational institutions; Electronic mail; Internet; Machine learning; Search engines; Support vector machine classification; Support vector machines; Web pages;
Conference_Titel :
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
Print_ISBN :
0-7803-8359-1
DOI :
10.1109/IJCNN.2004.1380932