Title :
A new transductive support vector machine approach to text categorization
Author :
Sun, Fan ; Sun, Maosong
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
fDate :
30 Oct.-1 Nov. 2005
Abstract :
Transductive inference is well suited for text categorization tasks with an enormous amount of unlabeled data in addition to a small number of labeled data. We present a new transductive support vector machine approach for text categorization in order to make use of the large amount of unlabeled data efficiently. In our experiments the performance of transductive methods greatly exceeds that of conventional SVM. Experimental results also show that our algorithm outperforms Joachims´ TSVM, especially when there is a significant deviation between the distribution of training and test data.
Keywords :
classification; inference mechanisms; support vector machines; text analysis; text categorization; transductive inference; transductive support vector machine approach; unlabeled data; Inference algorithms; Laboratories; Machine learning; Simulated annealing; Sun; Support vector machine classification; Support vector machines; Testing; Text categorization; Training data;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
DOI :
10.1109/NLPKE.2005.1598813