DocumentCode :
2500670
Title :
Enhancing Web Page Classification via Local Co-training
Author :
Du, Youtian ; Guan, Xiaohong ; Cai, Zhongmin
Author_Institution :
MOE Key Lab. for Intell. Networks & Network Security, Xi'an Jiaotong Univ., Xi'an, China
fYear :
2010
fDate :
23-26 Aug. 2010
Firstpage :
2905
Lastpage :
2908
Abstract :
In this paper we propose a new multi-view semi-supervised learning algorithm called Local Co-Training (LCT). The algorithm employs a set of local models with vector outputs to model the relations among examples in a local region on each view, and uses the co-training process to iteratively refine the dominant local models (i.e., the local models related to the unlabeled examples chosen for enriching the training set) with unlabeled examples. Compared with previous co-training style algorithms, local co-training has two advantages: first, introducing local learning gives it higher classification precision; second, only the dominant local models need to be updated, which significantly decreases the computational load. Experiments on the WebKB and Cora datasets demonstrate that the LCT algorithm can effectively exploit unlabeled data to improve the performance of web page classification.
Keywords :
Internet; learning (artificial intelligence); pattern classification; Cora datasets; Web page classification; WebKB datasets; local co-training; machine learning; multiview semi-supervised learning algorithm; Computational modeling; Error analysis; Machine learning; Support vector machines; Training; Web pages;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
ISSN :
1051-4651
Print_ISBN :
978-1-4244-7542-1
Type :
conf
DOI :
10.1109/ICPR.2010.712
Filename :
5597065