Title :
Enhancing Web Page Classification via Local Co-training
Author :
Du, Youtian ; Guan, Xiaohong ; Cai, Zhongmin
Author_Institution :
MOE Key Lab. for Intell. Networks & Network Security, Xi ´´an Jiaotong Univ., Xi´´an, China
Abstract :
In this paper we propose a new multi-view semi-supervised learning algorithm called Local Co-Training(LCT). The proposed algorithm employs a set of local models with vector outputs to model the relations among examples in a local region on each view, and iteratively refines the dominant local models (i.e. the local models related to the unlabeled examples chosen for enriching the training set) using unlabeled examples by the co-training process. Compared with previous co-training style algorithms, local co-training has two advantages: firstly, it has higher classification precision by introducing local learning; secondly, only the dominant local models need to be updated, which significantly decreases the computational load. Experiments on WebKB and Cora datasets demonstrate that LCT algorithm can effectively exploit unlabeled data to improve the performance of web page classification.
Keywords :
Internet; learning (artificial intelligence); pattern classification; Cora datasets; Web page classification; WebKB datasets; local co-training; machine learning; multiview semi-supervised learning algorithm; Computational modeling; Error analysis; Machine learning; Support vector machines; Training; Web pages;
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4244-7542-1
DOI :
10.1109/ICPR.2010.712