DocumentCode :
1966059
Title :
Feature selection for collective classification
Author :
Senliol, Baris ; Aral, Atakan ; Cataltepe, Zehra
Author_Institution :
Comput. Eng. Dept., Istanbul Tech. Univ., Istanbul, Turkey
fYear :
2009
fDate :
14-16 Sept. 2009
Firstpage :
286
Lastpage :
291
Abstract :
When in addition to node contents and labels, relations (links) between nodes and some unlabeled nodes are available, collective classification algorithms can be used. Collective classification algorithms, like ICA (iterative classification algorithm), determine labels for the unlabeled nodes based on the contents and/or labels of the neighboring nodes. Feature selection algorithms have been shown to improve classification accuracy for traditional machine learning algorithms. In this paper, we use a recent and successful feature selection algorithm, mRMR (minimum redundancy maximum relevance, Ding and Peng, 2003), on content features. On two scientific paper citation data sets, Cora and Citeseer, when only content information is used, we know that the selected features may result in almost as good performance as all the features. When feature selection is performed both on content and link information, even better classification accuracies are obtained. Feature selection considerably reduces the training time for both content only and ICA algorithms.
Keywords :
iterative methods; learning (artificial intelligence); pattern classification; collective classification algorithms; feature selection algorithms; iterative classification algorithm; machine learning algorithms; minimum redundancy maximum relevance; Chemicals; Classification algorithms; Computer networks; Independent component analysis; Inference algorithms; Iterative algorithms; Logistics; Machine learning algorithms; Pattern recognition; Testing; Citeseer; Collective Classification; Cora; Feature Selection; Iterative Classification Algorithm (ICA); Logistic Regression; Minimum Redundancy Maximum Relevance (mRMR);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Sciences, 2009. ISCIS 2009. 24th International Symposium on
Conference_Location :
Guzelyurt
Print_ISBN :
978-1-4244-5021-3
Electronic_ISBN :
978-1-4244-5023-7
Type :
conf
DOI :
10.1109/ISCIS.2009.5291828
Filename :
5291828
Link To Document :
بازگشت