DocumentCode :
60401
Title :
Automated Graph Regularized Projective Nonnegative Matrix Factorization for Document Clustering
Author :
Xiaobing Pei ; Tao Wu ; Chuanbo Chen
Author_Institution :
Sch. of Software, HuaZhong Univ. of Sci. & Technol., Wuhan, China
Volume :
44
Issue :
10
fYear :
2014
fDate :
Oct. 2014
Firstpage :
1821
Lastpage :
1831
Abstract :
In this paper, a novel projective nonnegative matrix factorization (PNMF) method for enhancing the clustering performance is presented, called automated graph regularized projective nonnegative matrix factorization (AGPNMF). The idea of AGPNMF is to extend the original PNMF by incorporating the automated graph regularized constraint into the PNMF decomposition. The key advantage of this approach is that AGPNMF simultaneously finds graph weights matrix and dimensionality reduction of data. AGPNMF seeks to extract the data representation space that preserves the local geometry structure. This character makes AGPNMF more intuitive and more powerful than the original method for clustering tasks. The kernel trick is used to extend AGPNMF model related to the input space by some nonlinear map. The proposed method has been applied to the problem of document clustering using the well-known Reuters-21578, TDT2, and SECTOR data sets. Our experimental evaluations show that the proposed method enhances the performance of PNMF for document clustering.
Keywords :
data reduction; data structures; document handling; graph theory; matrix decomposition; pattern clustering; AGPNMF; PNMF decomposition; Reuters-21578; SECTOR data sets; TDT2; automated graph regularized projective nonnegative matrix factorization; data dimensionality reduction; data representation space extraction; document clustering; graph weights matrix; local geometry structure; Clustering algorithms; Cybernetics; Geometry; Kernel; Linear programming; Matrix decomposition; Optimization; Clustering; projective nonnegative matrix factorization;
fLanguage :
English
Journal_Title :
Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
2168-2267
Type :
jour
DOI :
10.1109/TCYB.2013.2296117
Filename :
6712130
Link To Document :
بازگشت