DocumentCode :
2727591
Title :
Pairwise Constraints-Guided Non-negative Matrix Factorization for Document Clustering
Author :
Yang, Yu-Jiu ; Hu, Bao-Gang
fYear :
2007
fDate :
2-5 Nov. 2007
Firstpage :
250
Lastpage :
256
Abstract :
Nonnegative Matrix Factorization (NMF) has been proven to be effective in text mining. However, since NMF is a well-known unsupervised components analysis technique, the existing NMF method can not deal with prior constraints, which are beneficial to clustering or classification tasks. In this paper, we address the text clustering problem via a novel strategy, called Pairwise Constraintsguided Non-negative Matrix Factorization (PCNMF for short). Differing from the traditional NMF method, the proposed method can capture the available abundance prior constraints in original space, which result in more effective for clustering or information retrieval. Therefore, PCNMF enforces the discriminative capability in the reduced space. Utilizing the appropriate transformation, PCNMF represents as a new optimization problem, which can be efficiently solved by an iterative approach. The cluster membership of each document can be easily determined as the standard NMF. Empirical studies based on Benchmark document corpus demonstrate appealing results.
Keywords :
Automation; Computational efficiency; Data mining; Feature extraction; Information retrieval; Internet; Iterative methods; Laboratories; Pattern recognition; Text mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence, IEEE/WIC/ACM International Conference on
Conference_Location :
Fremont, CA
Print_ISBN :
978-0-7695-3026-0
Type :
conf
DOI :
10.1109/WI.2007.66
Filename :
4427095
Link To Document :
بازگشت