Title :
Joint Local and Global Consistency on Interdocument and Interword Relationships for Co-Clustering
Author :
Bing-Kun Bao ; Weiqing Min ; Teng Li ; Changsheng Xu
Author_Institution :
Nat. Lab. of Pattern Recognition, Beijing, China
Abstract :
Co-clustering has recently received a lot of attention due to its effectiveness in simultaneously partitioning words and documents by exploiting the relationships between them. However, most of the existing co-clustering methods neglect or only partially reveal the interword and interdocument relationships. To fully utilize those relationships, the local and global consistencies on both word and document spaces need to be considered, respectively. Local consistency indicates that the label of a word/document can be predicted from its neighbors, while global consistency enforces a smoothness constraint on words/documents labels over the whole data manifold. In this paper, we propose a novel co-clustering method, called co-clustering via local and global consistency, to not only make use of the relationship between word and document, but also jointly explore the local and global consistency on both word and document spaces, respectively. The proposed method has the following characteristics: 1) the word-document relationships is modeled by following information-theoretic co-clustering (ITCC); 2) the local consistency on both interword and interdocument relationships is revealed by a local predictor; and 3) the global consistency on both interword and interdocument relationships is explored by a global smoothness regularization. All the fitting errors from these three-folds are finally integrated together to formulate an objective function, which is iteratively optimized by a convergence provable updating procedure. The extensive experiments on two benchmark document datasets validate the effectiveness of the proposed co-clustering method.
Keywords :
iterative methods; optimisation; pattern clustering; word processing; ITCC; document partitioning; information-theoretic coclustering; iterative optimization; objective function; smoothness regularization; word partitioning; word-document relationships; Joints; Kernel; Manifolds; Matrix decomposition; Mutual information; Random variables; Vectors; Co-clustering; information theory; local and global learning;
Journal_Title :
Cybernetics, IEEE Transactions on
DOI :
10.1109/TCYB.2014.2317514