DocumentCode :
5187
Title :
Relational Multimanifold Coclustering
Author :
Ping Li ; Jiajun Bu ; Chun Chen ; Zhanying He ; Deng Cai
Author_Institution :
Zhejiang Provincial Key Lab. of Service Robot, Zhejiang Univ., Hangzhou, China
Volume :
43
Issue :
6
fYear :
2013
fDate :
Dec. 2013
Firstpage :
1871
Lastpage :
1881
Abstract :
Coclustering targets on grouping the samples (e.g., documents and users) and the features (e.g., words and ratings) simultaneously. It employs the dual relation and the bilateral information between the samples and features. In many real-world applications, data usually reside on a submanifold of the ambient Euclidean space, but it is nontrivial to estimate the intrinsic manifold of the data space in a principled way. In this paper, we focus on improving the coclustering performance via manifold ensemble learning, which is able to maximally approximate the intrinsic manifolds of both the sample and feature spaces. To achieve this, we develop a novel coclustering algorithm called relational multimanifold coclustering based on symmetric nonnegative matrix trifactorization, which decomposes the relational data matrix into three submatrices. This method considers the intertype relationship revealed by the relational data matrix and also the intratype information reflected by the affinity matrices encoded on the sample and feature data distributions. Specifically, we assume that the intrinsic manifold of the sample or feature space lies in a convex hull of some predefined candidate manifolds. We want to learn a convex combination of them to maximally approach the desired intrinsic manifold. To optimize the objective function, the multiplicative rules are utilized to update the submatrices alternatively. In addition, both the entropic mirror descent algorithm and the coordinate descent algorithm are exploited to learn the manifold coefficient vector. Extensive experiments on documents, images, and gene expression data sets have demonstrated the superiority of the proposed algorithm compared with other well-established methods.
Keywords :
learning (artificial intelligence); matrix decomposition; pattern clustering; affinity matrices; ambient Euclidean space; bilateral information; coordinate descent algorithm; dual relation; entropic mirror descent algorithm; feature data distributions; feature spaces; intertype relationship; intratype information; manifold coefficient vector learning; manifold ensemble learning; multiplicative rules; relational data matrix decomposition; relational multimanifold coclustering; symmetric nonnegative matrix trifactorization; Laplace equations; Linear programming; Manifolds; Matrix decomposition; Partitioning algorithms; Symmetric matrices; Vectors; Coordinate descent algorithm (CDA); entropic mirror descent algorithm (EMDA); manifold ensemble learning; nonnegative matrix trifactorization; relational coclustering;
fLanguage :
English
Journal_Title :
Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
2168-2267
Type :
jour
DOI :
10.1109/TSMCB.2012.2234108
Filename :
6409269
Link To Document :
بازگشت