Title of article :
A Tensor CP Decomposition Method for Clustering Heterogeneous Information Networks via Stochastic Gradient Descent Algorithms
Author/Authors :
Wu, Jibing Science and Technology on Information System Engineering Laboratory - National University of Defense Technology, Changsha, China , Wang, Zhifei Science and Technology on Information System Engineering Laboratory - National University of Defense Technology, Changsha, China , Wu, Yahui Science and Technology on Information System Engineering Laboratory - National University of Defense Technology, Changsha, China , Liu, Lihua Science and Technology on Information System Engineering Laboratory - National University of Defense Technology, Changsha, China , Deng, Su Science and Technology on Information System Engineering Laboratory - National University of Defense Technology, Changsha, China , Huang, Hongbin Science and Technology on Information System Engineering Laboratory - National University of Defense Technology, Changsha, China
Pages :
14
From page :
1
To page :
14
Abstract :
Clustering analysis is a basic and essential method for mining heterogeneous information networks, which consist of multiple types of objects and rich semantic relations among different object types. Heterogeneous information networks are ubiquitous in the real-world applications, such as bibliographic networks and social media networks. Unfortunately, most existing approaches, such as spectral clustering, are designed to analyze homogeneous information networks, which are composed of only one type of objects and links. Some recent studies focused on heterogeneous information networks and yielded some research fruits, such as RankClus and NetClus. However, they often assumed that the heterogeneous information networks usually follow some simple schemas, such as bityped network schema or star network schema. To overcome the above limitations, we model the heterogeneous information network as a tensor without the restriction of network schema. Then, a tensor CP decomposition method is adapted to formulate the clustering problem in heterogeneous information networks. Further, we develop two stochastic gradient descent algorithms, namely, SGDClus and SOSClus, which lead to effective clustering multityped objects simultaneously. The experimental results on both synthetic datasets and real-world dataset have demonstrated that our proposed clustering framework can model heterogeneous information networks efficiently and outperform state-of-the-art clustering methods.
Keywords :
A Tensor CP , Decomposition Method , Clustering Heterogeneous Information Networks , Descent Algorithms
Journal title :
Scientific Programming
Serial Year :
2017
Full Text URL :
Record number :
2608080
Link To Document :
بازگشت