DocumentCode :
2208095
Title :
Edge Weight Regularization over Multiple Graphs for Similarity Learning
Author :
Muthukrishnan, Pradeep ; Radev, Dragomir ; Mei, Qiaozhu
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Univ. of Michigan, Ann Arbor, MI, USA
fYear :
2010
fDate :
13-17 Dec. 2010
Firstpage :
374
Lastpage :
383
Abstract :
The growth of the web has directly influenced the increase in the availability of relational data. One of the key problems in mining such data is computing the similarity between objects with heterogeneous feature types. For example, publications have many heterogeneous features like text, citations, authorship information, venue information, etc. In most approaches, similarity is estimated using each feature type in isolation and then combined in a linear fashion. However, this approach does not take advantage of the dependencies between the different feature spaces. In this paper, we propose a novel approach to combine the different sources of similarity using a regularization framework over edges in multiple graphs. We show that the objective function induced by the framework is convex. We also propose an efficient algorithm using coordinate descent to solve the optimization problem. We extrinsically evaluate the performance of the proposed unified similarity measure on two different tasks, clustering and classification. The proposed similarity measure outperforms three baselines and a state-of-the-art classification algorithm on a variety of standard, large data sets.
Keywords :
convex programming; data mining; graph theory; pattern classification; pattern clustering; convex function; data classification; data clustering; data mining; edge weight regularization; multiple graph; optimization; regularization framework; relational data; similarity learning; Classification; Clustering; Heterogeneous Features; Machine Learning; Similarity Learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2010 IEEE 10th International Conference on
Conference_Location :
Sydney, NSW
ISSN :
1550-4786
Print_ISBN :
978-1-4244-9131-5
Electronic_ISBN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2010.156
Filename :
5693991
Link To Document :
بازگشت