Title :
Detecting Anomalies in Bipartite Graphs with Mutual Dependency Principles
Author :
Hanbo Dai ; Feida Zhu ; Ee-Peng Lim ; HweeHwa Pang
Author_Institution :
Sch. of Inf. Syst., Singapore Manage. Univ., Singapore, Singapore
Abstract :
Bipartite graphs can model many real life applications including users-rating-products in online marketplaces, users-clicking-webpages on the World Wide Web and users referring- users in social networks. In these graphs, the anomalousness of nodes in one partite often depends on that of their connected nodes in the other partite. Previous studies have shown that this dependency can be positive (the anomalousness of a node in one partite increases or decreases along with that of its connected nodes in the other partite) or negative (the anomalousness of a node in one partite rises or falls in opposite direction to that of its connected nodes in the other partite). In this paper, we unify both positive and negative mutual dependency relationships in an unsupervised framework for detecting anomalous nodes in bipartite graphs. This is the first work that integrates both mutual dependency principles to model the complete set of anomalous behaviors of nodes that cannot be identified by either principle alone. We formulate our principles and design an iterative algorithm to simultaneously compute the anomaly scores of nodes in both partites. Moreover, we mathematically prove that the ranking of nodes by anomaly scores in each partite converges. Our framework is examined on synthetic graphs and the results show that our model outperforms existing models with only positive or negative mutual dependency principles. We also apply our framework to two real life datasets: Goodreads as a users-rating-books setting and Buzzcity as a users-clicking advertisements setting. The results show that our method is able to detect suspected spamming users and spammed books in Goodreads and achieve higher precision in identifying fraudulent advertisement publishers than existing approaches.
Keywords :
advertising; electronic publishing; graph theory; iterative methods; security of data; Buzzcity setting; Goodreads setting; World Wide Web; anomaly detection; bipartite graph; fraudulent advertisement publisher identification; iterative algorithm; mutual dependency principle; negative node dependency; node ranking; online marketplace; positive node dependency; social networks; spammed books detection; suspected spamming user detection; synthetic graph; unsupervised framework; users-clicking-webpages application; users-rating-products application; Bipartite graph; Computational modeling; Convergence; Eigenvalues and eigenfunctions; Image edge detection; Mathematical model; Vectors; Anomaly Detection; Bipartite Graph; Mutual Dependency; Mutual Reinforcement; Node Anomalies;
Conference_Titel :
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
Print_ISBN :
978-1-4673-4649-8
DOI :
10.1109/ICDM.2012.167