Title :
Indexing bipartite memberships in web graphs
Author :
Yusheng Xie ; Zhengzhang Chen ; Palsetia, Diana ; Agrawal, Ankit ; Choudhary, Alok
Author_Institution :
Northwestern Univ., Evanston, IL, USA
Abstract :
Massive bipartite graphs are ubiquitous in real world and have important applications in social networks, biological mechanisms, etc. Consider one billion plus people on Facebook making trillions of connections with millions of organizations. Such big social bipartite graphs are often very skewed and unbalanced, on which traditional indexing algorithms do not perform optimally. In this paper, we propose Arowana, a data-driven algorithm for indexing large unbalanced bipartite graphs. Arowana achieves a high-performance efficiency by building an index tree that incorporates the semantic affinity among unbalanced graphs. Arowana uses probabilistic data structures to minimize space overhead and optimize search. In the experiments, we show that Arowana exhibits significant performance improvements and reduces space overhead over traditional indexing techniques.
Keywords :
Internet; graph theory; indexing; network theory (graphs); probability; social networking (online); tree data structures; Arowana data-driven algorithm; Facebook; Web graphs; biological mechanisms; bipartite membership indexing algorithm; high-performance efficiency; index tree; massive bipartite graphs; probabilistic data structures; search optimization; semantic affinity; social bipartite graphs; social networks; space overhead minimization; unbalanced bipartite graphs; unbalanced graphs; Bipartite graph; Buildings; Facebook; Indexing; Vegetation;
Conference_Titel :
Advances in Social Networks Analysis and Mining (ASONAM), 2014 IEEE/ACM International Conference on
Conference_Location :
Beijing
DOI :
10.1109/ASONAM.2014.6921578