Title :
Interactive visualization of the social network of research collaborations
Author :
Alsukhni, Mohammad ; Zhu, Ying
Author_Institution :
Fac. of Eng. & Appl. Sci., Univ. of Ontario Inst. of Technol., Oshawa, ON, Canada
Abstract :
Social networks have been evolving over the past few years, leading to a rapid increase in the number and complexity of relationships among their entities. In this paper, we focus on a large scale dataset known as the Digital Bibliography and Library Project (DBLP), which contains information on all publications that have been published in computer and information science related journals and conference proceedings. We model the DBLP dataset as a social network of research collaborations. DBLP is a structured and dynamic dataset stored in the XML file format; it contains over 850,000 authors and 2 million publications and the resulting collaboration social network is a scale-free network. We define DBLP collaboration social network as a graph that consists of researchers as nodes and links representing the collaboration among the researchers. In this work, we implement a data analysis algorithm called Multidimensional Scaling (MDS) to represent the degree of collaboration among the DBLP authors as Euclidean distances in order to analyze, mine and understand the relational information in this large scale network in a visual way. MDS requires a highly computational complexity for large scale graphs such as the DBLP graph. Therefore, we propose different solutions to overcome this problem, and improve the MDS performance. In addition, as the quality of the MDS result is measured by a metric known as the stress value, we use the steepest descent method to minimize the stress in an iterative process called stress optimization in order to generate the best geometric layout of the graph. We also propose a solution to further enhance the graph visualization by partitioning the graph into sub-graphs and using repelling forces among nodes within the same sub-graph.
Keywords :
XML; bibliographic systems; computational complexity; data analysis; data visualisation; electronic publishing; gradient methods; graph theory; groupware; interactive systems; optimisation; social networking (online); DBLP author; DBLP collaboration social network; DBLP dataset; DBLP graph; Euclidean distance; MDS performance; XML file format; computational complexity; computer science related journal; conference proceedings; data analysis algorithm; digital bibliography and library project; geometric layout; graph partitioning; graph visualization; information science related journal; interactive visualization; iterative process; large scale graph; large scale network; multidimensional scaling; publication; relational information; research collaboration; scale-free network; steepest descent method; stress optimization; stress value; subgraph; Collaboration; Data visualization; Equations; Mathematical model; Social network services; Stress; XML; Multidimensional scaling; Social network; co-authorships network; digital bibliography and library project; gradient descent; graphs; information retrieval;
Conference_Titel :
Information Reuse and Integration (IRI), 2012 IEEE 13th International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4673-2282-9
Electronic_ISBN :
978-1-4673-2283-6
DOI :
10.1109/IRI.2012.6303017