Title :
kNN-MST-Agglomerative: A fast and scalable graph-based data clustering approach on GPU
Author :
Arefin, A.S. ; Riveros, Cristian ; Berretta, Regina ; Moscato, P.
Author_Institution :
Sch. of Electr. Eng. & Comput. Sci., Univ. of Newcastle, Newcastle, NSW, Australia
Abstract :
Data clustering is a distinctive method for analyzing complex networks in terms of functional relationships of the comprising elements. A number of graph-based algorithms have been proposed so far to tackle the complexity of the problem and many of them are based on the representation of data in the form of a minimum spanning tree (MST). In this work, we propose a graph-based agglomerative clustering method that is based the k-Nearest Neighbor (kNN) graphs and the Borůvka´s-MST Algorithm, (termed as, kNN-MST-Agglomerative). The proposed method is inherently parallel and in addition it is applicable to a wide class of practical problems involving large datasets. We demonstrate the performance of our method on a set of real-world biological networks constructed from a renowned breast cancer study.
Keywords :
cancer; complex networks; computational complexity; graph theory; graphics processing units; pattern clustering; tree data structures; Boruvka-MST Algorithm; GPU; MST; complex networks; data representation; fast graph-based data clustering approach; functional relationships; graph-based agglomerative clustering method; k-nearest neighbor graphs; kNN graphs; kNN-MST-agglomerative method; large datasets; minimum spanning tree; real-world biological networks; renowned breast cancer study; scalable graph-based data clustering approach; Clustering algorithms; Color; Educational institutions; Graphics processing unit; Instruction sets; Kernel; Partitioning algorithms;
Conference_Titel :
Computer Science & Education (ICCSE), 2012 7th International Conference on
Conference_Location :
Melbourne, VIC
Print_ISBN :
978-1-4673-0241-8
DOI :
10.1109/ICCSE.2012.6295143