Title :
An Auto-tuning Solution to Data Streams Clustering in OpenCL
Author :
Fang, Jianbin ; Varbanescu, Ana Lucia ; Sips, Henk
Author_Institution :
Parallel & Distrib. Syst. Group, Delft Univ. of Technol., Delft, Netherlands
Abstract :
Due to its applicability to numerous types of data, including telephone records, web documents, and click streams, the data stream model has recently attracted attention. For analysis of such data, it is crucial to process the data in a single pass, or a small number of passes, using little memory. This paper provides an OpenCL implementation for data streams clustering, and then presents several optimizations for it, which make it more efficient in terms of memory usage. In order to maximize performance for different problem sizes and architectures, we also propose an auto-tuning solution. Experimental results show that our fully optimized implementation can perform 2.1x and 1.4x faster than the native OpenCL implementation on NVIDIA GTX480 and AMD HD5870, respectively, it can also achieve 1.4x to 3.3x speedup relative to the original CUDA implementation solution on GTX480.
Keywords :
data models; open systems; optimisation; pattern clustering; AMD HD5870; CUDA implementation solution; GTX480; NVIDIA GTX480; OpenCL implementation; Web document; auto-tuning solution; click stream; data stream clustering; data stream model; telephone record; Analytical models; Bandwidth; Equations; Graphics processing unit; Mathematical model; Memory management; Optimization; Auto-tuning; Clustering; Data Streams; OpenCL; Performance Optimizations;
Conference_Titel :
Computational Science and Engineering (CSE), 2011 IEEE 14th International Conference on
Conference_Location :
Dalian, Liaoning
Print_ISBN :
978-1-4577-0974-6
DOI :
10.1109/CSE.2011.104