مرکز منطقه ای اطلاع رساني علوم و فناوري - Approximation of kernel k-means for streaming data

DocumentCode :

594763

Title :

Approximation of kernel k-means for streaming data

Author :

Havens, Timothy C.

Author_Institution :

Dept. of Electr. & Comput. Eng., Michigan Technol. Univ., Houghton, MI, USA

fYear :

2012

fDate :

11-15 Nov. 2012

Firstpage :

509

Lastpage :

512

Abstract :

Streaming data are any data that are sequentially presented to a system such that future data cannot be accessed. By their nature, streaming data are often large data sets and can quickly outgrow the working memory for a typical computer. Clustering is one of the primary tasks used in the pattern recognition and data mining communities and kernel k-means is a well-studied and popular algorithm. However, kernel k-means is ill-suited to streaming data. In this paper, I develop an approximation to the kernel k-means that only needs access to data from time-step t and (t-1); hence, the memory requirement of the proposed algorithm is only O(n²), where n is the size of the data chunk at time t. Empirical results show that streaming kernel k-means (skKM) produces partitions similar to those produced by kernel k-means run on the entire data set.

Keywords :

approximation theory; data mining; pattern clustering; O(n²); approximation; clustering analysis; data chunk; data mining communities; kernel k-means; pattern recognition; streaming data; Clustering algorithms; Complexity theory; Equations; Kernel; Memory management; Standards; Vectors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Pattern Recognition (ICPR), 2012 21st International Conference on

Conference_Location :

Tsukuba

ISSN :

1051-4651

Print_ISBN :

978-1-4673-2216-4

Type :

conf

Filename :

6460183

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=594763