Title :
Clustering techniques for streaming data-a survey
Author :
Yogita, Y. ; Toshniwal, D.
Author_Institution :
Electron. & Comput. Eng. Dept., Indian Inst. of Technol. Roorkee, Roorkee, India
Abstract :
Nowadays many applications are generating streaming data for an example real-time surveillance, internet traffic, sensor data, health monitoring systems, communication networks, online transactions in the financial market and so on. Data Streams are temporally ordered, fast changing, massive, and potentially infinite sequence of data. Data Stream mining is a very challenging problem. This is due to the fact that data streams are of tremendous volume and flows at very high speed which makes it impossible to store and scan streaming data multiple time. Concept evolution in streaming data further magnifies the challenge of working with streaming data. Clustering is a data stream mining task which is very useful to gain insight of data and data characteristics. Clustering is also used as a pre-processing step in over all mining process for an example clustering is used for outlier detection and for building classification model. In this paper we will focus on the challenges and necessary features of data stream clustering techniques, review and compare the literature for data stream clustering by example and variable, describe some real world applications of data stream clustering, and tools for data stream clustering.
Keywords :
data mining; pattern clustering; classification model; clustering task; concept evolution; data sequence; data stream clustering technique; data stream mining; outlier detection; streaming data; Approximation algorithms; Approximation methods; Clustering algorithms; Correlation; Data mining; Data models; Fading; Clustering; Data Stream Mining; Streaming Data;
Conference_Titel :
Advance Computing Conference (IACC), 2013 IEEE 3rd International
Conference_Location :
Ghaziabad
Print_ISBN :
978-1-4673-4527-9
DOI :
10.1109/IAdCC.2013.6514355