DocumentCode :
2491766
Title :
A Heuristic Method for Unstructured Pattern Management over Data Streams
Author :
Gaoshan, Miao ; Hongyan, Li ; TengJiao, Wang
Author_Institution :
Sch. of Electron. Eng. & Comput. Sci., Peking Univ., Beijing, China
fYear :
2010
fDate :
6-8 April 2010
Firstpage :
468
Lastpage :
471
Abstract :
Pattern management is an important task in data stream mining and has attracted increasing attention recently. Variations of data stream patterns typically imply some fundamental changes of underlying objects and possess significant domain meanings. Many database applications require investigating the history information to get the knowledge about the evolving process of data streams. However, in most circumstances, the data stream patterns are unstructured: limited memory space cannot record all the patterns discovered online, no training sets or predefined models are available, and large numbers of noises bring another non-trivial challenge. This paper presents our research effort in online pattern management over such streams. A novel algorithm is proposed to detect stream changes, organize meaningful patterns and distinguish useful variations from noises. It extracts new trends from unstructured data heuristically, and involves a special parameter to identify whether the current event should be treated as significant. Several experiments are performed and the results prove this new method feasible and efficient.
Keywords :
data mining; heuristic programming; data stream mining; data streams; database applications; heuristic method; unstructured pattern management; Acoustic noise; Computer science education; Conference management; Consumer electronics; Data engineering; Data mining; Discrete wavelet transforms; Engineering management; Laboratories; Technology management; data stream; pattern management; unstructured data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Conference (APWEB), 2010 12th International Asia-Pacific
Conference_Location :
Busan
Print_ISBN :
978-1-7695-4012-2
Electronic_ISBN :
978-1-4244-6600-9
Type :
conf
DOI :
10.1109/APWeb.2010.77
Filename :
5474089
Link To Document :
بازگشت