DocumentCode
2491766
Title
A Heuristic Method for Unstructured Pattern Management over Data Streams
Author
Gaoshan, Miao ; Hongyan, Li ; TengJiao, Wang
Author_Institution
Sch. of Electron. Eng. & Comput. Sci., Peking Univ., Beijing, China
fYear
2010
fDate
6-8 April 2010
Firstpage
468
Lastpage
471
Abstract
Pattern management is an important task in data stream mining and has attracted increasing attention recently. Variations of data stream patterns typically imply some fundamental changes of underlying objects and possess significant domain meanings. Many database applications require investigating the history information to get the knowledge about the evolving process of data streams. However, in most circumstances, the data stream patterns are unstructured: limited memory space cannot record all the patterns discovered online, no training sets or predefined models are available, and large numbers of noises bring another non-trivial challenge. This paper presents our research effort in online pattern management over such streams. A novel algorithm is proposed to detect stream changes, organize meaningful patterns and distinguish useful variations from noises. It extracts new trends from unstructured data heuristically, and involves a special parameter to identify whether the current event should be treated as significant. Several experiments are performed and the results prove this new method feasible and efficient.
Keywords
data mining; heuristic programming; data stream mining; data streams; database applications; heuristic method; unstructured pattern management; Acoustic noise; Computer science education; Conference management; Consumer electronics; Data engineering; Data mining; Discrete wavelet transforms; Engineering management; Laboratories; Technology management; data stream; pattern management; unstructured data;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Conference (APWEB), 2010 12th International Asia-Pacific
Conference_Location
Busan
Print_ISBN
978-1-7695-4012-2
Electronic_ISBN
978-1-4244-6600-9
Type
conf
DOI
10.1109/APWeb.2010.77
Filename
5474089
Link To Document