DocumentCode
3309053
Title
Detecting outliers in sliding window over categorical data streams
Author
QunHui Wu ; Shilong Ma
Author_Institution
State Key Lab. of Software Dev. Environ., Beijing Univ. of Aeronaut. & Astronaut., Beijing, China
Volume
3
fYear
2011
fDate
26-28 July 2011
Firstpage
1663
Lastpage
1667
Abstract
Outlier mining is an important and active research issue in anomaly detection. However, it is a difficult problem since categorical data arrive at a fast rate, some data may be outdated and the outliers identified are likely to change. In this paper, we propose an efficient algorithm for mining outliers from categorical data streams, which discover closed frequent patterns in sliding window first. Then WCFPOF (Weighted Closed Frequent Pattern Outlier Factor) is introduced to measure the complete categorical data, and the corresponding candidate outliers are stored in QIS (Query Indexed Structure). By employing the decayed function, the outdated outliers are faded to generate the final outliers. Experimental results show that our algorithm has higher detection precision than FindFPOF. Otherwise, our algorithm has better scalability with different data sizes.
Keywords
data mining; database indexing; fault tolerant computing; query processing; very large databases; anomaly detection; categorical data streams; decayed function; outlier detection; outlier mining; query indexed structure; sliding window; weighted closed frequent pattern outlier factor; Clustering algorithms; Computational efficiency; Data mining; Data structures; Itemsets; Partitioning algorithms; Scalability; categorical data streams; closed frequent pattern; outlier detection; sliding window;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-61284-180-9
Type
conf
DOI
10.1109/FSKD.2011.6019780
Filename
6019780
Link To Document