DocumentCode
480154
Title
DB-Outlier Detection Algorithm Using Divide and Conquer Approach over Dynamic DataStream
Author
Elahi, Manzoor ; Lv, Xinjie ; Nisar, Wasif ; Khan, Imran Ali ; Qiao, Ying ; Wang, Hongan
Author_Institution
Intell. Eng. Lab., Chinese Acad. of Sci., Beijing
Volume
4
fYear
2008
fDate
12-14 Dec. 2008
Firstpage
438
Lastpage
443
Abstract
Anomaly detection is currently an important and active research problem in many fields and involved in numerous applications. Most of the existing methods are based on distance measure which can produce better results as compared to other methods. But in case of datastream these methods are computationally less efficient. In order to find some specified number of neighbors for each data element make these methods inappropriate for datasteam environment, because of the huge volume of DataStream. Moreover, declaring a point as an outlier as it arrives, can often lead us to a wrong decision. In this paper we introduced an algorithm which divide incoming stream in chunks and make preliminary clusters in each chunk during the first stage. In the second stage K-nearest neighbor approach for outlier detection is applied to each cluster. In this way we can reduce the nearest neighbor searches for each data element as well as can detect better outliers with evolution of data stream. Several experiments on different datasets confirm that our technique can find suitable outliers with low computational cost than the other exiting distance based approaches for outlier detection over data stream.
Keywords
electronic data interchange; inference mechanisms; pattern recognition; anomaly detection; data element; data stream; divide and conquer approach; dynamic DataStream; k-nearest neighbor approach; nearest neighbor search; outlier detection algorithm; Application software; Computer science; Costs; Data engineering; Data mining; Databases; Detection algorithms; Information technology; Nearest neighbor searches; Software engineering; datastream; mining; outlier;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Software Engineering, 2008 International Conference on
Conference_Location
Wuhan, Hubei
Print_ISBN
978-0-7695-3336-0
Type
conf
DOI
10.1109/CSSE.2008.1053
Filename
4722653
Link To Document