DocumentCode :
2331507
Title :
A Modified Density Based Outlier Mining Algorithm for Large Dataset
Author :
Yang, Peng ; Huang, Biao
Author_Institution :
Chongqing Univ. of Arts & Sci., Chongqing
fYear :
2008
fDate :
20-20 Nov. 2008
Firstpage :
37
Lastpage :
40
Abstract :
Outlier mining is to discover the objects with exceptional behavior in dataset. It is an important challenge from the knowledge discovery standpoint and attracts much attention recently. The density based outlier mining algorithm is an effective approach to detect anomalous points. However, such algorithms usually need amounts of computations. In this paper, we propose a modified density based detection algorithm which utilizes the data partitioning method. Furthermore, it presents some speedup strategies such as the introduction of module information to avoid large number of unnecessary computations while finding outliers. The algorithm is applied on both synthetic and real datasets and the experimental results show that it is efficient for outlier detection in large dataset.
Keywords :
data mining; anomalous points detection; data partitioning; knowledge discovery; large dataset; outlier mining; Data engineering; Detection algorithms; Engineering management; Information management; Information technology; Intrusion detection; Partitioning algorithms; Seminars; Space technology; Technology management;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Future Information Technology and Management Engineering, 2008. FITME '08. International Seminar on
Conference_Location :
Leicestershire, United Kingdom
Print_ISBN :
978-0-7695-3480-0
Type :
conf
DOI :
10.1109/FITME.2008.106
Filename :
4746436
Link To Document :
بازگشت