DocumentCode
2331507
Title
A Modified Density Based Outlier Mining Algorithm for Large Dataset
Author
Yang, Peng ; Huang, Biao
Author_Institution
Chongqing Univ. of Arts & Sci., Chongqing
fYear
2008
fDate
20-20 Nov. 2008
Firstpage
37
Lastpage
40
Abstract
Outlier mining is to discover the objects with exceptional behavior in dataset. It is an important challenge from the knowledge discovery standpoint and attracts much attention recently. The density based outlier mining algorithm is an effective approach to detect anomalous points. However, such algorithms usually need amounts of computations. In this paper, we propose a modified density based detection algorithm which utilizes the data partitioning method. Furthermore, it presents some speedup strategies such as the introduction of module information to avoid large number of unnecessary computations while finding outliers. The algorithm is applied on both synthetic and real datasets and the experimental results show that it is efficient for outlier detection in large dataset.
Keywords
data mining; anomalous points detection; data partitioning; knowledge discovery; large dataset; outlier mining; Data engineering; Detection algorithms; Engineering management; Information management; Information technology; Intrusion detection; Partitioning algorithms; Seminars; Space technology; Technology management;
fLanguage
English
Publisher
ieee
Conference_Titel
Future Information Technology and Management Engineering, 2008. FITME '08. International Seminar on
Conference_Location
Leicestershire, United Kingdom
Print_ISBN
978-0-7695-3480-0
Type
conf
DOI
10.1109/FITME.2008.106
Filename
4746436
Link To Document