DocumentCode :
2219659
Title :
A vertical outlier detection algorithm with clusters as by-product
Author :
Ren, Dongmei ; Rahal, Imad ; Perrizo, William
Author_Institution :
Dept. of Comput. Sci., North Dakota State Univ., Fargo, ND, USA
fYear :
2004
fDate :
15-17 Nov. 2004
Firstpage :
22
Lastpage :
29
Abstract :
Outlier detection can lead to the discovery of unexpected and interesting knowledge, which is critically important in areas such as monitoring criminal activity in electronic commerce, credit card fraud, and the like. In this work, we propose an outlier detection method that produces clusters as a by-product and works efficiently on large datasets. Our contributions are: a) We introduce a local connective factor (LCF); b) Based on LCF, we propose an outlier detection method that efficiently detects outliers and groups data into clusters in a single pass. Our method does not require a prior clustering step, which is the first step in other state-of-the-art clustering-based outlier detection methods; c) The performance of our method is further improved by means of a vertical data representation, P-trees. We tested our method on a real dataset. It shows roughly a fivefold speed improvement over other contemporary clustering-based outlier detection approaches.
Keywords :
data mining; edge detection; pattern clustering; statistical databases; tree data structures; very large databases; P-trees; clustering process; datasets; knowledge discovery; local connective factor; outlier detection; vertical data representation; Computer science; Computerized monitoring; Credit cards; Data models; Detection algorithms; Electronic commerce; Neodymium; Performance analysis; Surveillance; Testing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Tools with Artificial Intelligence, 2004. ICTAI 2004. 16th IEEE International Conference on
ISSN :
1082-3409
Print_ISBN :
0-7695-2236-X
Type :
conf
DOI :
10.1109/ICTAI.2004.22
Filename :
1374166