Title :
A vertical outlier detection algorithm with clusters as by-product
Author :
Ren, Dongmei ; Rahal, Imad ; Perrizo, William
Author_Institution :
Dept. of Comput. Sci., North Dakota State Univ., Fargo, ND, USA
Abstract :
Outlier detection can lead to discovering unexpected and interesting knowledge, which is critically important to some areas such as monitoring of criminal activities in electronic commerce, credit card fraud, and the like. In This work, we propose an efficient outlier detection method with clusters as by-product, which works efficiently for large datasets. Our contributions are: a) We introduce a local connective factor (LCF); b) Based on LCF, we propose an outlier detection method which can efficiently detect outliers and group data into clusters in a one-time process. Our method does not require the beforehand clustering process, which is the first step in other state-of-the-art clustering-based outlier detection methods; c) The performance of our method is further improved by means of a vertical data representation, P-trees. We tested our method with real dataset. Our method shows around five-time speed improvements compared to the other contemporary clustering-based outlier-detection approaches.
Keywords :
data mining; edge detection; pattern clustering; statistical databases; tree data structures; very large databases; P-trees; clustering process; datasets; knowledge discovery; local connective factor; outlier detection; vertical data representation; Computer science; Computerized monitoring; Credit cards; Data models; Detection algorithms; Electronic commerce; Neodymium; Performance analysis; Surveillance; Testing;
Conference_Titel :
Tools with Artificial Intelligence, 2004. ICTAI 2004. 16th IEEE International Conference on
Print_ISBN :
0-7695-2236-X
DOI :
10.1109/ICTAI.2004.22