DocumentCode :
2851419
Title :
RDF: a density-based outlier detection method using vertical data representation
Author :
Ren, Dongmei ; Wang, Baoying ; Perrizo, William
Author_Institution :
Dept. of Comput. Sci., North Dakota State Univ., Fargo, ND, USA
fYear :
2004
fDate :
1-4 Nov. 2004
Firstpage :
503
Lastpage :
506
Abstract :
Outlier detection can lead to discovering unexpected and interesting knowledge, which is critical important to some areas such as monitoring of criminal activities in electronic commerce, credit card fraud, etc. In this paper, we developed an efficient density-based outlier detection method for large datasets. Our contributions are: a) we introduce a relative density factor (RDF); b) based on RDF, we propose an RDF-based outlier detection method which can efficiently prune the data points which are deep in clusters, and detect outliers only within the remaining small subset of the data; c) the performance of our method is further improved by means of a vertical data representation, P-trees. We tested our method with NHL and NBA data. Our method shows an order of magnitude speed improvement compared to the contemporary approaches.
Keywords :
credit transactions; data mining; electronic commerce; fraud; trees (mathematics); very large databases; P-trees; RDF-based outlier detection; credit card fraud; density-based outlier detection; electronic commerce; relative density factor; vertical data representation; Computer science; Computerized monitoring; Costs; Credit cards; Data structures; Electronic commerce; Neodymium; Resource description framework; Surveillance; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2004. ICDM '04. Fourth IEEE International Conference on
Print_ISBN :
0-7695-2142-8
Type :
conf
DOI :
10.1109/ICDM.2004.10010
Filename :
1410346
Link To Document :
بازگشت