DocumentCode :
3189396
Title :
Mining Distance-Based Outliers from Categorical Data
Author :
Li, Shuxin ; Lee, Robert ; Lang, Sheau-Dong
fYear :
2007
fDate :
28-31 Oct. 2007
Firstpage :
225
Lastpage :
230
Abstract :
Distance-based outlier detection is an important data mining technique that finds abnormal data objects according to some distance function. However, when this technique is applied to high-dimensional categorical data, a traditional simple matching dissimilarity measure does not provide an adequate model. In this article, we employ a new common- neighbor-based distance function to measure the proximity between a pair of data points. Experiments show that better outlier mining results can be achieved when the new distance function is utilized rather than a conventional simple matching dissimilarity measure.
Keywords :
Algorithm design and analysis; Computational complexity; Computer science; Conferences; Data mining; Data security; Electronic commerce; Euclidean distance; Object detection; Risk management;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops, 2007. ICDM Workshops 2007. Seventh IEEE International Conference on
Conference_Location :
Omaha, NE
Print_ISBN :
978-0-7695-3019-2
Electronic_ISBN :
978-0-7695-3033-8
Type :
conf
DOI :
10.1109/ICDMW.2007.75
Filename :
4476672
Link To Document :
بازگشت