DocumentCode :
2767574
Title :
Prototype based outlier detection
Author :
Kim, Seungtaek ; Cho, Sungzoon
Author_Institution :
Seoul Nat. Univ., Seoul
fYear :
0
fDate :
0-0 0
Firstpage :
820
Lastpage :
826
Abstract :
Outliers refer to "minority" data that are different from most other data. They usually disturb data mining process. But, sometimes they provide valuable information. Thus, it is important to identify outliers in a given data set. In this paper, we propose a novel approach which scores "outlierness" based on the distance from majority data. First, prototype data are identified. Second, those prototypes that are distant from others are eliminated. Finally, the outlierness of each data point is computed as the distance from the remaining prototypes. Experiments involving various data sets show that the proposed approach performs well in terms of accuracy, robustness and versatility.
Keywords :
data mining; prototypes; data mining process; minority data; outlier detection; prototype; Cellular phones; Credit cards; Data mining; Equations; Intrusion detection; Parametric statistics; Probability; Prototypes; Robustness; Statistical distributions;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2006. IJCNN '06. International Joint Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
0-7803-9490-9
Type :
conf
DOI :
10.1109/IJCNN.2006.246769
Filename :
1716180
Link To Document :
بازگشت