Title :
AORS: Affinity-based outlier ranking score
Author :
Shaohong Zhang ; Hau-San Wong ; Wen-Jun Shen ; Dongqing Xie
Author_Institution :
Dept. of Comput. Sci., Guangzhou Univ., Guangzhou, China
Abstract :
Outlier ranking methods can provide a quantitative measure to evaluate the outlierness of data instances in data clustering and attract great interest in pattern recognition and data mining communities. However, it has been pointed out that the diverse scaling ranges of these scores bring difficulty to result interpretation. Moreover, popular outlier ranking scores based on simple distance measures might not accurately reflect the complex affinity among data points. In this paper, we propose a new outlier ranking method based on consensus affinity of a cluster ensemble. Two new outlier ranking scores generalized from well-known clustering evaluation measures, Rvv from the RAND measure and ARIvv from Adjusted Rand Index (ARI), are adopted for outlierness evaluation. Compared to other outlierness ranking measures, the two new measures have the desired bounds without additional transformations. Consistent with the improvement of Adjusted Rand Index (ARI) over RAND, we find that ARIvv also significantly outperforms Rvv. Benefiting from the consensus affinity of a cluster ensemble, our proposed method with the ARIvv score provides significant improvement beyond a number of competing algorithms on public UCI benchmark data sets. Studies with both theoretical analysis and experimental validation show the effectiveness of our proposed methods.
Keywords :
learning (artificial intelligence); pattern clustering; statistical analysis; AORS method; ARI; RAND measure; adjusted Rand index; affinity-based outlier ranking score; cluster ensemble; clustering evaluation measures; data clustering; data instance; data mining; data points; distance measure; outlier ranking methods; outlierness evaluation; pattern recognition; public UCI benchmark data sets; quantitative measure; Clustering algorithms; Correlation; Educational institutions; Indexes; Partitioning algorithms; Power measurement; Uncertainty;
Conference_Titel :
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-6627-1
DOI :
10.1109/IJCNN.2014.6889551