DocumentCode :
3717393
Title :
Top (k1, k2) Distance-based outliers detection in an uncertain dataset
Author :
Fei Liu;Yan Jia
Author_Institution :
College of Computer, National University of Defense Technology, 410073, Changsha, P.R. China
fYear :
2015
Firstpage :
2290
Lastpage :
2299
Abstract :
In this paper, we focus on distance-based outliers detection in an uncertain dataset, which is very useful in large social network. Based on the x-tuple model and the possible world semantics, we propose the concept of tuple outlier score, top k1 probability and top (k1, k2) distance-based outlier. We then design an algorithm using dynamic programming technique to calculate tuple outlier scores and detect top (k1, k2) distance-based outliers. The local neighbor region is proposed to detect approximate outliers with high precision efficiently. We also propose two pruning strategies to avoid additional computation overhead and prune data objects that cannot be outliers. After theory analysis, we conduct experiments in two real datasets to verify good performance of our method.
Keywords :
"Algorithm design and analysis","Probability","Social network services","Semantics","Data models","Acceleration","Mathematical model"
Publisher :
ieee
Conference_Titel :
Big Data (Big Data), 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/BigData.2015.7364018
Filename :
7364018
Link To Document :
بازگشت