DocumentCode
3292114
Title
A Novel Outlier Detection Algorithm for Distributed Databases
Author
Zhou, Jiaogen ; Zhao, Chunjiang ; Wan, You ; Huang, Wenjiang ; Yang, Baozhu ; Ge, Jixin
Author_Institution
NERCITA, Beijing Acad. of Agric. & Forestry Sci., Beijing
Volume
5
fYear
2008
fDate
18-20 Oct. 2008
Firstpage
293
Lastpage
297
Abstract
Traditional outlier detection algorithms are designed to apply to centralized databases, not distributed databases. We proposed a novel outlier detection algorithm for distributed databases. Given data assigned to different network nodes of a network platform, where each node has its own memory and hard disc, and the communication between nodes driven by message, the populated data would be non-overlapping. The working way of the network system is a manager-worker mode, that is, that a node as manager is responsible for assigning tasks to worker and querying the results from worker nodes. The algorithm first detected local outliers based on distance on all nodes, and then identified local outliers collected in the central node where a globally screening operation on all local outliers was implemented to achieve really global outliers. To scale the algorithm to massive data and reduce its computing complexity, a data filtering technology was further presented. Experimental results demonstrated that the algorithm effectively and efficiently handled on real and artificial data.
Keywords
computational complexity; distributed databases; centralized databases; computational complexity; data filtering technology; distributed databases; manager-worker mode; outlier detection algorithm; Algorithm design and analysis; Application software; Classification algorithms; Clustering algorithms; Detection algorithms; Distributed computing; Distributed databases; Filtering; Nearest neighbor searches; Statistical distributions; data mining; distributed database; outlier detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
Conference_Location
Jinan Shandong
Print_ISBN
978-0-7695-3305-6
Type
conf
DOI
10.1109/FSKD.2008.422
Filename
4666540
Link To Document