Regional Information Center for Science and Technology - New Methods for Deviation-Based Outlier Detection in Large Database

DocumentCode :

2739482

Title :

New Methods for Deviation-Based Outlier Detection in Large Database

Author :

Zhang, Zhiyuan ; Feng, Xia

Author_Institution :

Sch. of Comput. Sci. & Technol., Civil Aviation Univ. of China, Tianjin, China

Volume :

fYear :

2009

fDate :

14-16 Aug. 2009

Firstpage :

495

Lastpage :

499

Abstract :

Outlier (also called deviation or exception) detection is an important function in data mining. Among the approaches to identifying outliers, the deviation-based approach has many advantages and has drawn much attention. Although a linear algorithm for sequential deviation detection has been proposed, it is not stable and always loses many deviation points. In this paper, we present three algorithms for detecting deviations. The running time of the first algorithm is proportional to the square of the dataset length, and that of the second is proportional to the square of the number of distinct data values. These two algorithms lead to the same result, but the latter is much more efficient than the former. In the third algorithm, a deviation factor is defined to help find deviation points. Although it yields only approximate results, it is the most efficient of the three, especially for large datasets with many distinct values.

Keywords :

data mining; database management systems; dataset length square proportional; deviation factor; distinct data values square proportional; linear algorithm; outlier detection; sequential deviation detection; Algorithm design and analysis; Computer science; Counting circuits; Data mining; Databases; Dynamic programming; Fuzzy systems; Histograms; Out of order; Performance analysis

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on

Conference_Location :

Tianjin

Print_ISBN :

978-0-7695-3735-1

Type :

conf

DOI :

10.1109/FSKD.2009.303

Filename :

5358526

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2739482