DocumentCode :
3105629
Title :
The PDD Framework for Detecting Categories of Peculiar Data
Author :
Shrestha, Mahesh ; Hamilton, Howard J. ; Yao, Yiyu ; Konkel, Ken ; Geng, Liqiang
Author_Institution :
Dept. of Comput. Sci., Univ. of Regina, Regina, SK
fYear :
2006
fDate :
18-22 Dec. 2006
Firstpage :
562
Lastpage :
571
Abstract :
Peculiar data are objects that are relatively few in number and significantly different from the other objects in a data set. In this paper, we propose the PDD framework for detecting multiple categories of peculiar data. This framework provides an extensible set of perspectives for viewing data, currently including viewing data as a set of records, attributes, frequencies, intervals, sequences, or sequences of changes. By using these six views of the data, multiple categories of peculiar data can be detected to reveal different aspects of the data. For each view, the framework provides an extensible set of peculiarity measures to detect outliers and other kinds of peculiar data. The PDD framework has been implemented for Oracle and Access. Experiments are reported for data sets concerning Regina weather and NHL hockey.
Keywords :
data analysis; data mining; database management systems; Access; NHL hockey; Oracle; PDD framework; Regina weather; data aspects; outlier detection; peculiar data category detection; peculiarity measure; Cancer; Cleaning; Computer science; Density measurement; Event detection; Frequency; Intrusion detection; Medical diagnosis; Object detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2006. ICDM '06. Sixth International Conference on
Conference_Location :
Hong Kong
ISSN :
1550-4786
Print_ISBN :
0-7695-2701-7
Type :
conf
DOI :
10.1109/ICDM.2006.159
Filename :
4053082
Link To Document :
بازگشت