Title :
The PDD Framework for Detecting Categories of Peculiar Data
Author :
Shrestha, Mahesh ; Hamilton, Howard J. ; Yao, Yiyu ; Konkel, Ken ; Geng, Liqiang
Author_Institution :
Dept. of Comput. Sci., Univ. of Regina, Regina, SK
Abstract :
Peculiar data are objects that are relatively few in number and significantly different from the other objects in a data set. In this paper, we propose the PDD framework for detecting multiple categories of peculiar data. This framework provides an extensible set of perspectives for viewing data, currently including viewing data as a set of records, attributes, frequencies, intervals, sequences, or sequences of changes. By using these six views of the data, multiple categories of peculiar data can be detected to reveal different aspects of the data. For each view, the framework provides an extensible set of peculiarity measures to detect outliers and other kinds of peculiar data. The PDD framework has been implemented for Oracle and Access. Experiments are reported for data sets concerning Regina weather and NHL hockey.
Keywords :
data analysis; data mining; database management systems; Access; NHL hockey; Oracle; PDD framework; Regina weather; data aspects; outlier detection; peculiar data category detection; peculiarity measure; Cancer; Cleaning; Computer science; Density measurement; Event detection; Frequency; Intrusion detection; Medical diagnosis; Object detection;
Conference_Titel :
Data Mining, 2006. ICDM '06. Sixth International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2701-7
DOI :
10.1109/ICDM.2006.159