DocumentCode :
3207684
Title :
Mining ordinal patterns for data cleaning
Author :
Liu, Y.B. ; Liu, D.Y.
fYear :
2004
fDate :
8-10 Nov. 2004
Firstpage :
438
Lastpage :
443
Abstract :
It is well recognized that sequential pattern mining plays an essential role in many scientific and business domains. In this paper, a new extension of sequential pattern, ordinal pattern, is proposed. An ordinal pattern is an ordinal sequence of attributes, whose values commonly occur in ascending order over data set. Ordinal pattern mining requests that values of different attributes must be comparable and ordinal. After each record in data set is transformed into an ordinal sequence of attributes according to their ordinal values, ordinal patterns can be mined by means of mining sequential patterns. But our work is different from sequential pattern mining. One use of ordinal patterns is to identify possible error records in data cleaning, in which the values of attributes break the ordinal patterns which most of the data conform to. Experiments verify the high efficiency of the method presented.
Keywords :
data mining; pattern recognition; data cleaning; ordinal pattern; sequential pattern mining; Cleaning; Computer science; Computer science education; Data mining; Diseases; Educational institutions; Educational technology; Itemsets; Knowledge engineering; Laboratories;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Reuse and Integration, 2004. IRI 2004. Proceedings of the 2004 IEEE International Conference on
Print_ISBN :
0-7803-8819-4
Type :
conf
DOI :
10.1109/IRI.2004.1431500
Filename :
1431500
Link To Document :
بازگشت