DocumentCode :
441814
Title :
Mining attributes´ sequential patterns for error identification in data set
Author :
Liu, Ya-Bo ; Liu, Da-you
Author_Institution :
Coll. of Comput. Sci. & Technol., Jilin Univ., Changchun, China
Volume :
3
fYear :
2005
fDate :
18-21 Aug. 2005
Firstpage :
1931
Abstract :
It is well recognized that sequential pattern mining plays an essential role in many scientific and business domains. In this paper, a new extension of sequential pattern, attributes´ sequential pattern, is proposed. An attributes´ sequential pattern is a sequence of attributes, whose values commonly occur in ascending order over data set. After each record in data set is transformed into an attributes´ sequence according to their ordinal values, attributes´ sequential patterns can be mined by means of mining sequential patterns. But our work is different from sequential pattern mining. One use of attributes´ sequential patterns is to identify possible errors in data set for data cleaning, in which the values of attributes break the attributes´ sequential patterns which most of the data conform to. Experiments verify the high efficiency of the method presented.
Keywords :
data mining; pattern recognition; sequences; attribute sequential pattern mining; data cleaning; data set error identification; Cleaning; Computer errors; Computer science; Data mining; Diseases; Educational institutions; Educational technology; Itemsets; Laboratories; Pattern recognition; Attributes’ sequence; Attributes’ sequential pattern; Data Cleaning; Sequential pattern;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
Conference_Location :
Guangzhou, China
Print_ISBN :
0-7803-9091-1
Type :
conf
DOI :
10.1109/ICMLC.2005.1527261
Filename :
1527261
Link To Document :
بازگشت