Title :
Treatment Method after Discretization of Continuous Attributes Based on Attributes Importance and Samples Entropy
Author :
Ha, ShenHua ; Zhuang, Liyan ; Zhou, Yuxin ; Pei, Zhili ; Liu, Lisha ; Lu, Yinan ; Kong, Ying
Author_Institution :
Coll. of Comput. Sci. & Technol., Inner Mongolia Univ. for the Nat., Tongliao, China
Abstract :
It has great significance to efficiently distinguish the type of the samples´ data in the decision table after the discretization for the course of machine learning and data mining afterwards. This paper puts forward an annotation method of distinguishing the data type based on attributes importance and the samples entropy, and processed the simulation test using part of the UCI database which was artificially modified, it turns out the method is able to efficiently identify the data type with high accuracy, low misidentification rate and low reject rate.
Keywords :
attribute grammars; data mining; decision tables; entropy; learning (artificial intelligence); UCI database; annotation method; attributes; data mining; decision table; discretization; machine learning; samples entropy; Databases; Entropy; Error analysis; Heart; Iris; Noise; Set theory; Discretization; attributes importance; decision table; samples entropy;
Conference_Titel :
Intelligent Computation Technology and Automation (ICICTA), 2011 International Conference on
Conference_Location :
Shenzhen, Guangdong
Print_ISBN :
978-1-61284-289-9
DOI :
10.1109/ICICTA.2011.293