DocumentCode :
144802
Title :
A novel discretization technique using Class Attribute Interval Average
Author :
Baka, Abdulloh ; Wettayaprasit, Wiphada ; Vanichayobon, Sirirut
Author_Institution :
Dept. of Comput. Sci., Prince of Songkla Univ., Songkhla, Thailand
fYear :
2014
fDate :
6-8 May 2014
Firstpage :
95
Lastpage :
100
Abstract :
Discretization algorithm is important for data mining preprocessing because it will help the user to easily understand the data, reduce the complexity of data, reduce processing time, and increase efficiency and accuracy of the data. This paper proposes the new discretization algorithm called Class Attribute Interval Average (CAIA). The algorithm uses 2D-quanta matrix table to calculate each of class individual interval´s average and merge the best adjacent intervals to form the new interval. The experimental design uses four-UCI data sets (Iris, Breast Cancer, Heart Diseases, Glass) and four-classification algorithms (J48, RBF, MLP, NB). The comparisons of experimental result with the other six discretization algorithms (EW, EF, ChiMerge, IEM, CAIM, CACC) show that the proposed CAIA has the best mean rank for both of the accuracy and the number of intervals.
Keywords :
data mining; design of experiments; matrix algebra; pattern classification; 2D-quanta matrix table; CAIA; J48; MLP; NB; RBF; UCI data sets; adjacent intervals; breast cancer; class attribute interval average; classification algorithms; data complexity reduction; data mining preprocessing; discretization algorithm; discretization technique; experimental design; glass; heart diseases; iris; Accuracy; Breast cancer; Computer aided instruction; Data mining; Heart; Iris; Merging; Classification; Data Discretization; Data Mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Digital Information and Communication Technology and it's Applications (DICTAP), 2014 Fourth International Conference on
Conference_Location :
Bangkok
Print_ISBN :
978-1-4799-3723-3
Type :
conf
DOI :
10.1109/DICTAP.2014.6821664
Filename :
6821664
Link To Document :
بازگشت