Title :
An experimental investigation of the effect of discrete attributes on the precision of classification methods
Author :
Entezari-Maleki, Reza ; Iranmanesh, Seyyed Mehdi ; Minaei-Bidgoli, Behrouz
Author_Institution :
Dept. of Comput. Eng., Iran Univ. of Sci. & Technol. (IUST), Tehran, Iran
Abstract :
In this paper, the precisions of the logistic regression, naive-Bayes and linear data classification methods, with regard to the area under curve (AUC) metric have been compared. The effect of parameters including size of the dataset, kind of the independent attributes, number of the discrete attributes, and their values have been investigated. From the results, it can be concluded that in datasets consisting of both discrete and continuous attributes, the AUC of the three mentioned classifiers is the same. With increasing the number of the discrete attributes, the AUC of the logistic regression is increased and the precision related to this classifier become more than the other two classifiers. Also considering impact of the discrete attributes it can be seen that with increasing the number of values in discrete attributes the AUC related to the logistic regression classifier increases and linear regressions´ AUC decreases, but the AUC of the naive-Bayes classifier remains constant. The results of this research can help data miners in selecting the more efficient classifiers based on the conditions of feature that exist in their datasets.
Keywords :
Bayes methods; data mining; pattern classification; regression analysis; area under curve metric; continuous attributes; discrete attributes; linear data classification method; logistic regression classifier; naive-Bayes method; Artificial neural networks; Breast cancer; Classification tree analysis; Linear regression; Logistics; Niobium; Statistical analysis; Support vector machine classification; Support vector machines; Testing;
Conference_Titel :
Information and Communication Technologies, 2009. ICICT '09. International Conference on
Conference_Location :
Karachi
Print_ISBN :
978-1-4244-4608-7
Electronic_ISBN :
978-1-4244-4609-4
DOI :
10.1109/ICICT.2009.5267189