DocumentCode
62623
Title
Inducing Decision Trees based on a Cluster Quality Index
Author
Loyola, O. ; Medina, M.A. ; Garcia, M.
Author_Institution
Centro de Bioplantas, Ciego de Avila, Cuba
Volume
13
Issue
4
fYear
2015
fDate
Apr-15
Firstpage
1141
Lastpage
1147
Abstract
Decision trees are popular classifiers in data mining, artificial intelligence, and pattern recognition, because they are accurate and easy to comprehend. In this paper, we introduce a new procedure for inducing decision trees, to obtain trees that are more accurate, more compact, and more balanced. Each candidate split is evaluated using Rand Statistics, a quality index based on external measures, because it is considered by many authors as the best existing index. Our method was compared with other state-of-the-art methods and the results over 30 databases from the UCI Repository prove our claims. We also introduce a new equation to measure the balance of a binary tree.
Keywords
decision trees; pattern classification; pattern clustering; UCI repository; binary tree; cluster quality index; decision trees; rand statistics; Breast cancer; Decision trees; Glass; Indexes; Ionosphere; Silicon; Silicon compounds; decision trees; gain ratio; gini index; rand statistic; supervised classification; validation indexes;
fLanguage
English
Journal_Title
Latin America Transactions, IEEE (Revista IEEE America Latina)
Publisher
ieee
ISSN
1548-0992
Type
jour
DOI
10.1109/TLA.2015.7106368
Filename
7106368
Link To Document