DocumentCode
150210
Title
MGI: A New Heuristic for classifying continuous attributes in decision trees
Author
Anuradha ; Gupta, Gaurav
Author_Institution
CSE/IT Dept., ITM Univ., Gurgaon, India
fYear
2014
fDate
5-7 March 2014
Firstpage
291
Lastpage
295
Abstract
A New Heuristic for generating height balanced compact decision trees with reduced path length has been proposed in this paper. The proposed node split measure (MGI) modifies attribute selection process of gini index by giving importance to almost equal sized sub partitions and at same time counts frequently occurring target classes in those sub partitions. This results in less skewed decision trees, while maintaining similar accuracy as provided by gini index. As a part of data pre processing step, MGI gives a new approach to handle missing values and also provides initial data set size reduction. The performance evaluation is done on few sample data sets taken from UCI Machine learning repository. The experimental evaluation of MGI node split measure is done with gini index which shows that the proposed split measure produces height balanced decision tree of reduced height, which in turn reduces antecedents in the rule generated.
Keywords
decision trees; pattern classification; MGI; UCI machine learning repository; attribute selection process; continuous attributes classification; data preprocessing; data set size reduction; gini index; height balanced compact decision trees; heuristic; missing values; node split measure; reduced path length; subpartitions; Accuracy; Classification algorithms; Decision trees; Equations; Gain; Indexes; Vegetation; Accuracy; Classification; Decision Tree; Gini Index; Height;
fLanguage
English
Publisher
ieee
Conference_Titel
Computing for Sustainable Global Development (INDIACom), 2014 International Conference on
Conference_Location
New Delhi
Print_ISBN
978-93-80544-10-6
Type
conf
DOI
10.1109/IndiaCom.2014.6828146
Filename
6828146
Link To Document