DocumentCode :
3078325
Title :
An optimized approach to generate simplified decision trees
Author :
Hussain, Muhammad Awais ; Rao, M.K. ; Mahmood, Ali Mirza
Author_Institution :
Dept. of Dept. of Electron. & Comput. Eng., KLEF Univ., Guntur, India
fYear :
2013
fDate :
26-28 Dec. 2013
Firstpage :
1
Lastpage :
5
Abstract :
With the development of computer technology and computer network technology, the degree of information is getting higher and higher, people´s ability of using information technology to collect and produce data is substantially enhanced. The discovery of the optimal algorithms for mining useful knowledge and improving the effectiveness of information utilization are problems need to be addressed urgently. It was under this background that Data Mining (DM) technology came into being and developed. Data mining is a process to extract information and knowledge from a large number of incomplete, noisy, fuzzy and random data. In these data, the information and knowledge are hidden, which people can´t discover at present, but potentially useful. In recent years, the decision tree has become an important data mining method. The basic learning strategy of decision tree is divide and conquer technique, which uses from root to leaf of the decision tree structure. This paper emphasizes to propose a new decision tree model based on multivariate statistical method Principal Component analysis on multi-attribute data for reducing dimensionality and to transform traditional decision tree algorithm to form a new algorithmic model. The experiments demonstrate that this method can not only optimizes the structure of the decision tree, but also overcomes the problems existing in pruning and to mine the better rule set without effecting the purpose of prediction accuracy altogether.
Keywords :
data mining; decision trees; principal component analysis; algorithmic model; computer network technology; data mining method; data mining technology; decision tree algorithm; decision tree model; information extraction; information technology; information utilization; multiattribute data; multivariate statistical method; optimal algorithms; prediction accuracy; principal component analysis; simplified decision tree structure; useful knowledge mining; Accuracy; Algorithm design and analysis; Data mining; Decision trees; Educational institutions; Impurities; Training; Decision trees; Multivariate Statistical Method; Optimized decision tree; Principal component analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Computing Research (ICCIC), 2013 IEEE International Conference on
Conference_Location :
Enathi
Print_ISBN :
978-1-4799-1594-1
Type :
conf
DOI :
10.1109/ICCIC.2013.6724191
Filename :
6724191
Link To Document :
بازگشت