DocumentCode :
3277576
Title :
Improved C4.5 decision tree algorithm based on sample selection
Author :
Fucai Chen ; Xiaowei Li ; Lixiong Liu
Author_Institution :
China Nat. Digital Switching Syst. Eng. Technol. R&D Center, Zhengzhou, China
fYear :
2013
fDate :
23-25 May 2013
Firstpage :
779
Lastpage :
782
Abstract :
To improve the classification accuracy and reduce the training time of large sample, and find the best training set, this paper proposes the improved C4.5 decision tree algorithm based on sample selection. The algorithm is based on the fact that decision tree can only get local optimal solution and has the bigger relativity with initial sample. In sample selection, we use iteration process to find the best training set. Using accuracy of the selected sample training as iteration Information is highly optimized for general use. Partition similarity is used for the best selection as the standard. Experiments show that the accuracy and time consumption of the proposed algorithm, aiming at classification and identification of large sample data, is better than C4.5 decision tree.
Keywords :
decision trees; pattern classification; classification accuracy improvement; improved C4.5 decision tree algorithm; iteration Information; iteration process; large sample data classification; large sample data identification; local optimal solution; partition similarity; sample selection; training time reduction; IP networks; Training; C4.5 decision tree; partition similarity; sample selection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Engineering and Service Science (ICSESS), 2013 4th IEEE International Conference on
Conference_Location :
Beijing
ISSN :
2327-0586
Print_ISBN :
978-1-4673-4997-0
Type :
conf
DOI :
10.1109/ICSESS.2013.6615421
Filename :
6615421
Link To Document :
بازگشت