DocumentCode :
1934093
Title :
Fast C4.5
Author :
He, Ping ; Chen, Ling ; Xu, Xiao-Hua
Author_Institution :
Yangzhou Univ., Yangzhou
Volume :
5
fYear :
2007
fDate :
19-22 Aug. 2007
Firstpage :
2841
Lastpage :
2846
Abstract :
C4.5 is a well-known machine learning algorithm used extensively, however, its runtime performance is sacrificed for the consideration of the limited main memory at that time. We present a fast implementation of C4.5 algorithm, named FC4.5(Fast C4.5). It organizes novel data structures, uses the indirect bucket-sort combined with the bit-parallel technique, and confines the binary-search of the cutoff within the narrowest range. The combination of these techniques enables FC4.5 greatly accelerates the tree construction process of C4.5 algorithm. Experiments show that FC4.5 can build the same decision tree as C4.5 (Release 8) system and the runtime performance gain up to 5.8 times. Besides, FC4.5 also achieves a good scalability on different kinds of datasets.
Keywords :
data structures; learning (artificial intelligence); trees (mathematics); bit-parallel technique; data structure; fast C4.5 algorithm; indirect bucket-sort; machine learning algorithm; Acceleration; Computer science; Cybernetics; Data mining; Decision trees; Machine learning; Machine learning algorithms; Performance gain; Runtime; Scalability; Bit-parallel technique; C4.5; Classification; Fast; Indirect bucket-sort;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2007 International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-0973-0
Electronic_ISBN :
978-1-4244-0973-0
Type :
conf
DOI :
10.1109/ICMLC.2007.4370632
Filename :
4370632
Link To Document :
بازگشت