Title :
Decision tree algorithm optimization research based on MapReduce
Author :
Fangfang Yuan;Fusheng Lian;Xingjian Xu;Zhaohua Ji
Author_Institution :
College of Computer and Information Engineering, Inner Mongolia Normal University, Hohhot 010010, China
Abstract :
With the advent of the computer science, the data volume that needed to be processed under many practical situations increases dramatically, challenging many traditional machine learning techniques. Bearing this in mind, we made an intensive study on the optimization of decision tree algorithm and its corresponding porting to the big data analysis in this paper. An optimized genetic algorithm is merged into the implementation of the decision tree algorithm above, and we also invent a parallel genetic decision tree algorithm using MapReduce, which is very suitable for analyzing big data in cloud computing environment. Experiment results show that our algorithm acquires a nearly linear speedup, keeping a similar classification accuracy at the same time.
Keywords :
"Decision trees","Algorithm design and analysis","Classification algorithms","Optimization","Genetic algorithms","Genetics","Cloud computing"
Conference_Titel :
Software Engineering and Service Science (ICSESS), 2015 6th IEEE International Conference on
Print_ISBN :
978-1-4799-8352-0
Electronic_ISBN :
2327-0594
DOI :
10.1109/ICSESS.2015.7339225