Title of article :
Building optimal regression tree by ant colony system–genetic algorithm: Application to modeling of melting points Original Research Article
Author/Authors :
Bahram Hemmateenejad، نويسنده , , Mojtaba Shamsipur، نويسنده , , Vali Zare-Shahabadi، نويسنده , , Morteza Akhond، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2011
Abstract :
The classification and regression trees (CART) possess the advantage of being able to handle large data sets and yield readily interpretable models. A conventional method of building a regression tree is recursive partitioning, which results in a good but not optimal tree. Ant colony system (ACS), which is a meta-heuristic algorithm and derived from the observation of real ants, can be used to overcome this problem. The purpose of this study was to explore the use of CART and its combination with ACS for modeling of melting points of a large variety of chemical compounds. Genetic algorithm (GA) operators (e.g., cross averring and mutation operators) were combined with ACS algorithm to select the best solution model. In addition, at each terminal node of the resulted tree, variable selection was done by ACS–GA algorithm to build an appropriate partial least squares (PLS) model. To test the ability of the resulted tree, a set of approximately 4173 structures and their melting points were used (3000 compounds as training set and 1173 as validation set). Further, an external test set containing of 277 drugs was used to validate the prediction ability of the tree. Comparison of the results obtained from both trees showed that the tree constructed by ACS–GA algorithm performs better than that produced by recursive partitioning procedure.
Keywords :
Genetic Algorithm , Melting points , Ant colony system , Quantitative structure–property relationship , Classification and regression tree
Journal title :
Analytica Chimica Acta
Journal title :
Analytica Chimica Acta