Title :
Deriving Quantitative Structure-Activity Relationship Models Using Genetic Programming for Drug Discovery
Author :
Neophytou, Katerina ; Nicolaou, Christos A. ; Pattichis, Constantinos S. ; Schizas, Christos N.
Author_Institution :
Univ. of Cyprus, Nicosia
Abstract :
Genetic Programming is a heuristic search algorithm inspired by evolutionary techniques that has been shown to produce satisfactory solutions to problems related to several scientific domains [1]. Presented here is a methodology for the creation of Quantitative Structure-Activity Relationship (QSAR) models for the prediction of chemical activity, using Genetic Programming. QSAR analysis is crucial for drug discovery since good QSAR models enable human experts to select compounds with increased chances of being active for further investigations. Our technique has been tested using the Selwood dataset, a benchmark dataset for the QSAR field [2]. The results indicate that the QSAR models created are accurate, reliable and simple and can thus be used to identify molecular descriptors correlated with measured activity and for the prediction of the activity of untested molecules. The QSAR models we generated predict the activity of untested molecules with an error ranging between 0.46 -0.8 on the scale [-1,1]. These results compare favourably with results sited in the literature for the same dataset [3], [4], Our models are constructed using any combination of the arithmetic operators {+, -, /, *}, the descriptors available and constant values.
Keywords :
biochemistry; drugs; genetic algorithms; heuristic programming; mathematical operators; medical computing; molecular biophysics; search problems; arithmetic operator; chemical activity prediction; drug discovery; evolutionary technique; genetic programming; heuristic search algorithm; molecular descriptor; quantitative structure-activity relationship model; Arithmetic; Biological system modeling; Biology computing; Chemicals; Computer science; Drugs; Genetic programming; Heuristic algorithms; Humans; Predictive models; Genetic Programming; QSAR; Selwood dataset;
Conference_Titel :
Information Technology Applications in Biomedicine, 2007. ITAB 2007. 6th International Special Topic Conference on
Conference_Location :
Tokyo
Print_ISBN :
978-1-4244-1868-8
Electronic_ISBN :
978-1-4244-1868-8
DOI :
10.1109/ITAB.2007.4407401