Title of article
Structure-based classification of active and inactive estrogenic compounds by decision tree, LVQ and kNN methods
Author/Authors
Arja Asikainen، نويسنده , , Mikko Kolehmainen، نويسنده , , Juhani Ruuskanen، نويسنده , , Kari Tuppurainen، نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2006
Pages
16
From page
658
To page
673
Abstract
The performance of decision tree (DT), learning vector quantization (LVQ), and k-nearest neighbour (kNN) methods classifying active and inactive estrogenic compounds in terms of their structure activity relationship (SAR) was evaluated. A set of 311 compounds was used for construction of the models, the predictive power of which was verified with separate training and test sets. Principal components derived from molecular descriptors calculated with DRAGON software were used as variables representing the structures of the compounds. Broadly, kNN had the best classification ability and DT the weakest, although the performance of each method was dependent on the group of compounds used for modelling. The best performance was obtained with kNN for the calf estrogen receptor data, averaging 98.3% of correctly classified compounds in the external tests. Overall, the results indicate that all the methods tested are suitable for the SAR classification of estrogenic compounds, producing models with a predictive power ranging from adequate to excellent.
Keywords
SAR , Principal components , Endocrine disruptors , Sammon s mapping , Tooldiag , estrogen receptor
Journal title
Chemosphere
Serial Year
2006
Journal title
Chemosphere
Record number
738494
Link To Document