DocumentCode :
2007679
Title :
Comprehensible Models for Predicting Molecular Interaction with Heart-Regulating Genes
Author :
Sonstrod, Cecilia ; Johansson, Ulf ; Norinder, Ulf ; Bostrom, Henrik
Author_Institution :
Sch. of Bus. & Inf., Univ. of Boras, Boras, Sweden
fYear :
2008
fDate :
11-13 Dec. 2008
Firstpage :
559
Lastpage :
564
Abstract :
When using machine learning for in silico modeling, the goal is normally to obtain highly accurate predictive models. Often, however, models should also bring insights into interesting relationships in the domain. It is then desirable that machine learning techniques have the ability to obtain small and transparent models, where the user can control the tradeoff between accuracy, comprehensibility and coverage. In this study, three different decision list algorithms are evaluated on a dataset concerning the interaction of molecules with a human gene that regulates heart functioning (hERG). The results show that decision list algorithms can obtain predictive performance not far from the state-of-the-art method random forests, but also that algorithms focusing on accuracy alone may produce complex decision lists that are very hard to interpret. The experiments also show that by sacrificing accuracy only to a limited degree, comprehensibility (measured as both model size and classification complexity) can be improved remarkably.
Keywords :
biochemistry; biology computing; learning (artificial intelligence); classification complexity; decision list algorithms; heart-regulating genes; in silico modeling; machine learning; molecular interaction prediction; random forests; Biochemistry; Biological information theory; Biological system modeling; Drugs; High temperature superconductors; Humans; Informatics; Machine learning; Pharmaceuticals; Predictive models; Comprehensible Models; Data Mining; Decision Lists; hERG;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications, 2008. ICMLA '08. Seventh International Conference on
Conference_Location :
San Diego, CA
Print_ISBN :
978-0-7695-3495-4
Type :
conf
DOI :
10.1109/ICMLA.2008.130
Filename :
4725029
Link To Document :
بازگشت