• DocumentCode
    2007679
  • Title

    Comprehensible Models for Predicting Molecular Interaction with Heart-Regulating Genes

  • Author

    Sonstrod, Cecilia ; Johansson, Ulf ; Norinder, Ulf ; Bostrom, Henrik

  • Author_Institution
    Sch. of Bus. & Inf., Univ. of Boras, Boras, Sweden
  • fYear
    2008
  • fDate
    11-13 Dec. 2008
  • Firstpage
    559
  • Lastpage
    564
  • Abstract
    When using machine learning for in silico modeling, the goal is normally to obtain highly accurate predictive models. Often, however, models should also bring insights into interesting relationships in the domain. It is then desirable that machine learning techniques have the ability to obtain small and transparent models, where the user can control the tradeoff between accuracy, comprehensibility and coverage. In this study, three different decision list algorithms are evaluated on a dataset concerning the interaction of molecules with a human gene that regulates heart functioning (hERG). The results show that decision list algorithms can obtain predictive performance not far from the state-of-the-art method random forests, but also that algorithms focusing on accuracy alone may produce complex decision lists that are very hard to interpret. The experiments also show that by sacrificing accuracy only to a limited degree, comprehensibility (measured as both model size and classification complexity) can be improved remarkably.
  • Keywords
    biochemistry; biology computing; learning (artificial intelligence); classification complexity; decision list algorithms; heart-regulating genes; in silico modeling; machine learning; molecular interaction prediction; random forests; Biochemistry; Biological information theory; Biological system modeling; Drugs; High temperature superconductors; Humans; Informatics; Machine learning; Pharmaceuticals; Predictive models; Comprehensible Models; Data Mining; Decision Lists; hERG;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications, 2008. ICMLA '08. Seventh International Conference on
  • Conference_Location
    San Diego, CA
  • Print_ISBN
    978-0-7695-3495-4
  • Type

    conf

  • DOI
    10.1109/ICMLA.2008.130
  • Filename
    4725029