• DocumentCode
    16772
  • Title

    Weighted Tanimoto Extreme Learning Machine with Case Study in Drug Discovery

  • Author

    Czarnecki, Wojciech Marian

  • Author_Institution
    Fac. of Math. & Comput. Sci., Jagiellonian Univ., Krakow, Poland
  • Volume
    10
  • Issue
    3
  • fYear
    2015
  • fDate
    Aug. 2015
  • Firstpage
    19
  • Lastpage
    29
  • Abstract
    Machine learning methods are becoming more and more popular in the field of computer-aided drug design. The specific data characteristic, including sparse, binary representation as well as noisy, imbalanced datasets, presents a challenging binary classification problem. Currently, two of the most successful models in such tasks are the Support Vector Machine (SVM) and Random Forest (RF). In this paper, we introduce a Weighted Tanimoto Extreme Learning Machine (T-WELM), an extremely simple and fast method for predicting chemical compound biological activity and possibly other data with discrete, binary representation. We show some theoretical properties of the proposed model including the ability to learn arbitrary sets of examples. Further analysis shows numerous advantages of T-WELM over SVMs, RFs and traditional Extreme Learning Machines (ELM) in this particular task. Experiments performed on 40 large datasets of thousands of chemical compounds show that T-WELMs achieve much better classification results and are at the same time faster in terms of both training time and further classification than both ELM models and other state-of-the-art methods in the field.
  • Keywords
    drug delivery systems; learning (artificial intelligence); medical computing; pattern classification; support vector machines; RF; SVM; T-WELM; binary classification problem; chemical compound biological activity prediction; computer-aided drug design; data characteristic; drug discovery; random forest; support vector machine; weighted Tanimoto extreme learning machine method; Biological system modeling; Compounds; Computational modeling; Design automation; Drugs; Fingerprint recognition; Machine learning;
  • fLanguage
    English
  • Journal_Title
    Computational Intelligence Magazine, IEEE
  • Publisher
    ieee
  • ISSN
    1556-603X
  • Type

    jour

  • DOI
    10.1109/MCI.2015.2437312
  • Filename
    7160842