• DocumentCode
    506822
  • Title

    Enhancing prediction understandability for transmembane segments by BoostingFOIL

  • Author

    He, Jieyue ; Chen, Pingping ; Zhao, Dejing ; Zhong, Wei

  • Author_Institution
    Sch. of Comput. Sci. & Eng., Southeast Univ., Nanjing, China
  • Volume
    1
  • fYear
    2009
  • fDate
    20-22 Nov. 2009
  • Firstpage
    739
  • Lastpage
    743
  • Abstract
    In recent years, many studies have focused on improving the accuracy of prediction of trans-membrane segments, and many significant results have been achieved. In spite of these considerable results, the existing methods lack the ability to explain the process of how a learning result is reached and why a prediction decision is made. The explanation of the decision process is important for acceptance of machine learning technology in bioinformatics applications such as protein structure prediction. Decision trees provide insightful interpretation, with form of the propositional IF-THEN rules. The decision tree algorithm produces a large number of rules which is not easy to read and difficult to express adequately complex characteristics of biological sequence. First-order rules with variables have outstanding representation capability, and they can be used to reduce the number of rules. Therefore, in this paper, we present an innovative approach to generate rules for understanding prediction of transmembrane segments. This new approach combines the first-order inductive learning (FOIL) with enhancing techniques of Boosting to produce a new algorithm called BoostingFOIL. The experimental results for prediction of transmembrane segments on 165 low-resolution test data set show that not only the comprehensibility of BoostingFOIL is much better than that of decision tree, but also the test accuracy of these rules is higher as well. The most important contribution of our work is that the first-order rules produced by BoostingFOIL can be easily applied to advanced deduction in inductive learning procedure.
  • Keywords
    bioinformatics; biomembranes; decision making; decision trees; learning (artificial intelligence); molecular biophysics; proteins; BoostingFOIL algorithm; bioinformatics; decision tree; first-order inductive learning; inductive learning procedure; transmembane segments; transmembrane proteins; Bioinformatics; Computer science; Decision making; Decision trees; Hidden Markov models; Humans; Machine learning; Proteins; Support vector machines; Testing; Decision tree; First Order Inductive Learner; Transmembrane segments prediction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Computing and Intelligent Systems, 2009. ICIS 2009. IEEE International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4244-4754-1
  • Electronic_ISBN
    978-1-4244-4738-1
  • Type

    conf

  • DOI
    10.1109/ICICISYS.2009.5358389
  • Filename
    5358389