• DocumentCode
    2528536
  • Title

    Rule clustering and super-rule generation for transmembrane segments prediction

  • Author

    He, Jieyue ; Chen, Bernard ; Hu, Hae-Jin ; Harrison, Robert ; Tai, Phang C. ; Dong, Yisheng ; Pan, Yi

  • Author_Institution
    Dept. of Comput. Sci., Southeast Univ., Nanjing, China
  • fYear
    2005
  • fDate
    8-11 Aug. 2005
  • Firstpage
    224
  • Lastpage
    227
  • Abstract
    The explanation of a decision is important for the acceptance of machine learning technology in bioinformatics applications such as protein structure prediction. In past research, we have already combined SVM with decision tree to extract rules for understanding transmembrane segments prediction. However, rules we have gotten are as many as about 20,000. This large number of rules makes them difficult for us to interpret their meaning. In this paper, a novel approach of rule clustering (SVM_DT_C) for super-rule generation is presented. We use K-means clustering to cluster huge number of rules to generate many new super-rules. The experimental results show that the super-rules produced by SVM_DT_C can be analyzed manually by a researcher, and these super-rules are not only new but also achieve very high transmembrane prediction accuracy (exceeding 95%) most of the times.
  • Keywords
    biology computing; biomembranes; decision trees; learning (artificial intelligence); pattern clustering; support vector machines; K-means clustering; SVM; bioinformatics; decision tree; machine learning technology; protein structure; rule clustering; rule extraction; super-rule generation; transmembrane segments prediction; Aggregates; Bioinformatics; Biology computing; Clustering algorithms; Computer science; Decision trees; Encoding; Machine learning; Proteins; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Systems Bioinformatics Conference, 2005. Workshops and Poster Abstracts. IEEE
  • Print_ISBN
    0-7695-2442-7
  • Type

    conf

  • DOI
    10.1109/CSBW.2005.121
  • Filename
    1540609