• DocumentCode
    3230734
  • Title

    Automatic protein structure classification through structural fingerprinting

  • Author

    Aung, Zeyar ; Tan, Kian-Lee

  • Author_Institution
    Dept. of Comput. Sci., Nat. Univ. of Singapore, Singapore
  • fYear
    2004
  • fDate
    19-21 May 2004
  • Firstpage
    508
  • Lastpage
    515
  • Abstract
    In this paper, we present a new scheme named "CP-Mine" for automatic three-dimensional (3D) protein structure classification using structural fingerprints. We represent a 3D protein structure as a CPset, which is a set of inter-SSE contact patterns (CPs) existing in the protein. Suppose we have a database of protein structures whose class labels are already known, and suppose there are distinct protein structure classes in the database. For each class, we generate its fingerprint by mining the frequent CPsets from all the member protein structures belonging to this class. When we want to predict the class label of an unknown protein, we also generate the CPset of this protein, and find the intersection between this CPset and the fingerprint of each protein structure class one by one. Then, the labels of the classes with the highest degree of intersection are returned as the answer. The proposed method is a pure classification scheme in that any kind of structural comparison, alignment or searching is not needed to be performed. The preliminary experimental results shows that our method can classify the protein structures accurately and efficiently.
  • Keywords
    biology computing; data mining; molecular biophysics; proteins; automatic protein structure classification; frequent CPsets mining; interSSE contact patterns; structural fingerprinting; Bioinformatics; Computer science; Data mining; Drives; Fingerprint recognition; Laboratories; Machine learning; Predictive models; Protein engineering; Spatial databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering, 2004. BIBE 2004. Proceedings. Fourth IEEE Symposium on
  • Print_ISBN
    0-7695-2173-8
  • Type

    conf

  • DOI
    10.1109/BIBE.2004.1317385
  • Filename
    1317385