• DocumentCode
    3298733
  • Title

    A framework for predicting proteins 3D structures

  • Author

    Duwairi, Rehab ; Kassawneh, Amal

  • Author_Institution
    Qatar Univ., Doha
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    37
  • Lastpage
    44
  • Abstract
    This paper proposes a framework for predicting protein three dimensional structures from their primary sequences. The proposed method utilizes the natural multi-label and hierarchical intrinsic nature of proteins to build a multi-label and hierarchical classifier for predicting protein folds. The classifier predicts protein folds in two stages, at the first stage, it predicts the protein structural class, and in the second stage, it predicts the protein fold. When comparing our technique with SVM, naive Bayes, and boosted C4.5 we get a higher accuracy more than SVM and better than naive Bayes when using the composition, secondary structure and hydrophobicity feature attributes, and give higher accuracy than C4.5 when using composition, secondary structure, hydrophobicity, and polarity feature attributes. MuLAM was used as a basic classifier in the hierarchy of the implemented framework. Two major modifications were made to MuLAM, namely: the pheromone update and term selection strategies of MuLAM were altered.
  • Keywords
    Bayes methods; biology computing; molecular biophysics; pattern classification; proteins; support vector machines; MuLAM; SVM; boosted C4.5 method; hydrophobicity feature attributes; naive Bayes method; pheromone update; protein classifier; protein folds prediction; protein structural class; proteins 3D structures; support vector machine; Amino acids; Biology; Bonding; Chemicals; Computer science; Hydrogen; Protein engineering; Spine; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Systems and Applications, 2008. AICCSA 2008. IEEE/ACS International Conference on
  • Conference_Location
    Doha
  • Print_ISBN
    978-1-4244-1967-8
  • Electronic_ISBN
    978-1-4244-1968-5
  • Type

    conf

  • DOI
    10.1109/AICCSA.2008.4493514
  • Filename
    4493514