• DocumentCode
    617954
  • Title

    An efficient encoding for simplified protein structure prediction using genetic algorithms

  • Author

    Shatabda, Swakkhar ; Newton, M. A. Hakim ; Rashid, M.A. ; Sattar, Abdul

  • Author_Institution
    Inst. for Integrated & Intell. Syst. (IIIS), Griffith Univ., Nathan, QLD, Australia
  • fYear
    2013
  • fDate
    20-23 June 2013
  • Firstpage
    1217
  • Lastpage
    1224
  • Abstract
    Protein structure prediction is one of the most challenging problems in computational biology and remains unsolved for many decades. In a simplified version of the problem, the task is to find a self-avoiding walk with the minimum free energy assuming a discrete lattice and a given energy matrix. Genetic algorithms currently produce the state-of-the-art results for simplified protein structure prediction. However, performance of the genetic algorithms largely depends on the encodings they use in representing protein structures and the twin removal technique they use in eliminating duplicate solutions from the current population. In this paper, we present a new efficient encoding for protein structures. Our encoding is nonisomorphic in nature and results into efficient twin removal. This helps the search algorithm diversify and explore a larger area of the search space. In addition to this, we also propose an approximate matching scheme for removing near-similar solutions from the population. Our encoding algorithm is generic and applicable to any lattice type. On the standard benchmark proteins, our techniques significantly improve the state-of-the-art genetic algorithm for hydrophobic-polar (HP) energy model on face-centered-cubic (FCC) lattice.
  • Keywords
    benchmark testing; biology; encoding; genetic algorithms; hydrophobicity; matrix algebra; proteins; search problems; FCC lattice; computational biology; discrete lattice; encoding algorithm; energy matrix; face-centered-cubic lattice; hydrophobic-polar energy model; matching scheme; minimum free energy; protein structure prediction; search algorithm; search space; self-avoiding walk; standard benchmark proteins; state-of-the-art genetic algorithm; Encoding; FCC; Genetic algorithms; Lattices; Proteins; Sociology; Vectors; Encoding; Genetic Algorithm; HP Model; Lattice; Protein Structure Prediction; Simplified Models;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation (CEC), 2013 IEEE Congress on
  • Conference_Location
    Cancun
  • Print_ISBN
    978-1-4799-0453-2
  • Electronic_ISBN
    978-1-4799-0452-5
  • Type

    conf

  • DOI
    10.1109/CEC.2013.6557704
  • Filename
    6557704