DocumentCode
617954
Title
An efficient encoding for simplified protein structure prediction using genetic algorithms
Author
Shatabda, Swakkhar ; Newton, M. A. Hakim ; Rashid, M.A. ; Sattar, Abdul
Author_Institution
Inst. for Integrated & Intell. Syst. (IIIS), Griffith Univ., Nathan, QLD, Australia
fYear
2013
fDate
20-23 June 2013
Firstpage
1217
Lastpage
1224
Abstract
Protein structure prediction is one of the most challenging problems in computational biology and remains unsolved for many decades. In a simplified version of the problem, the task is to find a self-avoiding walk with the minimum free energy assuming a discrete lattice and a given energy matrix. Genetic algorithms currently produce the state-of-the-art results for simplified protein structure prediction. However, performance of the genetic algorithms largely depends on the encodings they use in representing protein structures and the twin removal technique they use in eliminating duplicate solutions from the current population. In this paper, we present a new efficient encoding for protein structures. Our encoding is nonisomorphic in nature and results into efficient twin removal. This helps the search algorithm diversify and explore a larger area of the search space. In addition to this, we also propose an approximate matching scheme for removing near-similar solutions from the population. Our encoding algorithm is generic and applicable to any lattice type. On the standard benchmark proteins, our techniques significantly improve the state-of-the-art genetic algorithm for hydrophobic-polar (HP) energy model on face-centered-cubic (FCC) lattice.
Keywords
benchmark testing; biology; encoding; genetic algorithms; hydrophobicity; matrix algebra; proteins; search problems; FCC lattice; computational biology; discrete lattice; encoding algorithm; energy matrix; face-centered-cubic lattice; hydrophobic-polar energy model; matching scheme; minimum free energy; protein structure prediction; search algorithm; search space; self-avoiding walk; standard benchmark proteins; state-of-the-art genetic algorithm; Encoding; FCC; Genetic algorithms; Lattices; Proteins; Sociology; Vectors; Encoding; Genetic Algorithm; HP Model; Lattice; Protein Structure Prediction; Simplified Models;
fLanguage
English
Publisher
ieee
Conference_Titel
Evolutionary Computation (CEC), 2013 IEEE Congress on
Conference_Location
Cancun
Print_ISBN
978-1-4799-0453-2
Electronic_ISBN
978-1-4799-0452-5
Type
conf
DOI
10.1109/CEC.2013.6557704
Filename
6557704
Link To Document