Title :
Enhancement of Initial Equivalency for Protein Structure Alignment Based on Encoded Local Structures
Author :
Hung, Kuo-Yuan ; Jui-Chih Wang ; Cheng-Wei Chen ; Cheng-Long Chuang ; Kun-Nan Tsai ; Chung-Ming Chen
Author_Institution :
Inst. of Biomed. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Abstract :
Most alignment algorithms find an initial equivalent residue pair followed by an iterative optimization process to explore better near-optimal alignments in the surrounding solution space of the initial alignment. It plays a decisive role in determining the alignment quality since a poor initial alignment may make the final alignment trapped in an undesirable local optimum even with an iterative optimization. We proposed a vector-based alignment algorithm with a new initial alignment approach accounting for local structure features called MIRAGE-align. The new idea is to enhance the quality of the initial alignment based on encoded local structural alphabets to identify the protein structure pair whose sequence identity falls in or below twilight zone. The statistical analysis of alignment quality based on match index and computation time demonstrated that MIRAGE-align algorithm outperformed four previously published algorithms, i.e., the residue-based algorithm, the vector-based algorithm, TM-align, and Fr-TM-align. MIRAGE-align yields a better estimate of initial solution to enhance the quality of initial alignment and enable the employment of a noniterative optimization process to achieve a better alignment.
Keywords :
biology computing; iterative methods; molecular biophysics; molecular configurations; optimisation; proteins; statistical analysis; vectors; Fr-TM-align; MIRAGE-align algorithm; TM-align; encoded local structural alphabets; encoded local structures; iterative optimization process; match index; near-optimal alignments; protein structure alignment algorithms; residue-based algorithm; statistical analysis; surrounding solution space; twilight zone; vector-based alignment algorithm; Algorithm design and analysis; Amino acids; Optimization; Protein engineering; Proteins; Statistical analysis; Vectors; Initial equivalency; secondary structure; structural alphabet; structure alignment; Computational Biology; Models, Molecular; Protein Structure, Secondary; Proteins; Sequence Alignment;
Journal_Title :
Information Technology in Biomedicine, IEEE Transactions on
DOI :
10.1109/TITB.2012.2204892