Title :
Protein Structure Modeling in Two-Level Topology Strings for Structure Comparison
Author :
Razmara, Jafar ; Deris, Safaai B. ; Parvizpour, Sepideh
Author_Institution :
Fac. of Comput. Sci. & Inf. Syst., Univ. Teknol. Malaysia, Johor Bahru, Malaysia
Abstract :
Representing biological data as textual sequences has opened a new hybrid research area in protein structure analysis. This paper presents a novel method for structural comparison and alignment of proteins based on text modeling techniques. The method first models the geometry of protein secondary and tertiary structure as two-level topology strings, and then employs n-gram modeling and the cross-entropy concept from computational linguistics to compare the strings and align the structures. Experimental results confirm the validity and efficiency of the proposed method. Moreover, the conceptual simplicity of the method suggests that it is applicable and preferable to other related techniques.
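The string-comparison step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the three-letter alphabet (`H`, `E`, `C`), the add-one smoothing, the bigram order, and the function names `ngram_counts`, `cross_entropy`, and `distance` are all assumptions made for illustration, since the abstract does not specify the encoding or the smoothing scheme.

```python
import math
from collections import Counter


def ngram_counts(s, n):
    """Count the n-grams of s and the (n-1)-gram contexts that precede a symbol."""
    grams = Counter(s[i:i + n] for i in range(len(s) - n + 1))
    contexts = Counter(s[i:i + n - 1] for i in range(len(s) - n + 1))
    return grams, contexts


def cross_entropy(train, test, n=2, alphabet="HEC"):
    """Average bits per n-gram of `test` under an n-gram model of `train`.

    Add-one smoothing (an assumption, not taken from the paper) keeps
    unseen n-grams from receiving zero probability.
    """
    count = len(test) - n + 1
    if count <= 0:
        raise ValueError("test string is shorter than n")
    grams, contexts = ngram_counts(train, n)
    v = len(alphabet)
    total = 0.0
    for i in range(count):
        g = test[i:i + n]
        p = (grams[g] + 1) / (contexts[g[:-1]] + v)
        total -= math.log2(p)
    return total / count


def distance(a, b, n=2, alphabet="HEC"):
    """Symmetrised cross-entropy; lower values mean more similar strings."""
    return 0.5 * (cross_entropy(a, b, n, alphabet)
                  + cross_entropy(b, a, n, alphabet))
```

With hypothetical topology strings such as `"HHHHCEEEECHHHH"` and `"HHHCEEEEECHHHH"`, `distance` returns a smaller value than it does for the first string against an all-strand string such as `"EEEECEEEECEEEE"`, which is the behaviour one would expect from a similarity measure of this kind.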
Keywords :
biology computing; data structures; proteins; topology; biological data representation; computational linguistics; cross-entropy concept; protein structure analysis; protein structure modeling; structure comparison; text modeling technique; textual sequence; two-level topology strings; Bioinformatics; Computational modeling; Entropy; Geometry; Hidden Markov models; Proteins; Topology; Cross-entropy; Protein structure comparison; Structure alignment; Topology string; n-gram modeling;
Conference_Title :
Signal-Image Technology and Internet-Based Systems (SITIS), 2010 Sixth International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4244-9527-6
Electronic_ISBN :
978-0-7695-4319-2
DOI :
10.1109/SITIS.2010.57