Title :
Evaluating Protein Similarity from Coarse Structures
Author :
Wang, Yong; Wu, Ling-yun; Zhang, Ji-Hong; Zhan, Zhong-Wei; Zhang, Xiang-Sun; Chen, Luonan
Author_Institution :
Inst. of Appl. Math., Chinese Acad. of Sci., Beijing, China
Abstract :
To unravel the relationship between protein function and protein structure, it is essential to assess protein similarity from different aspects. Although many methods have been proposed for protein structure alignment or comparison, alternative similarity measures remain in strong demand because large-scale structure databases require fast screening and querying. In this paper, we first formulate a novel representation of a protein structure, the feature sequence of surface (FSS). We then develop a new scoring scheme to measure the similarity between two such representations. To verify the proposed method, we conduct numerical experiments on four different protein data sets. We also classify SARS coronavirus to test the effectiveness of the new method. Furthermore, preliminary results of fast classification of the whole CATH v2.5.1 database based on the new macrostructure similarity are given as a pilot study. We demonstrate that the proposed approach to measuring the similarity between protein structures is simple to implement, computationally efficient, and fast. In addition, the method itself provides a new, quantitative tool for viewing a protein structure.
Keywords :
molecular biophysics; numerical analysis; proteins; CATH v2.5.1 database; SARS coronavirus; feature sequence-of-surface; macrostructure similarity; numerical experiments; protein function; protein similarity; protein structure; Atomic measurements; Biology computing; Data mining; Frequency selective surfaces; Large-scale systems; Mathematics; Organizing; Proteins; Sequences; Spatial databases; Bioinformatics (genome or protein) databases; Machine learning; Optimization; Protein structure; protein surface; structure comparison; Algorithms; Computational Biology; Computer Simulation; Databases, Protein; Humans; Models, Molecular; Pattern Recognition, Automated; Protein Conformation; SARS Virus; Sequence Alignment; Sequence Analysis, Protein; Software
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2007.70250