Title :
Identifying structural repeats in proteins using graph centrality measures
Author :
Jain, Ruchi ; Yalamanchili, Hari Krishna ; Parekh, Nita
Author_Institution :
Center for Comput. Natural, Sci. & Bioinf., IIIT, Hyderabad, India
Abstract :
Here we apply the graph-theoretic concept of betweenness centrality to a class of protein repeats, e.g., Armadillo (ARM) and HEAT. The Betweenness of a node represents how often a node is traversed on the shortest path between all pairs of nodes i, j in the network and thus gives the contribution of each node in the network. These repeats are not easily detectable at the sequence level because of low conservation between independent repeated units, e.g., HEAT repeats are known to have less than 13% identity. Their identification at the structure level typically involves self structure-structure comparison, which can be computationally very intensive. Our analysis of a set of proteins from ARM and HEAT repeat family shows that the repeat regions exhibit similar connectivity patterns for the repeating units. Since it is generally accepted that in many networks, the larger the degree of a node, the larger the chance that many of the shortest paths will pass through this node, computing vertex Betweenness provides a simple and elegant approach for identifying tandem structural repeats in proteins.
Keywords :
biology computing; data analysis; graph theory; molecular configurations; proteins; Armadillo; HEAT; graph centrality measure; graph theory; protein structural repeat identification; structure-structure comparison; Amino acids; Bioinformatics; Computer networks; Databases; Graph theory; Particle measurements; Pattern analysis; Proteins; Radar; Solvents; Armadillo; Betweenness; HEAT; Protein contact network; Protein structural repeats;
Conference_Titel :
Nature & Biologically Inspired Computing, 2009. NaBIC 2009. World Congress on
Conference_Location :
Coimbatore
Print_ISBN :
978-1-4244-5053-4
DOI :
10.1109/NABIC.2009.5393609