Title of article :
(φ,ψ)₂ Motifs: A Purely Conformation-Based Fine-Grained Enumeration of Protein Parts at the Two-Residue Level
Author/Authors :
Scott A. Hollingsworth, Matthew C. Lewis, Donald S. Berkholz, Weng-Keen Wong, P. Andrew Karplus
Issue Information :
Journal with serial issue number, year 2012
Pages :
16
From page :
78
To page :
93
Abstract :
A deep understanding of protein structure benefits from the use of a variety of classification strategies that enhance our ability to effectively describe local patterns of conformation. Here, we use a clustering algorithm to analyze 76,533 all-trans segments from protein structures solved at 1.2 Å resolution or better to create a purely φ,ψ-based comprehensive empirical categorization of common conformations adopted by two adjacent φ,ψ pairs (i.e., (φ,ψ)₂ motifs). The clustering algorithm works in an origin-shifted four-dimensional space based on the two φ,ψ pairs to yield a parameter-dependent list of (φ,ψ)₂ motifs, in order of their prominence. The results are remarkably distinct from and complementary to the standard hydrogen-bond-centered view of secondary structure. New insights include an unprecedented level of precision in describing the φ,ψ angles of both previously known and novel motifs, ordering of these motifs by their population density, a data-driven recommendation that the standard Cα(i)…Cα(i+3) < 7 Å criterion for defining turns be changed to 6.5 Å, identification of β-strand and turn capping motifs, and identification of conformational capping by residues in polyproline II conformation. We further document that the conformational preferences of a residue are substantially influenced by the conformation of its neighbors, and we suggest that accounting for these dependencies will improve protein modeling accuracy. Although the CUEVAS-4D(r10ε14) ‘parts list’ presented here is only an initial exploration of the complex (φ,ψ)₂ landscape of proteins, it shows that there is value to be had from this approach, and it opens the door to more in-depth characterizations at the (φ,ψ)₂ level and at higher dimensions.
Keywords :
secondary structure , Ramachandran plot , Protein conformation , capping motifs , Machine Learning
Journal title :
Journal of Molecular Biology
Serial Year :
2012
Record number :
1254330