Title :
Higher-order representations of protein structure space
Author :
Molloy, Kevin ; Van, M. Jennifer ; Barbara, Daniel ; Shehu, Amarda
Author_Institution :
Dept. of Comput. Sci., George Mason Univ., Fairfax, VA, USA
Abstract :
Fragment-based representations of protein structure have recently been proposed to identify remote homologs with reasonable accuracy. The representations have also been shown through PCA to elucidate low-dimensional maps of protein structure space. In this work we conduct further analysis of these representations, showing that the low-dimensional maps preserve functional co-localization. Moreover, we employ Latent Dirichlet Allocation to investigate a new, topic-based representation. We show through various techniques adapted from text mining that the topics have unique signatures over structural classes and allow a complementary yet informative organization of protein structure space.
Keywords :
molecular biophysics; molecular configurations; proteins; fragment-based representation; functional colocalization; higher-order representation; informative organization; latent Dirichlet allocation; low-dimensional maps; protein structure space; text mining; topic-based representation; Biological system modeling; Libraries; Proteins; Resource management; Semantics; Text mining; Vectors; Latent Dirichlet Allocation; molecular fragments; protein structure space;
Conference_Titel :
Computational Advances in Bio and Medical Sciences (ICCABS), 2013 IEEE 3rd International Conference on
Conference_Location :
New Orleans, LA
DOI :
10.1109/ICCABS.2013.6629202