• DocumentCode
    3495742
  • Title

    Higher-order representations of protein structure space

  • Author

    Molloy, Kevin ; Van, M. Jennifer ; Barbara, Daniel ; Shehu, Amarda

  • Author_Institution
    Dept. of Comput. Sci., George Mason Univ., Fairfax, VA, USA
  • fYear
    2013
  • fDate
    12-14 June 2013
  • Firstpage
    1
  • Lastpage
    2
  • Abstract
    Fragment-based representations of protein structure have recently been proposed to identify remote homologs with reasonable accuracy. The representations have also been shown through PCA to elucidate low-dimensional maps of protein structure space. In this work we conduct further analysis of these representations, showing that the low-dimensional maps preserve functional co-localization. Moreover, we employ Latent Dirichlet Allocation to investigate a new, topic-based representation. We show through various techniques adapted from text mining that the topics have unique signatures over structural classes and allow a complementary yet informative organization of protein structure space.
  • Keywords
    molecular biophysics; molecular configurations; proteins; fragment-based representation; functional colocalization; higher-order representation; informative organization; latent Dirichlet allocation; low-dimensional maps; protein structure space; text mining; topic-based representation; Biological system modeling; Libraries; Proteins; Resource management; Semantics; Text mining; Vectors; Latent Dirichlet Allocation; molecular fragments; protein structure space;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Advances in Bio and Medical Sciences (ICCABS), 2013 IEEE 3rd International Conference on
  • Conference_Location
    New Orleans, LA
  • Type

    conf

  • DOI
    10.1109/ICCABS.2013.6629202
  • Filename
    6629202