• DocumentCode
    1641038
  • Title

    A novel method for protein 3D-structure similarity measure based on n-gram modeling

  • Author

    Razmara, Jafar ; Deris, Safaai B.

  • Author_Institution
    Fac. of Comput. Sci. & Inf. Syst., Univ. Teknol. Malaysia, Skudai
  • fYear
    2008
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    The present paper describes a novel method for measuring structural similarity of proteins in three dimensions. The method gets its roots from computational linguistics and the related techniques for modeling protein structure in string form and pairwise comparison of protein sequences. The method uses n-gram based modeling techniques for capturing regularities in protein structure sequences and joints cross-entropy measures for comparing two protein sequences to do similarity test. In this way, the 3D structure of protein is represented in string form and, then, a similarity test is performed over these sequences. To find an overlap between two protein structures in 3D-space, a superposition task is also applied. In order to confirm the validity of this method, some experiments were performed using a collection of the protein data sets on publicly available servers which showed that the method is efficient.
  • Keywords
    bioinformatics; computational linguistics; molecular biophysics; proteins; 3D-space; bioinformatics; computational linguistics; n-gram modeling; protein 3D-structure similarity measure; protein sequences; protein structure modeling; superposition task; Algorithm design and analysis; Amino acids; Bioinformatics; Biological system modeling; Computational linguistics; Computer science; Dynamic programming; Management information systems; Proteins; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    BioInformatics and BioEngineering, 2008. BIBE 2008. 8th IEEE International Conference on
  • Conference_Location
    Athens
  • Print_ISBN
    978-1-4244-2844-1
  • Electronic_ISBN
    978-1-4244-2845-8
  • Type

    conf

  • DOI
    10.1109/BIBE.2008.4696719
  • Filename
    4696719