• DocumentCode
    32789
  • Title

    Divide and Conquer Approach to Contact Map Overlap Problem Using 2D-Pattern Mining of Protein Contact Networks

  • Author

    Koneru, Suvarna Vani ; Durga Bhavani, S.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., V.R. Siddhartha Eng. Coll., Vijayawada, India
  • Volume
    12
  • Issue
    4
  • fYear
    2015
  • fDate
    July-Aug. 1 2015
  • Firstpage
    729
  • Lastpage
    737
  • Abstract
    A novel approach to Contact Map Overlap (CMO) problem is proposed using the two dimensional clusters present in the contact maps. Each protein is represented as a set of the non-trivial clusters of contacts extracted from its contact map. The approach involves finding matching regions between the two contact maps using approximate 2D-pattern matching algorithm and dynamic programming technique. These matched pairs of small contact maps are submitted in parallel to a fast heuristic CMO algorithm. The approach facilitates parallelization at this level since all the pairs of contact maps can be submitted to the algorithm in parallel. Then, a merge algorithm is used in order to obtain the overall alignment. As a proof of concept, MSVNS, a heuristic CMO algorithm is used for global as well as local alignment. The divide and conquer approach is evaluated for two benchmark data sets that of Skolnick and Ding et al. It is interesting to note that along with achieving saving of time, better overlap is also obtained for certain protein folds.
  • Keywords
    biology computing; data mining; divide and conquer methods; dynamic programming; molecular biophysics; molecular clusters; molecular configurations; pattern matching; proteins; 2D-pattern mining; approximate 2D-pattern matching algorithm; benchmark data sets; contact map overlap problem; divide-and-conquer approach; dynamic programming technique; fast heuristic CMO algorithm; nontrivial clusters; protein contact networks; protein folds; two-dimensional clusters; Approximation algorithms; Bioinformatics; Clustering algorithms; Dynamic programming; Heuristic algorithms; Pattern matching; Proteins; Approximate 2D-pattern matching; dynamic programming; mining contact maps; non-crossing matching;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2015.2394402
  • Filename
    7018052