Title :
Divide and Conquer Approach to Contact Map Overlap Problem Using 2D-Pattern Mining of Protein Contact Networks
Author :
Koneru, Suvarna Vani ; Durga Bhavani, S.
Author_Institution :
Dept. of Comput. Sci. & Eng., V.R. Siddhartha Eng. Coll., Vijayawada, India
Abstract :
A novel approach to Contact Map Overlap (CMO) problem is proposed using the two dimensional clusters present in the contact maps. Each protein is represented as a set of the non-trivial clusters of contacts extracted from its contact map. The approach involves finding matching regions between the two contact maps using approximate 2D-pattern matching algorithm and dynamic programming technique. These matched pairs of small contact maps are submitted in parallel to a fast heuristic CMO algorithm. The approach facilitates parallelization at this level since all the pairs of contact maps can be submitted to the algorithm in parallel. Then, a merge algorithm is used in order to obtain the overall alignment. As a proof of concept, MSVNS, a heuristic CMO algorithm is used for global as well as local alignment. The divide and conquer approach is evaluated for two benchmark data sets that of Skolnick and Ding et al. It is interesting to note that along with achieving saving of time, better overlap is also obtained for certain protein folds.
Keywords :
biology computing; data mining; divide and conquer methods; dynamic programming; molecular biophysics; molecular clusters; molecular configurations; pattern matching; proteins; 2D-pattern mining; approximate 2D-pattern matching algorithm; benchmark data sets; contact map overlap problem; divide-and-conquer approach; dynamic programming technique; fast heuristic CMO algorithm; nontrivial clusters; protein contact networks; protein folds; two-dimensional clusters; Approximation algorithms; Bioinformatics; Clustering algorithms; Dynamic programming; Heuristic algorithms; Pattern matching; Proteins; Approximate 2D-pattern matching; dynamic programming; mining contact maps; non-crossing matching;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2015.2394402