DocumentCode :
79586
Title :
Solving the Secondary Structure Matching Problem in Cryo-EM De Novo Modeling Using a Constrained K -Shortest Path Graph Algorithm
Author :
Al Nasr, Kamal ; Ranjan, Desh ; Zubair, Mohammad ; Lin Chen ; Jing He
Author_Institution :
Dept. of Comput. Sci., Tennessee State Univ., Nashville, TN, USA
Volume :
11
Issue :
2
fYear :
2014
fDate :
March-April 2014
Firstpage :
419
Lastpage :
430
Abstract :
Electron cryomicroscopy is becoming a major experimental technique in solving the structures of large molecular assemblies. More and more three-dimensional images have been obtained at the medium resolutions between 5 and 10 Å. At this resolution range, major α-helices can be detected as cylindrical sticks and β-sheets can be detected as plain-like regions. A critical question in de novo modeling from cryo-EM images is to determine the match between the detected secondary structures from the image and those on the protein sequence. We formulate this matching problem into a constrained graph problem and present an O(Δ2N22N) algorithm to this NP-Hard problem. The algorithm incorporates the dynamic programming approach into a constrained K-shortest path algorithm. Our method, DP-TOSS, has been tested using α-proteins with maximum 33 helices and α-β proteins up to five helices and 12 β-strands. The correct match was ranked within the top 35 for 19 of the 20 α-proteins and all nine α-β proteins tested. The results demonstrate that DP-TOSS improves accuracy, time and memory space in deriving the topologies of the secondary structure elements for proteins with a large number of secondary structures and a complex skeleton.
Keywords :
biology computing; dynamic programming; electron microscopy; graphs; molecular biophysics; molecular configurations; proteins; sheet materials; α-β proteins; α-helices; β-sheets; NP-Hard problem; O(Δ2N22N) algorithm; complex skeleton; constrained K-shortest path graph algorithm; constrained graph problem; cryoelectron microscopy de novo modeling; cylindrical sticks; detected secondary structures; dynamic programming approach; large molecular assembly structure; matching problem; medium resolutions; memory space; plain-like regions; protein sequence; resolution range; secondary structure elements; secondary structure matching problem; three-dimensional images; Amino acids; Heuristic algorithms; Image resolution; Protein sequence; Skeleton; Topology; Protein structure; algorithm; electron cryomicroscopy; graph; image; secondary structure;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2014.2302803
Filename :
6727403
Link To Document :
بازگشت