• DocumentCode
    79586
  • Title

    Solving the Secondary Structure Matching Problem in Cryo-EM De Novo Modeling Using a Constrained K -Shortest Path Graph Algorithm

  • Author

    Al Nasr, Kamal ; Ranjan, Desh ; Zubair, Mohammad ; Lin Chen ; Jing He

  • Author_Institution
    Dept. of Comput. Sci., Tennessee State Univ., Nashville, TN, USA
  • Volume
    11
  • Issue
    2
  • fYear
    2014
  • fDate
    March-April 2014
  • Firstpage
    419
  • Lastpage
    430
  • Abstract
    Electron cryomicroscopy is becoming a major experimental technique in solving the structures of large molecular assemblies. More and more three-dimensional images have been obtained at the medium resolutions between 5 and 10 Å. At this resolution range, major α-helices can be detected as cylindrical sticks and β-sheets can be detected as plain-like regions. A critical question in de novo modeling from cryo-EM images is to determine the match between the detected secondary structures from the image and those on the protein sequence. We formulate this matching problem into a constrained graph problem and present an O(Δ2N22N) algorithm to this NP-Hard problem. The algorithm incorporates the dynamic programming approach into a constrained K-shortest path algorithm. Our method, DP-TOSS, has been tested using α-proteins with maximum 33 helices and α-β proteins up to five helices and 12 β-strands. The correct match was ranked within the top 35 for 19 of the 20 α-proteins and all nine α-β proteins tested. The results demonstrate that DP-TOSS improves accuracy, time and memory space in deriving the topologies of the secondary structure elements for proteins with a large number of secondary structures and a complex skeleton.
  • Keywords
    biology computing; dynamic programming; electron microscopy; graphs; molecular biophysics; molecular configurations; proteins; sheet materials; α-β proteins; α-helices; β-sheets; NP-Hard problem; O(Δ2N22N) algorithm; complex skeleton; constrained K-shortest path graph algorithm; constrained graph problem; cryoelectron microscopy de novo modeling; cylindrical sticks; detected secondary structures; dynamic programming approach; large molecular assembly structure; matching problem; medium resolutions; memory space; plain-like regions; protein sequence; resolution range; secondary structure elements; secondary structure matching problem; three-dimensional images; Amino acids; Heuristic algorithms; Image resolution; Protein sequence; Skeleton; Topology; Protein structure; algorithm; electron cryomicroscopy; graph; image; secondary structure;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2014.2302803
  • Filename
    6727403