Title :
Automated protein NMR resonance assignments
Author :
Wan, Xiang ; Xu, Dong ; Slupsky, Carolyn M. ; Lin, Guohui
Author_Institution :
Dept. of Comput. Sci., Alberta Univ., Edmonton, Alta., Canada
Abstract :
NMR resonance peak assignment is one of the key steps in solving an NMR protein structure. The assignment process links resonance peaks to individual residues of the target protein sequence, providing the prerequisite for establishing intra- and inter-residue spatial relationships between atoms. The assignment process is tedious and time-consuming, which could take many weeks. Though there exist a number of computer programs to assist the assignment process, many NMR labs are still doing the assignments manually to ensure quality. This paper presents (1) a new scoring system for mapping spin systems to residues, (2) an automated adjacency information extraction procedure from NMR spectra, and (3) a very fast assignment algorithm based on our previous proposed greedy filtering method and a maximum matching algorithm to automate the assignment process. The computational tests on 70 instances of (pseudo) experimental NMR data of 14 proteins demonstrate that the new score scheme has much better discerning power with the aid of adjacency information between spin systems simulated across various NMR spectra. Typically, with automated extraction of adjacency information, our method achieves nearly complete assignments for most of the proteins. The experiment shows very promising perspective that the fast automated assignment algorithm together with the new score scheme and automated adjacency extraction may be ready for practical use.
Keywords :
biological NMR; biology computing; genetic algorithms; molecular biophysics; proteins; spin systems; NMR labs; NMR protein structure; NMR resonance peak; NMR spectra; adjacency information extraction procedure; assignment algorithm; computational tests; computer programs; discerning power; experimental NMR data; greedy filtering method; inter-residue spatial relationships; intra-residue spatial relationships; maximum matching algorithm; new score scheme; scoring system; spin systems mapping; target protein sequence; Amino acids; Biomedical informatics; Chemicals; Computer networks; Data mining; Filtering algorithms; Magnetic resonance; Nuclear magnetic resonance; Protein engineering; Protein sequence;
Conference_Titel :
Bioinformatics Conference, 2003. CSB 2003. Proceedings of the 2003 IEEE
Print_ISBN :
0-7695-2000-6
DOI :
10.1109/CSB.2003.1227319