Title :
Statistical and visual morph movie analysis of crystallographic mutant selection bias in protein mutation resource data
Author :
Krebs, Werner G. ; Bourne, Philip E.
Author_Institution :
Nat. Parternership for Adv. Comput. Infrastructure, San Diego Supercomput. Center, CA, USA
Abstract :
The relationship between protein mutations and conformational change can potentially decipher the language relating sequence to structure. Elsewhere, we presented the protein mutant resource (PMR), an online tool that systematically identified related mutants in the protein databank (PDB), inferred mutant Gene Ontology classifications using data-mining, and allowed intuitive exploration of relationships between mutant structures. Here, we perform a comprehensive statistical analysis of PMR mutants. Although the PMR contains spectacular conformational changes, generally there is a counter-intuitive inverse relationship between conformational change and the number of mutations. That is, PDB mutations contrast naturally evolved mutations. We compare the frequencies of mutations in the PMR/PDB datasets against the PAM250 natural mutation frequencies to confirm this. We make available morph movies from PMR structure pairs, allowing visual analysis of conformational change and the ability to distinguish visually between conformational change due to motions (e.g., ligand binding) and mutations. The PMR is at http://pmr.sdsc.edu.
Keywords :
biology computing; crystallography; data mining; genetics; proteins; statistical analysis; PAM250 natural mutation frequencies; comprehensive statistical analysis; counter-intuitive inverse relationship; crystallographic mutant selection bias; data-mining; intuitive exploration; mutant gene ontology; online tool; protein databank; protein mutant resource; protein mutation resource data; spectacular conformational change; statistical morph movie analysis; visual morph movie analysis; Amino acids; Chemicals; Crystallography; Drugs; Frequency; Genetic mutations; Motion pictures; Protein engineering; Sequences; Spatial databases;
Conference_Titel :
Bioinformatics Conference, 2003. CSB 2003. Proceedings of the 2003 IEEE
Print_ISBN :
0-7695-2000-6
DOI :
10.1109/CSB.2003.1227317