DocumentCode :
3519981
Title :
Structure Based Functional Analysis of Bacteriophage f1 Gene V Protein
Author :
Masso, Majid ; Mathe, Ewy ; Parvez, Nida ; Hijazi, Kahkeshan ; Vaisman, Iosif I.
Author_Institution :
Lab. for Struct. Bioinf., George Mason Univ., Manassas, VA
fYear :
2008
fDate :
3-5 Nov. 2008
Firstpage :
402
Lastpage :
406
Abstract :
A computational mutagenesis methodology utilizing a four-body, knowledge-based, statistical contact potential is applied toward globally quantifying relative structural changes (residual scores) in bacteriophage f1 gene V protein (GVP) due to single amino acid residue substitutions. We show that these residual scores correlate well with experimentally measured relative changes in protein function caused by the mutations. For each mutant, the approach also yields local measures of environmental perturbation occurring at every residue position (residual profile) in the protein. Implementation of the random forest algorithm, utilizing experimental GVP mutants whose feature vector components include environmental changes at the mutated position and at six nearest neighbors, correctly classifies mutants based on function with up to 72% accuracy while achieving 0.77 area under the receiver operating characteristic curve and a 0.42 correlation coefficient. An optimally trained random forest model is subsequently used to infer function for all remaining unexplored GVP mutants.
Keywords :
bioinformatics; biological techniques; knowledge based systems; microorganisms; proteins; proteomics; statistics; bacteriophage f1 GVP; bacteriophage f1 gene V protein; computational mutagenesis methodology; environmental perturbation; feature vector components; four body knowledge based statistical contact potential; protein function changes; protein mutations; relative structural change quantification; residual score quantification; single amino acid residue substitution; structure based functional analysis; Amino acids; Bioinformatics; Biomedical computing; Biomedical measurements; DNA; Functional analysis; Genetic mutations; Laboratories; Protein engineering; USA Councils; Delaunay tessellation; Gene V protein; computational mutagenesis; random forest supervised learning; statistical potential;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine, 2008. BIBM '08. IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-0-7695-3452-7
Type :
conf
DOI :
10.1109/BIBM.2008.14
Filename :
4684928
Link To Document :
بازگشت