DocumentCode :
2767170
Title :
Function prediction for in silico protein mutagenesis using transduction and active learning
Author :
Basit, Nada ; Wechsler, Harry
Author_Institution :
Dept. of Comput. Sci., George Mason Univ., Fairfax, VA, USA
fYear :
2011
fDate :
12-15 Nov. 2011
Firstpage :
939
Lastpage :
940
Abstract :
As wet lab mutagenesis is expensive and time consuming one would use instead in silico mutagenesis. Towards that end, this paper proposes a novel method, T-RF-AL, for in silico mutagenesis. The method combines the merits of transduction, random forests, and active learning, the latter driven by a criterion of maximum curiosity. The feasibility of the T-RF-AL is shown on predicting mutant activity for HIV-1 Protease (HIV-1) and Bacteriophage T4 Lysozyme (T4) datasets. The new method, incremental in nature, compares favorably against random forests using the same active learning criteria, that of maximum curiosity. The observed advantages include better prediction accuracy that is achieved faster and using less training data.
Keywords :
biochemistry; biology computing; cellular biophysics; enzymes; genetics; learning (artificial intelligence); molecular biophysics; HIV-1 protease; active learning; bacteriophage T4 lysozyme datasets; function prediction; in silico protein mutagenesis; mutant activity; random forests; transduction; wet lab mutagenesis; Accuracy; Amino acids; Bioinformatics; Learning systems; Protein engineering; Proteins; active learning; enzyme mutant activity; incremental learning; maximum curiosity; protein function prediction; random forests; transduction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine Workshops (BIBMW), 2011 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4577-1612-6
Type :
conf
DOI :
10.1109/BIBMW.2011.6112511
Filename :
6112511
Link To Document :
بازگشت