DocumentCode
462084
Title
Computational Models for Identifying Promiscuous HLA-B7 Binders based on Information Theory and Support Vector Machine
Author
Zhang, Guang Lan ; Tong, Joo Chuan ; Zhang, Zong Hong ; Zheng, Yun ; Brusic, Vladimir ; August, J. Thomas ; Kwoh, Chee Keong
Author_Institution
Inst. for Infocomm Res., Singapore
fYear
2006
fDate
11-14 Dec. 2006
Firstpage
319
Lastpage
323
Abstract
Computational vaccinology is a developing discipline. To become a standard component in vaccine development, it requires accurate and broadly applicable models of wet-lab experiments. We developed prediction models based on a novel data representation of peptide/MHC interaction and support vector machines (SVM) for prediction of peptides that promiscuously bind to multiple human leukocyte antigen (HLA) alleles belonging to HLA-B7 supertype. 10-fold cross-validation results showed that the area under the receiver operating curve (Aroc) of SVM models is above 0.90. Blind testing results showed that the average Aroc of SVM models is 0.84. A learning approach based on information theory, termed Information Learning Approach, was used for feature selection. Several amino acid positions with high information content have been identified in input 9mer peptides and HLA alleles and were used as input features to SVM. They are position 1, 2, 4, 5, 7, 8, 9 in 9mer peptides and position 45 and 97 in HLA-B7 molecules. Prediction accuracy was improved after feature selection. These positions cover the anchor positions of HLA-B7 alleles, which have important biological roles for successful biding of relevant peptides.
Keywords
information theory; molecular biophysics; physiological models; proteins; support vector machines; HLA-B7 binders; amino acid; binding peptide; blind testing; computational vaccinology; information theory; multiple human leukocyte antigen alleles; support vector machine;
fLanguage
English
Publisher
ieee
Conference_Titel
Biomedical and Pharmaceutical Engineering, 2006. ICBPE 2006. International Conference on
Conference_Location
Singapore
Print_ISBN
978-981-05-79
Electronic_ISBN
81-904262-1-4
Type
conf
Filename
4155916
Link To Document