DocumentCode :
78617
Title :
Quantifying Significance of MHC II Residues
Author :
Ying Fan ; Ruoshui Lu ; Lusheng Wang ; Andreatta, Massimo ; Shuai Cheng Li
Author_Institution :
Dept. of Comput. Sci., City Univ. of Hong Kong, Kowloon, China
Volume :
11
Issue :
1
fYear :
2014
fDate :
Jan.-Feb. 2014
Firstpage :
17
Lastpage :
25
Abstract :
The major histocompatibility complex (MHC), a cell-surface protein mediating immune recognition, plays important roles in the immune response system of all higher vertebrates. MHC molecules are highly polymorphic and they are grouped into serotypes according to the specificity of the response. It is a common belief that a protein sequence determines its three dimensional structure and function. Hence, the protein sequence determines the serotype. Residues play different levels of importance. In this paper, we quantify the residue significance with the available serotype information. Knowing the significance of the residues will deepen our understanding of the MHC molecules and yield us a concise representation of the molecules. In this paper we propose a linear programming-based approach to find significant residue positions as well as quantifying their significance in MHC II DR molecules. Among all the residues in MHC II DR molecules, 18 positions are of particular significance, which is consistent with the literature on MHC binding sites, and succinct pseudo-sequences appear to be adequate to capture the whole sequence features. When the result is used for classification of MHC molecules with serotype assigned by WHO, a 98.4 percent prediction performance is achieved. The methods have been implemented in java (http://code.google.com/p/quassi/).
Keywords :
biochemistry; cellular biophysics; linear programming; molecular biophysics; proteins; MHC II DR molecules; MHC II residues; MHC binding sites; MHC molecules; cell-surface protein mediating immune recognition; high polymorphism; high vertebrates; immune response system; linear programming-based approach; major histocompatibility complex; molecule representation; prediction performance; protein sequence; residue positions; serotype information; serotypes; succinct pseudosequences; three dimensional structure; whole sequence features; Amino acids; Bioinformatics; Immune system; Integer linear programming; Mathematical model; Peptides; Vectors; MHC II molecules; integer linear programming; quantify significant residues (positions);
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2013.138
Filename :
6654172
Link To Document :
بازگشت