Title :
Computational prediction of continuous B-cell epitopes using random forest classifier
Author :
Kavitha, K.V. ; Saritha, R. ; Vinod, Chandra S. S.
Author_Institution :
Dept. of Comput. Sci., Coll. of Eng., Trivandrum, India
Abstract :
Immune responses protect against infection by microbes like viruses, bacteria, fungi and other parasites and against entry of non-microbial foreign matter. Any foreign matter which enters the body that can induce the immune system to produce a corresponding antibody is called an antigen. Antibodies are specific proteins produced to destroy antigen. A portion of antigen which can bond with the antigen binding site of the antibody is called B-cell epitope or antigenic determinant. These epitopes play a vital role in the development of peptide vaccines, in diagnosis of diseases, immune based cancer therapies and also for allergy research. Computational prediction of the presence and location of B-cell epitopes is a challenging task in the field of immunology. This paper proposes a new data mining model for prognostication of continuous B-cell epitopes by combining amino acid propensity scales and di, tri and tetra peptide immunogenicity scales. Principal Component Analysis, a dimensionality reduction technique was used to filter features that do not play a major role in epitope prediction. Finally the prediction was performed by Random Forest Classifier. Experimental results have shown that RF classifier improved the prediction accuracy compared to existing prediction algorithms like AAP, AAT-fs and BCPred.
Keywords :
cancer; cellular biophysics; data mining; learning (artificial intelligence); medical computing; microorganisms; patient diagnosis; pattern classification; principal component analysis; proteins; RF classifier; allergy research; amino acid propensity scales; antibody; antigenic determinant; bacteria; computational prediction; continuous B-cell epitopes; data mining model; dimensionality reduction technique; dipeptide immunogenicity scales; diseases diagnosis; epitope prediction; features filtering; fungi; immune based cancer therapies; immune responses; immune system; immunology; infection protection; microbes; nonmicrobial foreign matter; parasites; peptide vaccines development; prediction accuracy; prediction algorithms; principal component analysis; prognostication; proteins; random forest classifier; tetra peptide immunogenicity scales; tripeptide immunogenicity scales; viruses; Accuracy; Amino acids; Feature extraction; Immune system; Peptides; Proteins; Support vector machines;
Conference_Titel :
Computing, Communications and Networking Technologies (ICCCNT),2013 Fourth International Conference on
Conference_Location :
Tiruchengode
Print_ISBN :
978-1-4799-3925-1
DOI :
10.1109/ICCCNT.2013.6726820