DocumentCode :
3209316
Title :
MVP algorithm based prediction method for virulent proteins in bacterial pathogens
Author :
Wang, Tong ; Xia, Tian ; Huang, Qingbua
Author_Institution :
Inst. of Comput. & Inf., Shanghai Second Polytech. Univ., Shanghai, China
Volume :
2
fYear :
2010
fDate :
13-14 Sept. 2010
Firstpage :
308
Lastpage :
311
Abstract :
Identifying whether the uncharacterized protein belongs to a virulent protein or not is important. If it is virulent protein, it is very useful for studying its virulence mechanisms in pathogens as well as designing antiviral drugs. Particularly, with a large number of virulent protein sequences discovered in recent years, it is urgent to develop an automated method to predict the bacterial virulent proteins. In this work, a sequence encoding scheme based on combing DC (Dipeptide Composition) and PseAA (Pseudo Amino Acid) is introduced to represent protein samples. However, this sequence encoding scheme would correspond to a very high dimensional feature vector. A DR (Dimensionality Reduction) algorithm, the so-called MVP (Maximum variance projection) is introduced to extract the key features from the high-dimensional space and reduce the original high-dimensional vector to a lower-dimensional one. Finally, our jackknife test results thus obtained are quite encouraging, which indicate that the above method is used effectively to deal with this complicated problem of predicting virulent proteins in bacterial pathogens.
Keywords :
biology computing; drugs; feature extraction; genomics; microorganisms; proteins; MVP algorithm; PseAA; antiviral drug design; automated method; bacterial pathogen; combing DC; dimensionality reduction algorithm; dipeptide composition; features extraction; jackknife test; maximum variance projection; prediction method; pseudo amino acid; sequence encoding; uncharacterized protein; virulent protein sequence; Biological information theory; Educational institutions; Electronic mail; Encoding; DC; DR; MVP; PseAA; Virulent proteins;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Natural Computing Proceedings (CINC), 2010 Second International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-7705-0
Type :
conf
DOI :
10.1109/CINC.2010.5643726
Filename :
5643726
Link To Document :
بازگشت