DocumentCode :
3107498
Title :
Prediction of O-Glycosylation Sites in Protein Sequence by Kernel Principal Component Analysis
Author :
Yang, Xue-Mei ; Cui, Xue-Wei ; Yang, Xue-Zhu
Author_Institution :
Coll. of Math. & Inf. Sci., Xianyang Normal Univ., Xianyang, China
fYear :
2010
fDate :
26-28 Sept. 2010
Firstpage :
267
Lastpage :
270
Abstract :
O-glycosylation is one of the main types of the mammalian protein glycosylation, it occurs on the particular site of serine and threonine. It´s important to predict the O-glycosylation site. In this paper, we propose a new method of kernel principal component analysis (KPCA) to predict the O-glycosylation site with window size w=9. The samples for experiment are encoded by the sparse coding and projected into kernel space first, then the features are extracted by PCA, at last the classification is done by Mahanalobis distance. The result of experiments shows that the proposed method of KPCA is more effective and accurate than PCA. The prediction accuracy is about 84.5%.
Keywords :
bioinformatics; feature extraction; pattern classification; principal component analysis; O-glycosylation site prediction; features extraction; kernel principal component analysis; mammalian protein glycosylation; protein sequence; sparse coding; Accuracy; Encoding; Kernel; Principal component analysis; Protein sequence; Training; KPCA; classification; glycosylation; prediction; protein; sparse coding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Aspects of Social Networks (CASoN), 2010 International Conference on
Conference_Location :
Taiyuan
Print_ISBN :
978-1-4244-8785-1
Type :
conf
DOI :
10.1109/CASoN.2010.68
Filename :
5636903
Link To Document :
بازگشت