Title :
Prediction of O-linked glycosylation sites in protein by independent component analysis
Author :
Wang, Chu-Zheng ; Tan, Xiao-Feng ; Chen, Yen-wei ; Ito, Masahiro ; Nishikawa, Ikuko
Author_Institution :
Coll. of Comput. & Inf. Eng., Central South Univ. of Forestry & Technol., Changsha, China
Abstract :
Glycosylation is one of the most important post-translation modifications steps in eukaryotic cell. In this paper, we propose a new approach based on independent component analysis (ICA) for prediction O-linked glycosylation site and pattern analysis. Principal component analysis (PCA) is first used to find significant uncorrelated components, and then ICA is used to extract independent components to construct a subspace (main basis) of protein sequence. The prediction is viewed as a 2-classes classification problem. The test protein vector is projected to each subspace. The protein sequence is classified into the nearest class by calculating the distance between the test vector and its projection on the subspace. The prediction accuracy of our proposed new approach is higher than that of other subspace methods based on PCA.
Keywords :
biology computing; independent component analysis; molecular biophysics; molecular configurations; principal component analysis; proteins; 2-classes classification problem; O-linked glycosylation sites; independent component analysis; pattern analysis; principal component analysis; protein sequence; protein vector; subspace methods; Accuracy; Feature extraction; Image coding; Principal component analysis; Protein sequence; Support vector machine classification; O-glycosylation; independent component analysis; pattern analysis; positional probability function;
Conference_Titel :
Signal Processing Systems (ICSPS), 2010 2nd International Conference on
Conference_Location :
Dalian
Print_ISBN :
978-1-4244-6892-8
Electronic_ISBN :
978-1-4244-6893-5
DOI :
10.1109/ICSPS.2010.5555569