DocumentCode
2454336
Title
A method for improving the accuracy of predicting protein localization
Author
Wang, Tong ; Huang, Qinghua ; Yao, Lixiu
Author_Institution
Inst. of Comput. & Inf., Polytech. Univ., Shanghai, China
fYear
2010
fDate
24-27 Aug. 2010
Firstpage
1319
Lastpage
1322
Abstract
In this paper, a system based on the novel Maximum Variance Projection (MVP) is proposed to improve the performance of protein subcellular localization prediction. Firstly, the protein sequences are quantized into a high dimension space using a new representation approach Position-Specific Score Matrix (PSSM). However, the problems caused by such representation are computation complexity and complicated classifier design. To sort out this problem, a new dimension reduction algorithm, MVP, is introduced. It extracts the essential features from the high dimension feature space. Then, K-Nearest Neighbor (K-NN) classifier is employed to recognize the subcellular localization of proteins according to the new features after dimension reduction. A good experimental result is obtained based on the jackknife dataset.
Keywords
bioinformatics; cellular biophysics; computational complexity; proteins; K-Nearest Neighbor classifier; Maximum Variance Projection; Position-Specific Score Matrix; accuracy; classifier design; computation complexity; jackknife dataset; protein subcellular localization prediction; Accuracy; Amino acids; Classification algorithms; Feature extraction; Prediction algorithms; Principal component analysis; Proteins; Dimensionality reduction; PSSM; Subcellular localization;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Education (ICCSE), 2010 5th International Conference on
Conference_Location
Hefei
Print_ISBN
978-1-4244-6002-1
Type
conf
DOI
10.1109/ICCSE.2010.5593736
Filename
5593736
Link To Document