Title :
Predicting the Subcellular Localization of Proteins with Multiple Sites Based on N-Terminal Signals
Author :
Xumi Qu ; Yuehui Chen ; Shanping Qiao
Author_Institution :
Sch. of Inf. Sci. & Eng., Univ. of Jinan, Jinan, China
Abstract :
Sub cellular localization of proteins is an important attribute in bioinformatics, closely related to its functions, signal transduction and biological process. In this research field, great progress has been made in recent years. However, some shortcomings still exist in the prediction methods. Such as the extracted features information is not complete enough to achieve a higher prediction accuracy rate, some important protein information and the correlation of the amino acid sequence are usually ignored and so on. Some proteins do not have only one location, they may have two locations or three and even more, but were considered to have only one location. In this study, we divide a protein sequence into two parts according to its N-terminal sorting signals and extract their pseudo amino acid composition features respectively. And then we use the multi-label KNN, shorted for ML-KNN to deal with the proteins which have two, three or even more locations. The results are satisfied by Jack Knife test.
Keywords :
bioinformatics; proteins; ML-KNN; N-terminal sorting signals; amino acid sequence; bioinformatics; biological process; extracted features information; jack knife test; multilabel KNN; multiple sites; prediction accuracy; protein information; protein subcellular localization prediction; pseudo amino acid composition features; signal transduction; Accuracy; Amino acids; Educational institutions; Feature extraction; Proteins; Sorting; Vectors; ML-KNN; N-terminal sorting signals; pseudo amino acid composition; subcellular localization;
Conference_Titel :
Information Science and Cloud Computing Companion (ISCC-C), 2013 International Conference on
Conference_Location :
Guangzhou
DOI :
10.1109/ISCC-C.2013.101