Title :
A Double-SVM Classification System for Single and Multiple-Subcellular Localizations of Yeast Proteins Using Sequence Motifs
Author :
Zhang, Su ; Yang, Wei ; Wu, Ning ; Chen, Yazhu ; Lu, Hongtao ; Zhang, Zhizhou
Author_Institution :
Shanghai Jiao Tong Univ., Shanghai
Abstract :
The cellular localization site and the potential functionality of a protein are closely related. In this paper, we develop a novel Double-SVM Classification System for predicting the subcellular localization sites of the proteins. First, a set of features are made from the occurrence frequency of sequence motifs. Then discriminant features are selected by I-RELIEF and used as the inputs of the support vector machine (SVM) for classification. The two classes are single and multiple-subcellular localizations. Due to the large size difference among the protein sequences, we set two SVMs, one for the shorter sequences and the other for the longer ones. This system is applied to predict the subcellular localization sites of Yeast proteins. The experimental result shows that the testing accuracy of the system is 66%, which is higher than that of the traditional single-SVM model.
Keywords :
biology computing; pattern classification; proteins; support vector machines; I-RELIEF; cellular localization site; discriminant features; double-SVM classification system; multiple-subcellular localization; protein potential functionality; sequence motifs; support vector machine; yeast proteins; Amino acids; Bioinformatics; Cities and towns; Frequency; Fungi; Genomics; Protein engineering; Protein sequence; Support vector machine classification; Support vector machines; protein subcellular localization; sequence motif; support vector machine;
Conference_Titel :
Information Acquisition, 2007. ICIA '07. International Conference on
Conference_Location :
Seogwipo-si
Print_ISBN :
1-4244-1220-X
Electronic_ISBN :
1-4244-1220-X
DOI :
10.1109/ICIA.2007.4295720