DocumentCode :
3320969
Title :
A Double-SVM Classification System for Single and Multiple-Subcellular Localizations of Yeast Proteins Using Sequence Motifs
Author :
Zhang, Su ; Yang, Wei ; Wu, Ning ; Chen, Yazhu ; Lu, Hongtao ; Zhang, Zhizhou
Author_Institution :
Shanghai Jiao Tong Univ., Shanghai
fYear :
2007
fDate :
8-11 July 2007
Firstpage :
173
Lastpage :
176
Abstract :
The cellular localization site and the potential functionality of a protein are closely related. In this paper, we develop a novel Double-SVM Classification System for predicting the subcellular localization sites of the proteins. First, a set of features are made from the occurrence frequency of sequence motifs. Then discriminant features are selected by I-RELIEF and used as the inputs of the support vector machine (SVM) for classification. The two classes are single and multiple-subcellular localizations. Due to the large size difference among the protein sequences, we set two SVMs, one for the shorter sequences and the other for the longer ones. This system is applied to predict the subcellular localization sites of Yeast proteins. The experimental result shows that the testing accuracy of the system is 66%, which is higher than that of the traditional single-SVM model.
Keywords :
biology computing; pattern classification; proteins; support vector machines; I-RELIEF; cellular localization site; discriminant features; double-SVM classification system; multiple-subcellular localization; protein potential functionality; sequence motifs; support vector machine; yeast proteins; Amino acids; Bioinformatics; Cities and towns; Frequency; Fungi; Genomics; Protein engineering; Protein sequence; Support vector machine classification; Support vector machines; protein subcellular localization; sequence motif; support vector machine;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Acquisition, 2007. ICIA '07. International Conference on
Conference_Location :
Seogwipo-si
Print_ISBN :
1-4244-1220-X
Electronic_ISBN :
1-4244-1220-X
Type :
conf
DOI :
10.1109/ICIA.2007.4295720
Filename :
4295720
Link To Document :
بازگشت