DocumentCode
3320969
Title
A Double-SVM Classification System for Single and Multiple-Subcellular Localizations of Yeast Proteins Using Sequence Motifs
Author
Zhang, Su ; Yang, Wei ; Wu, Ning ; Chen, Yazhu ; Lu, Hongtao ; Zhang, Zhizhou
Author_Institution
Shanghai Jiao Tong Univ., Shanghai
fYear
2007
fDate
8-11 July 2007
Firstpage
173
Lastpage
176
Abstract
The cellular localization site and the potential functionality of a protein are closely related. In this paper, we develop a novel Double-SVM Classification System for predicting the subcellular localization sites of the proteins. First, a set of features are made from the occurrence frequency of sequence motifs. Then discriminant features are selected by I-RELIEF and used as the inputs of the support vector machine (SVM) for classification. The two classes are single and multiple-subcellular localizations. Due to the large size difference among the protein sequences, we set two SVMs, one for the shorter sequences and the other for the longer ones. This system is applied to predict the subcellular localization sites of Yeast proteins. The experimental result shows that the testing accuracy of the system is 66%, which is higher than that of the traditional single-SVM model.
Keywords
biology computing; pattern classification; proteins; support vector machines; I-RELIEF; cellular localization site; discriminant features; double-SVM classification system; multiple-subcellular localization; protein potential functionality; sequence motifs; support vector machine; yeast proteins; Amino acids; Bioinformatics; Cities and towns; Frequency; Fungi; Genomics; Protein engineering; Protein sequence; Support vector machine classification; Support vector machines; protein subcellular localization; sequence motif; support vector machine;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Acquisition, 2007. ICIA '07. International Conference on
Conference_Location
Seogwipo-si
Print_ISBN
1-4244-1220-X
Electronic_ISBN
1-4244-1220-X
Type
conf
DOI
10.1109/ICIA.2007.4295720
Filename
4295720
Link To Document