DocumentCode :
464308
Title :
A Comparison of Sequence Kernels for Localization Prediction of Transmembrane Proteins
Author :
Maetschke, Stefan ; Gallagher, Marcus ; Bodén, Mikael
Author_Institution :
Sch. of Inf. Technol. & Electr. Eng., Queensland Univ., Brisbane, Qld.
fYear :
2007
fDate :
1-5 April 2007
Firstpage :
367
Lastpage :
372
Abstract :
We applied support vector machines to the prediction of the subcellular localization of transmembrane proteins, and compared the performance of different sequence kernels on this task. More specifically we measured prediction accuracy, computation time, number of kernel evaluations and number of support vectors for the spectrum, the full spectrum, the wildcard, the mismatch, the local-alignment and the residue-coupling kernel. The local-alignment achieved the highest prediction accuracy, with a Matthews correlation coefficient of 0.51, closely followed by the mismatch kernel. However, the local-alignment kernel was also the most time consuming kernel and seven times slower than the mismatch kernel. The spectrum kernel was the fastest kernel but linked to the highest number of support vectors and kernel evaluations. The residue-coupling kernel showed the lowest number of support vectors and kernel evaluations. No correlation between the number of support vectors and prediction accuracy could be observed. A localization predictor (TMPLoc) has been made available at http://pprowler.itee.uq.edu.au/TMPLoc
Keywords :
biology computing; proteins; support vector machines; local-alignment kernel; localization prediction; sequence kernels; subcellular localization; support vector machines; transmembrane proteins; Accuracy; Bioinformatics; Biomembranes; Cells (biology); Computational biology; Computational intelligence; Kernel; Protein engineering; Sequences; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Bioinformatics and Computational Biology, 2007. CIBCB '07. IEEE Symposium on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0710-9
Type :
conf
DOI :
10.1109/CIBCB.2007.4221246
Filename :
4221246
Link To Document :
بازگشت