DocumentCode :
3101757
Title :
Incorporating Homology Using Multi-Instance Kernel for Protein Subcelluar Localization
Author :
Mei, Suyu ; Fei, Wang
Author_Institution :
Shanghai Key Lab. of Intell. Inf. Process., Fudan Univ., Shanghai, China
fYear :
2010
fDate :
18-20 June 2010
Firstpage :
1
Lastpage :
4
Abstract :
Kernel methods have seen many successful applications in computational biology in recent years, and kernel design is a key step because the kernel defines the similarity between two protein sequences. This paper aims to design a kernel that derives a more accurate similarity between two protein sequences by incorporating homology. Here a homologous sequence is viewed as one evolutionary instance of the target sequence, and all homologous sequences constitute one homology bag. A k-mer based spectrum kernel defines the similarity between any two instances, and the multi-instance kernel is defined as the sum of the instance-wise spectrum kernels; it is called the homology-based multi-instance kernel (HoMIKernel). By varying the k-mer size and compressing the 20 amino acids, we derive several HoMIKernels, which are combined as HoMIKernel+ to capture more contextual information and cover size-varying motifs. We evaluate HoMIKernel+ on three unbalanced eukaryotic benchmark datasets. The experiments show that HoMIKernel+ achieves better predictive performance than the baseline models, and that incorporating homologous sequences does increase predictive accuracy.
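The kernel construction described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the unweighted sum used to combine kernels, and the `groups` mapping standing in for amino-acid compression are all assumptions made for illustration, since the abstract does not specify the combination rule, the reduced alphabets, or any normalization.

```python
from collections import Counter


def kmer_counts(seq, k):
    """Count every overlapping k-mer in a sequence."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def spectrum_kernel(s, t, k):
    """k-mer spectrum kernel: inner product of the two k-mer count vectors."""
    cs, ct = kmer_counts(s, k), kmer_counts(t, k)
    return sum(count * ct[kmer] for kmer, count in cs.items())


def multi_instance_kernel(bag_a, bag_b, k):
    """Sum of instance-wise spectrum kernels over two homology bags."""
    return sum(spectrum_kernel(a, b, k) for a in bag_a for b in bag_b)


def compress(seq, groups):
    """Map residues onto a reduced alphabet.

    `groups` is a hypothetical residue -> group-symbol mapping; residues
    missing from the mapping are kept unchanged.
    """
    return "".join(groups.get(ch, ch) for ch in seq)


def combined_kernel(bag_a, bag_b, settings):
    """Combine several multi-instance kernels.

    `settings` is a list of (k, groups) pairs. An unweighted sum is an
    assumption made here, not a detail taken from the abstract.
    """
    total = 0
    for k, groups in settings:
        a = [compress(s, groups) for s in bag_a]
        b = [compress(s, groups) for s in bag_b]
        total += multi_instance_kernel(a, b, k)
    return total
```

Each bag would hold a target sequence together with its homologs, so `multi_instance_kernel(bag_a, bag_b, 3)` compares two proteins through all pairs of their evolutionary instances. The resulting Gram matrix could then be passed to any kernel classifier that accepts a precomputed kernel.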
Keywords :
biological techniques; biology computing; cellular biophysics; molecular biophysics; molecular configurations; proteins; HoMIKernels; K-mer based spectrum kernel; homologous sequence; homology bag; instance-wise spectrum kernels; k-mer size; multiinstance kernel; protein sequences; protein subcelluar localization; size-varying motifs; unbalanced eukaryotic benchmark dataset; Amino acids; Data mining; Drugs; Feature extraction; Kernel; Laboratories; Phylogeny; Predictive models; Protein sequence; Shape;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Bioinformatics and Biomedical Engineering (iCBBE), 2010 4th International Conference on
Conference_Location :
Chengdu
ISSN :
2151-7614
Print_ISBN :
978-1-4244-4712-1
Electronic_ISBN :
2151-7614
Type :
conf
DOI :
10.1109/ICBBE.2010.5515543
Filename :
5515543