DocumentCode :
2034655
Title :
Protein classification using family profiles
Author :
Li, YuGang ; Lu, Yao ; Zhang, Fa ; Qiu, ZhenGe ; Liu, Zhiyong
Author_Institution :
Beijing Key Lab. for Intell. Inf. Technol., Beijing Inst. of Technol., Beijing, China
Volume :
5
fYear :
2010
fDate :
10-12 Aug. 2010
Firstpage :
2212
Lastpage :
2216
Abstract :
Protein classification plays an important role in the research in Bioinformatics. Many discriminative methods, including the SVM based algorithms are used to do this job. In order to use these methods, variable length protein sequences must be converted into fixed-length dimensional vectors. The current work presents a new method of converting sequences into vectors. The method first constructs profile sequences for each protein domain family, then the alignment values of every family profile sequence with a single protein sequence, is used as the protein´s according vectors. Then classification algorithms are used to train and predict protein sequences involved. Experiments were presented to test the ability of the SVM algorithm and the LS_StaticEField algorithm to recognize previously unknown sequences via this converting method. Experimental results show that the converting method is good enough and that the LS_StaticEField algorithm is comparable with the SVM one.
Keywords :
bioinformatics; pattern classification; support vector machines; LS_StaticEField algorithm; SVM algorithm; bioinformatics; family profiles; fixed-length dimensional vectors; protein classification; support vector machines; variable length protein sequences; Classification algorithms; Hidden Markov models; Kernel; Proteins; Support vector machines; Training; Bioinformatics; Profile sequence; SCOP; SVM;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
Conference_Location :
Yantai, Shandong
Print_ISBN :
978-1-4244-5931-5
Type :
conf
DOI :
10.1109/FSKD.2010.5569543
Filename :
5569543
Link To Document :
بازگشت