DocumentCode :
119938
Title :
Analysis and comparison of the kernel accuracy for saccharomyces genus protein sequence classification
Author :
Hyo Jung Chun ; Dong Woo Hong ; Taeseon Yoon
Author_Institution :
Hankuk Acad. of Foreign Studies, Yongin, South Korea
fYear :
2014
fDate :
16-19 Feb. 2014
Firstpage :
228
Lastpage :
233
Abstract :
Even as the modern protein sequencing technology is uncovering great amount of new protein, such data are futile without knowledge of the functions the protein sequences encode. To overcome the limitation of experimental analysis, utilizing Support Vector Machine and kernel methods for functional prediction of unannotated protein has become a promising topic of research in the field of computational biology, inducing many researchers to develop kernels with improved accuracy and efficiency. In this paper, we assigned the Gaussian, Polynomial, and Normal kernels to each Sensu Stricto, Sensu Lato, and Petite-Negative groups of the Saccharomyces fungus species, for which the kernel showed the greatest accuracy in protein sequence classification. From the result we discovered the sequential shapes of the proteins and detected similarities of the structural linearity among the proteins species belonging in the same group. The resulting data allow us to provide an important categorization of kernels that will predict protein function with the greatest accuracy depending on the group of the Saccharomyces the protein belongs for future researches using sequential analysis for prediction, into which you can type your own text.
Keywords :
Gaussian processes; bioinformatics; genomics; microorganisms; pattern classification; proteins; support vector machines; Gaussian kernel; Saccharomyces fungus species; Saccharomyces genus protein sequence classification; computational biology; functional unannotated protein prediction; kernel method accuracy improvement; kernel method efficiency improvement; normal kernel; petite-negative group; polynomial kernel; protein function prediction; protein sequencing technology; sensu lato group; sensu stricto group; sequential prediction analysis; sequential protein shapes; structural linearity; support vector machine; Accuracy; Kernel; Polynomials; Proteins; Shape; Support vector machines; Vectors; Function prediction; Kernel methods; Protein Sequence; Support Vector Machine;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Communication Technology (ICACT), 2014 16th International Conference on
Conference_Location :
Pyeongchang
Print_ISBN :
978-89-968650-2-5
Type :
conf
DOI :
10.1109/ICACT.2014.6778954
Filename :
6778954
Link To Document :
بازگشت