DocumentCode :
574987
Title :
Protein fold prediction using cluster merging
Author :
Phuoc, Ngyuen Quang ; Kim, Sung-Ryul
Author_Institution :
eB Corp., Seoul, South Korea
fYear :
2011
fDate :
Nov. 29 2011-Dec. 1 2011
Firstpage :
293
Lastpage :
298
Abstract :
Protein folding prediction, also called protein structure prediction, is one of the most important issues for understanding living organisms. Therefore, predicting the folding structure of proteins from their linear sequence is a very big challenge in biology. Despite years of research and the wide variety of approaches, protein folding still remains a difficult problem. One of the main difficulties is controlling the over-fitting and under-fitting behavior of classifiers in the prediction systems. In this paper we propose a new learning method to improve the accuracy of protein folding prediction by balancing between over-fitting and under-fitting. The key of this method is based on a special way for analyzing the distance among training data points in order to cluster them into spaces which have high density of data points. By this, the over fitting and under fitting can be controlled in a comprehensive manner. Some experimental results seem to indicate that the proposed method has a significant potential on improve the accuracy of protein folding prediction.
Keywords :
biology computing; learning (artificial intelligence); merging; pattern classification; pattern clustering; proteins; biology; classifier overfitting behavior; classifier underfitting behavior; cluster merging; data point density; learning method; linear sequence; living organisms; protein fold prediction; protein structure prediction; training data points; Accuracy; Amino acids; Feature extraction; Fitting; Proteins; Training; Training data; Classification; Cluster; Over-fitting; Protein folding prediction; Under-fitting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Sciences and Convergence Information Technology (ICCIT), 2011 6th International Conference on
Conference_Location :
Seogwipo
Print_ISBN :
978-1-4577-0472-7
Type :
conf
Filename :
6316623
Link To Document :
بازگشت