DocumentCode :
3373768
Title :
Protein secondary structure prediction using data mining tool C5
Author :
Lu, Meiliu ; Zhang, Du ; Xu, Hongjun ; Lau, Ken Tse-Yau ; Lu, Li
Author_Institution :
Dept. of Comput. Sci., California State Univ., Sacramento, CA, USA
fYear :
1999
fDate :
1999
Firstpage :
107
Lastpage :
110
Abstract :
This paper reports our experimental results in protein secondary structure prediction using the machine learning software, C5. The accuracy improvement in the prediction of protein secondary structure is the focus of our study. Starting with a target protein with unknown secondary structures, we investigate three different approaches and find that training cases selected based on sequence homology can achieve the highest predictive accuracy of 75% in testing cases. Our result indicates that the method of selecting proteins for the training cases has the most significant impact on predictive accuracy
Keywords :
biology computing; data mining; molecular configurations; proteins; 3D structure; C5; data mining tool; knowledge discovery; linear amino acid sequence; machine learning; machine learning software; predictive accuracy; protein molecule; protein secondary structure prediction; sequence homology; tertiary structure; training cases; Amino acids; Biochemistry; Coils; Crystallography; Data mining; Nuclear magnetic resonance; Protein engineering; Read only memory; Tellurium; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence, 1999. Proceedings. 11th IEEE International Conference on
Conference_Location :
Chicago, IL
ISSN :
1082-3409
Print_ISBN :
0-7695-0456-6
Type :
conf
DOI :
10.1109/TAI.1999.809774
Filename :
809774
Link To Document :
بازگشت