DocumentCode
3373768
Title
Protein secondary structure prediction using data mining tool C5
Author
Lu, Meiliu ; Zhang, Du ; Xu, Hongjun ; Lau, Ken Tse-Yau ; Lu, Li
Author_Institution
Dept. of Comput. Sci., California State Univ., Sacramento, CA, USA
fYear
1999
fDate
1999
Firstpage
107
Lastpage
110
Abstract
This paper reports our experimental results in protein secondary structure prediction using the machine learning software, C5. The accuracy improvement in the prediction of protein secondary structure is the focus of our study. Starting with a target protein with unknown secondary structures, we investigate three different approaches and find that training cases selected based on sequence homology can achieve the highest predictive accuracy of 75% in testing cases. Our result indicates that the method of selecting proteins for the training cases has the most significant impact on predictive accuracy
Keywords
biology computing; data mining; molecular configurations; proteins; 3D structure; C5; data mining tool; knowledge discovery; linear amino acid sequence; machine learning; machine learning software; predictive accuracy; protein molecule; protein secondary structure prediction; sequence homology; tertiary structure; training cases; Amino acids; Biochemistry; Coils; Crystallography; Data mining; Nuclear magnetic resonance; Protein engineering; Read only memory; Tellurium; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Tools with Artificial Intelligence, 1999. Proceedings. 11th IEEE International Conference on
Conference_Location
Chicago, IL
ISSN
1082-3409
Print_ISBN
0-7695-0456-6
Type
conf
DOI
10.1109/TAI.1999.809774
Filename
809774
Link To Document