مرکز منطقه ای اطلاع رساني علوم و فناوري - A data mining approach to predict protein secondary structure

DocumentCode :

2838170

Title :

A data mining approach to predict protein secondary structure

Author :

Yang, Bingru ; Sui, Haifeng ; Qu Wu ; Wang, Lijun

Author_Institution :

Sch. of Inf. Eng., Univ. of Sci. & Technol. Beijing, Beijing, China

Volume :

fYear :

2010

fDate :

22-24 Oct. 2010

Abstract :

In bioinformatics,proteins are coded by strings, called “primary structures”. Biologists have long enough gathered these primary structures in large databases. Numerous experiments and analyses of primary structures have revealed that the protein primary structure closely correlates with the protein second structure. In this paper, we present a data mining approach based on machine learning techniques to predict protein second structure. Based on majority voting mechanism, the approach combine the predictions of homology analysis classifier, Support vector machine(SVM) classifier and modified Knowledge Discovery in Databases (KDD*) process. They are validated with 2 different datasets. Their predictive accuracy results outperform the best secondary structure predictors by 2.00% on average.

Keywords :

bioinformatics; data mining; learning (artificial intelligence); pattern classification; support vector machines; bioinformatics; data mining; homology analysis classifier; knowledge discovery in databases; machine learning; protein secondary structure; support vector machine; data mining; protein secondary structure; protein structure prediction;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Application and System Modeling (ICCASM), 2010 International Conference on

Conference_Location :

Taiyuan

Print_ISBN :

978-1-4244-7235-2

Electronic_ISBN :

978-1-4244-7237-6

Type :

conf

DOI :

10.1109/ICCASM.2010.5620659

Filename :

5620659

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2838170