DocumentCode :
2306449
Title :
Protein secondary structure prediction with high accuracy using Support Vector Machine
Author :
Shoyaib, Mohammad ; Baker, Syed Murtuza ; Jabid, Taskeed ; Anwar, Firoz ; Khan, Haseena
Author_Institution :
Inst. of Inf. Technol., Univ. of Dhaka, Dhaka
fYear :
2007
fDate :
27-29 Dec. 2007
Firstpage :
1
Lastpage :
4
Abstract :
Mining bioinformatics data is an emerging area of research. Proteomics is one of the largest areas of focus in bioinformatics and data mining research. Protein structure prediction is one of the most crucial and decisive problem in all the areas of research. Protein secondary structure can be used for the determination of the tertiary structure via the fold recognition method. Hence, predicting the secondary structures from the proteinpsilas primary sequences has attracted the attention of many researchers. Experimental methods have proved to be complex and expensive. So to develop a simple and accurate method for structure prediction is of great importance. In this paper, a new method has been proposed based on the machine learning technique. The first step of this proposal is to find out frequent patterns of consecutive amino acids in a protein database. After this, a set of frequent words (feature set) is found. Then support vector machine (SVM) is used as a binary/tertiary classifier for the classification of protein secondary structure with these frequent words.
Keywords :
biology computing; data mining; support vector machines; SVM; binary-tertiary classifier; data mining bioinformatics; fold recognition method; protein secondary structure prediction; support vector machine; tertiary structure; Accuracy; Amino acids; Bioinformatics; Data mining; Machine learning; Proposals; Proteins; Proteomics; Support vector machine classification; Support vector machines; Protein; Secondary Structure; Support Vector Machine; amino acid;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and information technology, 2007. iccit 2007. 10th international conference on
Conference_Location :
Dhaka
Print_ISBN :
978-1-4244-1550-2
Electronic_ISBN :
978-1-4244-1551-9
Type :
conf
DOI :
10.1109/ICCITECHN.2007.4579365
Filename :
4579365
Link To Document :
بازگشت