DocumentCode
2025035
Title
DM-pred Method: A New Method to Predict Secondary Structures Based on Data Mining Techniques
Author
Fayech, Sondès ; Essoussi, Nadia ; Limam, Mohamed
Author_Institution
LARODEC, Univ. of Tunis, Tunis, Tunisia
fYear
2011
fDate
Aug. 29 2011-Sept. 2 2011
Firstpage
445
Lastpage
449
Abstract
Protein secondary structure prediction is a key step in prediction of protein tertiary structure. There have emerged many methods based on machine learning techniques, such as neural networks (NN) and support vector machines (SVM), to focus on the prediction of the secondary structures. In this paper a new method, DM-pred, was proposed based on a protein clustering method to detect homologous sequences, a sequential pattern mining method to detect frequent patterns, features extraction and quantification approaches to prepare features and SVM method to predict structures. When tested on the most popular secondary structure datasets, DM-pred achieved a Q3 accuracy of 78.20% and a SOV of 76.49% which illustrates that it is one of the top range methods for protein secondary structure prediction.
Keywords
bioinformatics; data mining; learning (artificial intelligence); neural nets; pattern clustering; proteins; support vector machines; DM-pred method; SVM method; data mining; homologous sequence; machine learning; neural network; protein clustering; protein tertiary structure; secondary structure prediction; sequential pattern mining; support vector machine; Amino acids; Data mining; Databases; Protein sequence; Support vector machines; Training; SVM; clustering; features; protein secondary structure prediction; sequential pattern mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Database and Expert Systems Applications (DEXA), 2011 22nd International Workshop on
Conference_Location
Toulouse
ISSN
1529-4188
Print_ISBN
978-1-4577-0982-1
Type
conf
DOI
10.1109/DEXA.2011.27
Filename
6059858
Link To Document