DocumentCode :
1697075
Title :
Stability of different feature selection methods for selecting protein sequence descriptors in protein solubility classification problem
Author :
Kocbek, Simon ; Stiglic, Gregor ; Pernek, Igor ; Kokol, Peter
Author_Institution :
Fac. of Health Sci., Univ. of Maribor, Maribor, Slovenia
fYear :
2010
Firstpage :
50
Lastpage :
55
Abstract :
Predicting protein solubility has gained lots of intention in the recent years and several descriptors have been defined to describe proteins in these works. Therefore, different feature selection methods have been used for selecting the most important attributes. An empirical study, that aims to explain the relationship between the number of samples and stability of seven different feature selection techniques for protein datasets, is presented.
Keywords :
biology computing; pattern classification; proteins; feature selection method stability; protein datasets; protein sequence descriptors selection; protein solubility classification problem; Amino acids; Correlation; Databases; Numerical stability; Proteins; Stability analysis; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer-Based Medical Systems (CBMS), 2010 IEEE 23rd International Symposium on
Conference_Location :
Perth, WA
ISSN :
1063-7125
Print_ISBN :
978-1-4244-9167-4
Type :
conf
DOI :
10.1109/CBMS.2010.6042613
Filename :
6042613
Link To Document :
بازگشت