Title :
Stability of different feature selection methods for selecting protein sequence descriptors in protein solubility classification problem
Author :
Kocbek, Simon ; Stiglic, Gregor ; Pernek, Igor ; Kokol, Peter
Author_Institution :
Fac. of Health Sci., Univ. of Maribor, Maribor, Slovenia
Abstract :
Predicting protein solubility has gained lots of intention in the recent years and several descriptors have been defined to describe proteins in these works. Therefore, different feature selection methods have been used for selecting the most important attributes. An empirical study, that aims to explain the relationship between the number of samples and stability of seven different feature selection techniques for protein datasets, is presented.
Keywords :
biology computing; pattern classification; proteins; feature selection method stability; protein datasets; protein sequence descriptors selection; protein solubility classification problem; Amino acids; Correlation; Databases; Numerical stability; Proteins; Stability analysis; Support vector machines;
Conference_Titel :
Computer-Based Medical Systems (CBMS), 2010 IEEE 23rd International Symposium on
Conference_Location :
Perth, WA
Print_ISBN :
978-1-4244-9167-4
DOI :
10.1109/CBMS.2010.6042613