Title :
Multivariate filter and PSO in protein function classification
Author :
Abdul-Rahman, Shuzlina ; Mohamed-Hussein, Zeti-Azura ; Bakar, Azuraliza Abu
Author_Institution :
Fac. of Inf. Sci. & Technol., Center for Artificial Intell. Technol. (CAIT), Bangi, Malaysia
Abstract :
Protein features are often complex, and they are challenging to classify. In identifying the most discriminatory features in protein sequences, we propose a new feature-selection strategy by integrating the multivariate filter and Particle Swarm Optimisation (PSO) algorithms. Experimental results, based on the number of reducts and classification accuracy, were analysed in both the filter and wrapper phases. For our dataset, the proposed method statistically significantly improves the obtained classification accuracy and reduces the number of feature subsets. In the filter phase, the accuracy is improved more than 4% in three out of four multivariate feature selection methods compared to a model without feature selection. In the second phase, the accuracy is increased from 97.51% to 100%. We also demonstrate the importance of the correct parameter settings in the PSO to guarantee good performance.
Keywords :
feature extraction; molecular biophysics; particle swarm optimisation; pattern classification; proteins; PSO algorithm; classification accuracy; feature selection strategy; filter phase; multivariate feature selection method; multivariate filter; particle swarm optimisation algorithm; protein feature; protein function classification; protein sequence; wrapper phase; Accuracy; Classification algorithms; Correlation; Entropy; Filtering algorithms; Optimization; Proteins; Feature Selection; Multivariate Filter; Particle Swarm Optimisation; Protein Sequences;
Conference_Titel :
Soft Computing and Pattern Recognition (SoCPaR), 2010 International Conference of
Conference_Location :
Paris
Print_ISBN :
978-1-4244-7897-2
DOI :
10.1109/SOCPAR.2010.5686158