Title :
The Painter´s Feature Selection for Gene Expression Data
Author :
Apiletti, D. ; Baralis, E. ; Bruno, G. ; Fiori, A.
Author_Institution :
Politec. di Torino, Turin
Abstract :
Feature selection is a fundamental task in microarray data analysis. It aims at identifying the genes which are mostly associated with a tissue category, disease state or clinical outcome. An effective feature selection reduces computation costs and increases classification accuracy. This paper presents a novel multi-class approach to feature selection for gene expression data, which is called Painter´s approach. It has the benefits of both a parameter free technique and a native multi- category method. It consists of two phases. The first is a filtering phase that smooths the effect of noise and outliers, which represent a common problem in microarray data. In the second phase, the actual gene selection is performed. Preliminary experimental results on three public datasets are presented. They confirm the intuition of the proposed approach leading to high classification accuracies.
Keywords :
biological techniques; biology computing; feature extraction; genetics; pattern classification; Painter´s feature selection; classification accuracy; gene expression data; gene identification; microarray data analysis; multiclass approach; parameter free technique; Biology computing; Computational efficiency; DNA; Data analysis; Diseases; Filtering; Gene expression; Performance analysis; Phase noise; Testing; Animals; Gene Expression Profiling; Humans; Models, Theoretical; Oligonucleotide Array Sequence Analysis; Predictive Value of Tests; Software;
Conference_Titel :
Engineering in Medicine and Biology Society, 2007. EMBS 2007. 29th Annual International Conference of the IEEE
Conference_Location :
Lyon
Print_ISBN :
978-1-4244-0787-3
DOI :
10.1109/IEMBS.2007.4353269