Title :
Embedded Gene Selection for Imbalanced Microarray Data Analysis
Author :
Li, Guo-Zheng ; Meng, Hao-Hua ; Ni, Jun
Author_Institution :
Dept. of Control Sci. & Eng., Tongji Univ., Shanghai
Abstract :
Most of microarray data sets are imbalanced, i.e. the number of positive examples is much less than that of negative, which will hurt performance of classifiers when it is used for tumor classification. Though it is critical, few previous works paid attention to this problem. Here we propose embedded gene selection with two algorithms i.e. EGSEE (Embedded Gene Selection for EasyEnsemble) and EGSIEE (Embedded Gene Selection for Individuals of EasyEnsemble) to treat this problem and improve generalization performance of the EasyEnsemble classifier. Experimental results on several microarray data sets show that compared with the previous two filter feature selection methods, EGSEE and EGSIEE obtain better performance.
Keywords :
cancer; cellular biophysics; genetics; medical diagnostic computing; tumours; EGSEE; EGSIEE; embedded gene selection; imbalanced microarray data analysis; tumor classification; Cities and towns; Data analysis; Data engineering; Embedded computing; Filters; Gene expression; Neoplasms; Pattern classification; Radio control; Radiology; Embedded feature selection; Gene selection; Imbanlance problem; Microarray analysis; ensemble;
Conference_Titel :
Computer and Computational Sciences, 2008. IMSCCS '08. International Multisymposiums on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3430-5
DOI :
10.1109/IMSCCS.2008.33