Title :
Gene selection by using an improved Fast Correlation-Based Filter
Author :
Zeng, Xue-Qiang ; Li, Guo-Zheng ; Chen, Su-Fen
Author_Institution :
Comput. Center, Nanchang Univ., Nanchang, China
Abstract :
Among various redundancy based gene selection methods, the Fast Correlation-Based Filter (FCBF) is one of the most effective. FCBF works in an iterative way, where one predominant feature is selected at each step and then some redundant features are removed by the selected one. However, the size of selected feature subset is not considered by FCBF, and weakly relevant features are too inclined to be eliminated. Aiming at this problem, this paper proposes a new approximate Markov blanket definition for FCBF, which strengthens the criterion for redundant features. Based on the new definition, the size of the selected feature set is used to adjust the criterion dynamically. Experimental results on several real gene data sets demonstrated the outstanding performance of the proposed algorithm compared with other several state-of-arts techniques.
Keywords :
bioinformatics; data mining; genetics; Markov blanket definition; fast correlation-based filter; gene data sets; gene selection method; redundant features; state-of-arts techniques; FCBF; Feature Selection; Redundant Feature;
Conference_Titel :
Bioinformatics and Biomedicine Workshops (BIBMW), 2010 IEEE International Conference on
Conference_Location :
Hong, Kong
Print_ISBN :
978-1-4244-8303-7
Electronic_ISBN :
978-1-4244-8304-4
DOI :
10.1109/BIBMW.2010.5703874