Title :
Margin based variable importance for random forest
Author :
Yang Fan ; Li Xuan ; Zhou Qifeng ; Luo Linkai
Author_Institution :
Dept. of Autom., Xiamen Univ., Xiamen, China
Abstract :
Random forest variable importance measure (RF-VIM) has been widely used in many applications such as bioinformatics. The permutation based VIM accesses the variable importance with changes of the OOB accuracy caused by random permutation. In this paper a margin based variable importance measure was proposed, which use the changes in the margin distribution of training samples caused by random permutation to assess the importance of features. Experiments on gene expression data show that the method outperforms both the original RF-VIM and the Gini importance on some datasets.
Keywords :
bioinformatics; data handling; random processes; Gini importance; RF-VIM; bioinformatics; gene expression data; margin based variable importance measure; margin distribution; random forest variable importance measure; random permutation; Accuracy; Cancer; Lungs; Radio frequency; Training; Tumors; Vegetation; feature selection; margin distribution; random forest; variable importance;
Conference_Titel :
Computer Science & Education (ICCSE), 2011 6th International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-9717-1
DOI :
10.1109/ICCSE.2011.6028885