DocumentCode :
1791717
Title :
Low redundancy feature selection with grouped variables and its application to healthcare data
Author :
Hang Wu ; Ji-Jiang Yang ; Jianqiang Li
Author_Institution :
Dept. of Autom., Tsinghua Univ., Beijing, China
fYear :
2014
fDate :
27-30 Oct. 2014
Firstpage :
71
Lastpage :
76
Abstract :
In the current era massive datasets in healthcare are becoming much more available for analysis, where numerous features are designed or constructed to represent a patient. Feature selection algorithms play a key role in reducing the data dimension thus speeding up the succeeding learning algorithms as well as improving predicting accuracy. How to select the appropriate subset of features with low redundancy is one of the interesting problem in feature selection. In this paper, we present a new feature selection algorithm which aims to select low redundant features in the setting of grouped variables. We adopt a global optimization method based on Lipschitz continuity and present evaluation results on several datasets, which demonstrates the correctness and effectiveness of our algorithm.
Keywords :
data handling; learning (artificial intelligence); medical information systems; optimisation; Lipschitz continuity; data dimension; feature selection algorithms; global optimization method; grouped variables; healthcare data; learning algorithms; low redundancy feature selection; Accuracy; Algorithm design and analysis; Linear programming; Optimization; Prediction algorithms; Redundancy; Vectors; bioinformatics; biomedical informatics; data analysis; data mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location :
Washington, DC
Type :
conf
DOI :
10.1109/BigData.2014.7004396
Filename :
7004396
Link To Document :
بازگشت