Title :
Feature Selection Based on a New Dependency Measure
Author :
Sha, Chaofeng ; Qiu, Xipeng ; Zhou, Aoying
Author_Institution :
Dept. of Comput. Sci. & Eng., Fudan Univ., Shanghai
Abstract :
Feature selection is a process commonly used in machine learning, wherein a subset of the features available from the data are selected for application of a learning algorithm. Feature selection is effective in reducing dimensionality, removing irrelevant data, increasing learning accuracy and efficiency. In this paper, we propose a new information distance to measure the relevancy of two features. Unlike the information measure in previous feature selection works, our proposed information distance meets the condition of triangle inequality. We use InfoDist to feature selection and the experimental results showed it has a better performance.
Keywords :
data reduction; information theory; learning (artificial intelligence); dependency measure; dimensionality reduction; feature selection; information distance; learning algorithm; machine learning; triangle inequality; Application software; Chaos; Computer science; Data engineering; Fuzzy systems; Information theory; Knowledge engineering; Machine learning; Mutual information; Text categorization;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
Conference_Location :
Shandong
Print_ISBN :
978-0-7695-3305-6
DOI :
10.1109/FSKD.2008.515