DocumentCode
120487
Title
Feature selection using mutual information for high- dimensional data sets
Author
Nagpal, Arpita ; Gaur, Deepti ; Gaur, Surabhi
Author_Institution
Comput. Sci. Deptt., ITM Univ., Gurgaon, India
fYear
2014
fDate
21-22 Feb. 2014
Firstpage
45
Lastpage
49
Abstract
To reduce the dimensionality of dataset, redundant and irrelevant features need to be segregated from multidimensional dataset. To remove these features, one of the feature selection techniques needs to be used. Here, a feature selection technique to remove irrelevant features has been used. Correlation measures based on the concept of mutual information has been adopted to calculate the degree of association between features. In this paper authors are proposing a new algorithm to segregate features from high dimensional data by visualizing relevant features in the form of graph as a dataset.
Keywords
data visualisation; feature selection; trees (mathematics); correlation measures; dimensionality reduction; feature removal; feature segregation; feature selection; feature visualization; high-dimensional data sets; minimum spanning tree; multidimensional dataset; mutual information; Algorithm design and analysis; Classification algorithms; Correlation; Entropy; Filtering algorithms; Mutual information; Random variables; Correlation; data set; feature selection; minimum spanning tree;
fLanguage
English
Publisher
ieee
Conference_Titel
Advance Computing Conference (IACC), 2014 IEEE International
Conference_Location
Gurgaon
Print_ISBN
978-1-4799-2571-1
Type
conf
DOI
10.1109/IAdCC.2014.6779292
Filename
6779292
Link To Document