DocumentCode
255397
Title
Optimization of feature selection method for high dimensional data using fisher score and minimum spanning tree
Author
Singh, B. ; Sankhwar, J.S. ; Vyas, O.P.
Author_Institution
Inf. Technol., IIITA, Allahabad, India
fYear
2014
fDate
11-13 Dec. 2014
Firstpage
1
Lastpage
6
Abstract
For classification of High Dimensional data, feature selection is the most important step for obtaining optimal result with respect to processing power required and time taken. Feature selection is a method by which the most relevant feature is selected from a set of features containing redundant and irrelevant features thereby reducing the load on the classification algorithm. This paper proposes an implementation of this method in a two tier structure. In the first step, a high ranking feature is selected using the well-known filter based algorithm - Fisher Score. This algorithm selects the relevant feature from the feature set based on a preset threshold. The second step generates a cluster of redundant features utilizing MST (Minimum Spanning Tree) algorithm, which are then filtered out to preserve the most relevant features out of each cluster. This increases the classification accuracy as well as running time and hence the computation cost. The efficacy of the presented approach was validated by comparing the same with other well-known feature selection algorithms: Fisher Score, CFS and ConsSF with respect to three classifiers: IB1, C4.5, and Naïve Bayes. The presented approach achieved higher Classification accuracy and lower run time.
Keywords
feature selection; pattern classification; trees (mathematics); CFS; ConsSF; Fisher score; MST; classification algorithm; feature selection method; high dimensional data; minimum spanning tree; redundant features; Accuracy; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Correlation; Data mining; Filtering algorithms; Feature Selection; Fisher Score; High Dimensional Data; Minimum Spanning Tree;
fLanguage
English
Publisher
ieee
Conference_Titel
India Conference (INDICON), 2014 Annual IEEE
Conference_Location
Pune
Print_ISBN
978-1-4799-5362-2
Type
conf
DOI
10.1109/INDICON.2014.7030450
Filename
7030450
Link To Document