DocumentCode :
3026160
Title :
Study of dimension reduction methodologies in data mining
Author :
Sharma, Nitika ; Saroha, Kriti
Author_Institution :
Sch. of IT, Center for Dev. of Adv. Comput. (C-DAC), Noida, India
fYear :
2015
fDate :
15-16 May 2015
Firstpage :
133
Lastpage :
137
Abstract :
The data mining applications such as bioinformatics, risk management, forensics etc., involves very high dimensional dataset. Due to large number of dimensions, a well known problem of “Curse of Dimensionality” occurs. This problem leads to lower accuracy of machine learning classifiers due to involvement of many insignificant and irrelevant dimensions or features in the dataset. There are many methodologies that are being used to find the Critical Dimensions for a dataset that significantly reduces the number of dimensions. These feature reduction and subset selection methods reduce feature set, that eventually results in high classification accuracy and lower computation cost of machine learning algorithms. This paper surveys the schemes that are majorly used for Dimensionality Reduction mainly focusing Bioinformatics, Agricultural, Gene and Protein Expression datasets. A comparative analysis of surveyed methodologies is also done, based on which, best methodology for a certain type of dataset can be chosen.
Keywords :
data mining; pattern classification; agricultural dataset; bioinformatics dataset; curse-of-dimensionality; data mining; dimension reduction methodologies; feature reduction method; gene dataset; machine learning classifiers; protein expression dataset; subset selection method; Accuracy; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Data mining; Machine learning algorithms; Principal component analysis; Data mining; Principal Component Analysis; critical dimension; curse of dimensionality; dimensionality reduction; feature selection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computing, Communication & Automation (ICCCA), 2015 International Conference on
Conference_Location :
Noida
Print_ISBN :
978-1-4799-8889-1
Type :
conf
DOI :
10.1109/CCAA.2015.7148359
Filename :
7148359
Link To Document :
بازگشت