DocumentCode
183000
Title
Parallel feature selection using positive approximation based on MapReduce
Author
Qing He ; Xiaohu Cheng ; Fuzhen Zhuang ; Zhongzhi Shi
Author_Institution
Key Lab. of Intell. Inf. Process., Inst. of Comput. Technol., Beijing, China
fYear
2014
fDate
19-21 Aug. 2014
Firstpage
397
Lastpage
402
Abstract
Over the last few decades, feature selection has been a hot research area in pattern recognition and machine learning, and many famous feature selection algorithms have been proposed. Among them, feature selection using positive approximation(FSPA) is an accelerator for traditional rough set based feature selection algorithms, which can significantly reduce the running time. However, FSPA still cannot handle large scale and high dimension dataset due to the memory constraints. In this paper, we propose a parallel implementation of FSPA using MapReduce framework, which is a programming model for processing large scale datasets. The experimental results demonstrate that the proposed algorithm can process large scale and high dimension dataset efficiently on commodity computers.
Keywords
approximation theory; learning (artificial intelligence); parallel processing; pattern recognition; rough set theory; FSPA; MapReduce; commodity computers; large scale datasets; machine learning; parallel feature selection; pattern recognition; positive approximation; rough set based feature selection algorithms; Algorithm design and analysis; Approximation algorithms; Approximation methods; Arrays; Computational modeling; Computers; Machine learning algorithms;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2014 11th International Conference on
Conference_Location
Xiamen
Print_ISBN
978-1-4799-5147-5
Type
conf
DOI
10.1109/FSKD.2014.6980867
Filename
6980867
Link To Document