DocumentCode :
183000
Title :
Parallel feature selection using positive approximation based on MapReduce
Author :
Qing He ; Xiaohu Cheng ; Fuzhen Zhuang ; Zhongzhi Shi
Author_Institution :
Key Lab. of Intell. Inf. Process., Inst. of Comput. Technol., Beijing, China
fYear :
2014
fDate :
19-21 Aug. 2014
Firstpage :
397
Lastpage :
402
Abstract :
Over the last few decades, feature selection has been a hot research area in pattern recognition and machine learning, and many famous feature selection algorithms have been proposed. Among them, feature selection using positive approximation(FSPA) is an accelerator for traditional rough set based feature selection algorithms, which can significantly reduce the running time. However, FSPA still cannot handle large scale and high dimension dataset due to the memory constraints. In this paper, we propose a parallel implementation of FSPA using MapReduce framework, which is a programming model for processing large scale datasets. The experimental results demonstrate that the proposed algorithm can process large scale and high dimension dataset efficiently on commodity computers.
Keywords :
approximation theory; learning (artificial intelligence); parallel processing; pattern recognition; rough set theory; FSPA; MapReduce; commodity computers; large scale datasets; machine learning; parallel feature selection; pattern recognition; positive approximation; rough set based feature selection algorithms; Algorithm design and analysis; Approximation algorithms; Approximation methods; Arrays; Computational modeling; Computers; Machine learning algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2014 11th International Conference on
Conference_Location :
Xiamen
Print_ISBN :
978-1-4799-5147-5
Type :
conf
DOI :
10.1109/FSKD.2014.6980867
Filename :
6980867
Link To Document :
بازگشت