• DocumentCode
    183000
  • Title

    Parallel feature selection using positive approximation based on MapReduce

  • Author

    Qing He ; Xiaohu Cheng ; Fuzhen Zhuang ; Zhongzhi Shi

  • Author_Institution
    Key Lab. of Intell. Inf. Process., Inst. of Comput. Technol., Beijing, China
  • fYear
    2014
  • fDate
    19-21 Aug. 2014
  • Firstpage
    397
  • Lastpage
    402
  • Abstract
    Over the last few decades, feature selection has been a hot research area in pattern recognition and machine learning, and many famous feature selection algorithms have been proposed. Among them, feature selection using positive approximation(FSPA) is an accelerator for traditional rough set based feature selection algorithms, which can significantly reduce the running time. However, FSPA still cannot handle large scale and high dimension dataset due to the memory constraints. In this paper, we propose a parallel implementation of FSPA using MapReduce framework, which is a programming model for processing large scale datasets. The experimental results demonstrate that the proposed algorithm can process large scale and high dimension dataset efficiently on commodity computers.
  • Keywords
    approximation theory; learning (artificial intelligence); parallel processing; pattern recognition; rough set theory; FSPA; MapReduce; commodity computers; large scale datasets; machine learning; parallel feature selection; pattern recognition; positive approximation; rough set based feature selection algorithms; Algorithm design and analysis; Approximation algorithms; Approximation methods; Arrays; Computational modeling; Computers; Machine learning algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery (FSKD), 2014 11th International Conference on
  • Conference_Location
    Xiamen
  • Print_ISBN
    978-1-4799-5147-5
  • Type

    conf

  • DOI
    10.1109/FSKD.2014.6980867
  • Filename
    6980867