• DocumentCode
    3690243
  • Title

    On the architecture of a big data classification tool based on a map reduce approach for hyperspectral image analysis

  • Author

    V. A. Ayma;R. S. Ferreira;P. N. Happ;D. A. B. Oliveira;G. A. O. P. Costa;R. Q. Feitosa;A. Plaza;P. Gamba

  • Author_Institution
    Dept. of Electrical Engineering, Pontifical Catholic University of Rio de Janeiro, Brazil
  • fYear
    2015
  • fDate
    7/1/2015 12:00:00 AM
  • Firstpage
    1508
  • Lastpage
    1511
  • Abstract
    Advances in remote sensors are providing exceptional quantities of large-scale data with increasing spatial, spectral and temporal resolutions, raising new challenges in its analysis, e.g. those presents in classification processes. This work presents the architecture of the InterIMAGE Cloud Platform (ICP): Data Mining Package; a tool able to perform supervised classification procedures on huge amounts of data, on a distributed infrastructure. The architecture is implemented on top of the MapReduce framework. The tool has four classification algorithms implemented taken from WEKA´s machine learning library, namely: Decision Trees, Naïve Bayes, Random Forest and Support Vector Machines. The SVM classifier was applied on datasets of different sizes (2 GB, 4 GB and 10 GB) for different cluster configurations (5, 10, 20, 50 nodes). The results show the tool as a potential approach to parallelize classification processes on big data.
  • Keywords
    "Big data","Iterative closest point algorithm","Data mining","Computer architecture","Cloud computing","Classification algorithms","Training"
  • Publisher
    ieee
  • Conference_Titel
    Geoscience and Remote Sensing Symposium (IGARSS), 2015 IEEE International
  • ISSN
    2153-6996
  • Electronic_ISBN
    2153-7003
  • Type

    conf

  • DOI
    10.1109/IGARSS.2015.7326066
  • Filename
    7326066