DocumentCode
3690243
Title
On the architecture of a big data classification tool based on a map reduce approach for hyperspectral image analysis
Author
V. A. Ayma;R. S. Ferreira;P. N. Happ;D. A. B. Oliveira;G. A. O. P. Costa;R. Q. Feitosa;A. Plaza;P. Gamba
Author_Institution
Dept. of Electrical Engineering, Pontifical Catholic University of Rio de Janeiro, Brazil
fYear
2015
fDate
7/1/2015 12:00:00 AM
Firstpage
1508
Lastpage
1511
Abstract
Advances in remote sensors are providing exceptional quantities of large-scale data with increasing spatial, spectral and temporal resolutions, raising new challenges in its analysis, e.g. those presents in classification processes. This work presents the architecture of the InterIMAGE Cloud Platform (ICP): Data Mining Package; a tool able to perform supervised classification procedures on huge amounts of data, on a distributed infrastructure. The architecture is implemented on top of the MapReduce framework. The tool has four classification algorithms implemented taken from WEKA´s machine learning library, namely: Decision Trees, Naïve Bayes, Random Forest and Support Vector Machines. The SVM classifier was applied on datasets of different sizes (2 GB, 4 GB and 10 GB) for different cluster configurations (5, 10, 20, 50 nodes). The results show the tool as a potential approach to parallelize classification processes on big data.
Keywords
"Big data","Iterative closest point algorithm","Data mining","Computer architecture","Cloud computing","Classification algorithms","Training"
Publisher
ieee
Conference_Titel
Geoscience and Remote Sensing Symposium (IGARSS), 2015 IEEE International
ISSN
2153-6996
Electronic_ISBN
2153-7003
Type
conf
DOI
10.1109/IGARSS.2015.7326066
Filename
7326066
Link To Document