Title :
Verification and validation of Parallel Support Vector Machine algorithm based on MapReduce Program model on Hadoop cluster
Author :
Kiran, M. ; Kumar, Ajit ; Prathap, B.R.
Author_Institution :
Fac. of Eng., Dept. of Comput. Sci. & Eng., Christ Univ., Bangalore, India
Abstract :
From the recent years the large volume of data is growing bigger and bigger. It is difficult to measure the total volume of structured and unstructured data that require machine-based systems and technologies in order to be fully analyzed. Efficient implementation techniques are the key to meeting the scalability and performance requirements entailed in such scientific data analysis. So for the same in this paper the Sequential Support Vector Machine in WEKA and various MapReduce Programs including Parallel Support Vector Machine on Hadoop cluster is analyzed and thus, in this way Algorithms are Verified and Validated on Hadoop Cluster using the Concept of MapReduce. In this paper, the performance of above applications has been shown with respect to execution time/training time and number of nodes. Experimental Results shows that as the number of nodes increases the execution time decreases. This experiment is basically a research study of above MapReduce applications.
Keywords :
data analysis; parallel processing; support vector machines; Hadoop cluster; MapReduce program model; WEKA; execution time; machine-based systems; parallel support vector machine algorithm; scientific data analysis; sequential support vector machine; structured data; training time; unstructured data; Communication systems; Computational modeling; File systems; Machine learning algorithms; Prediction algorithms; Support vector machines; Vectors; Hadoop; LIBSVM; Machine Learning; MapReduce; MultiFileWordCount; Parallel SVM; SVM; WEKA Tool;
Conference_Titel :
Advanced Computing and Communication Systems (ICACCS), 2013 International Conference on
Conference_Location :
Coimbatore
DOI :
10.1109/ICACCS.2013.6938728