DocumentCode
185567
Title
An overview of free software tools for general data mining
Author
Jovic, A. ; Brkic, K. ; Bogunovic, N.
Author_Institution
Dept. of Electron., Microelectron., Comput. & Intell. Syst., Univ. of Zagreb, Zagreb, Croatia
fYear
2014
fDate
26-30 May 2014
Firstpage
1112
Lastpage
1117
Abstract
This expert paper describes the characteristics of six most used free software tools for general data mining that are available today: RapidMiner, R, Weka, KNIME, Orange, and scikit-learn. The goal is to provide the interested researcher with all the important pros and cons regarding the use of a particular tool. A comparison of the implemented algorithms covering all areas of data mining (classification, regression, clustering, associative rules, feature selection, evaluation criteria, visualization, etc.) is provided. In addition, the tools´ support for the more advanced and specialized research topics (big data, data streams, text mining, etc.) is outlined, where applicable. The tools are also compared with respect to the community support, based on the available sources. This multidimensional overview in the form of expert paper on data mining tools emphasizes the quality of RapidMiner, R, Weka, and KNIME platforms, but also acknowledges the significant advancements made in the other tools.
Keywords
data mining; public domain software; DM; KNIME; Orange; R; RapidMiner; Weka; free software tools; general data mining; scikit-learn; Big data; Communities; Data mining; Data visualization; Graphical user interfaces; Machine learning algorithms; Vegetation;
fLanguage
English
Publisher
ieee
Conference_Titel
Information and Communication Technology, Electronics and Microelectronics (MIPRO), 2014 37th International Convention on
Conference_Location
Opatija
Print_ISBN
978-953-233-081-6
Type
conf
DOI
10.1109/MIPRO.2014.6859735
Filename
6859735
Link To Document