• DocumentCode
    185567
  • Title

    An overview of free software tools for general data mining

  • Author

    Jovic, A. ; Brkic, K. ; Bogunovic, N.

  • Author_Institution
    Dept. of Electron., Microelectron., Comput. & Intell. Syst., Univ. of Zagreb, Zagreb, Croatia
  • fYear
    2014
  • fDate
    26-30 May 2014
  • Firstpage
    1112
  • Lastpage
    1117
  • Abstract
    This expert paper describes the characteristics of six most used free software tools for general data mining that are available today: RapidMiner, R, Weka, KNIME, Orange, and scikit-learn. The goal is to provide the interested researcher with all the important pros and cons regarding the use of a particular tool. A comparison of the implemented algorithms covering all areas of data mining (classification, regression, clustering, associative rules, feature selection, evaluation criteria, visualization, etc.) is provided. In addition, the tools´ support for the more advanced and specialized research topics (big data, data streams, text mining, etc.) is outlined, where applicable. The tools are also compared with respect to the community support, based on the available sources. This multidimensional overview in the form of expert paper on data mining tools emphasizes the quality of RapidMiner, R, Weka, and KNIME platforms, but also acknowledges the significant advancements made in the other tools.
  • Keywords
    data mining; public domain software; DM; KNIME; Orange; R; RapidMiner; Weka; free software tools; general data mining; scikit-learn; Big data; Communities; Data mining; Data visualization; Graphical user interfaces; Machine learning algorithms; Vegetation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information and Communication Technology, Electronics and Microelectronics (MIPRO), 2014 37th International Convention on
  • Conference_Location
    Opatija
  • Print_ISBN
    978-953-233-081-6
  • Type

    conf

  • DOI
    10.1109/MIPRO.2014.6859735
  • Filename
    6859735