Title :
Comparing approaches to prepare data in classification problems
Author :
Gonçalves, Paulo M., Jr. ; Barros, Roberto S M
Author_Institution :
Centro de Informdtica, Univ. Fed. de Pernambuco, Recife, Brazil
Abstract :
This paper presents a comparison between DMPML and three data mining applications (Weka, RapidMiner, and KN-IME) that implement the directed graph approach, concerning the time spent to create and execute the data preparation tasks for two data mining algorithms. The tests were executed using different types of data sets: numerical, categorical, and mixed. We observed that the scheme used by the DMPML framework can simplify the usage of different data mining algorithms and reduce the time spent creating the data preparation tasks.
Keywords :
data mining; data preparation; directed graphs; numerical analysis; pattern classification; DMPML; KN-IME; RapidMiner; Weka; categorical test; classification problem; data mining algorithm; data preparation task execution; data set; directed graph approach; mixed test; numerical test; Artificial neural networks; Computers; Conferences; Data mining; Machine learning; XML;
Conference_Titel :
Computer Systems and Applications (AICCSA), 2011 9th IEEE/ACS International Conference on
Conference_Location :
Sharm El-Sheikh
Print_ISBN :
978-1-4577-0475-8
Electronic_ISBN :
2161-5322
DOI :
10.1109/AICCSA.2011.6126613