Title :
Partition Incremental Discretization
Author :
Pinto, Carlos ; Gama, João
Author_Institution :
Algarve Univ., Porto
Abstract :
In this paper we propose a new method to perform incremental discretization. This approach consists in splitting the task in two layers. The first layer receives the sequence of input data and stores statistics of this data, using a higher number of intervals than what is usually required. The final discretization is generated by the second layer, based on the statistics stored by the previous layer. The proposed architecture processes streaming examples in a single scan, in constant time and space even for infinite sequences of examples. We demonstrate with examples that incremental discretization achieves better results than batch discretization, maintaining the performance of learning algorithms. The proposed method is much more appropriate to evaluate incremental algorithms, and in problems where data flows continuously as most of recent data mining applications
Keywords :
learning (artificial intelligence); artificial intelligence; data mining; learning algorithms; machine learning; partition incremental discretization; Acceleration; Bayesian methods; Classification tree analysis; Data analysis; Data mining; Learning systems; Machine learning; Machine learning algorithms; Statistics; Training data; Artificial Intelligence; Incremental Discretization; Machine Learning; Pre-Processing;
Conference_Titel :
Artificial intelligence, 2005. epia 2005. portuguese conference on
Conference_Location :
Covilha
Print_ISBN :
0-7803-9366-X
Electronic_ISBN :
0-7803-9366-X
DOI :
10.1109/EPIA.2005.341288