Title :
Multivariate supervised discretization, a neighborhood graph approach
Author :
Muhlenbach, Fabrice ; Rakotomalala, Ricco
Author_Institution :
ERIC Lab., Lumiere Univ. - Lyon II, Bron, France
Abstract :
We present a new discretization method in the context of supervised learning. This method entitled HyperCluster Finder is characterized by its supervised and polythetic behavior. The method is based on the notion of clusters and processes in two steps. First, a neighborhood graph construction from the learning database allows discovering homogenous clusters. Second, the minimal and maximal values of each cluster are transferred to each dimension in order to define some boundaries to cut the continuous attribute in a set of intervals. The discretization abilities of this method are illustrated by some examples, in particular processing the XOR problem.
Keywords :
data mining; learning (artificial intelligence); HyperCluster Finder; XOR problem; data mining; learning database; multivariate supervised discretization; neighborhood graph construction; polythetic behavior; supervised behavior; supervised learning; Artificial intelligence; Clustering algorithms; Data mining; Gaussian distribution; Laboratories; Learning systems; Machine learning; Machine learning algorithms; Performance loss; Supervised learning;
Conference_Titel :
Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7695-1754-4
DOI :
10.1109/ICDM.2002.1183918