Title :
Report on Preliminary Experiments with Data Grid Models in the Agnostic Learning vs. Prior Knowledge Challenge
Author_Institution :
France Telecom R&D, Lannion
Abstract :
This paper introduces a new method1 to automatically, rapidly and reliably evaluate the class conditional information of any subset of variables in supervised learning. It is based on a partitioning of each input variable, in intervals in the numerical case and in groups of values in the categorical case. The cross-product of the univariate partitions forms a multivariate partition of the input representation space into a set of cells. This multivariate partition, called data grid, allows to evaluate the correlation between the input variables and the output variable. The best data grid is searched owing to a Bayesian model selection approach and to combinatorial algorithms. Three classification techniques exploiting data grids differently are presented and evaluated in the Agnostic Learning vs. Prior Knowledge Challenge. These preliminary experiments demonstrate the interest of using data grid in machine learning tasks.
Keywords :
Bayes methods; combinatorial mathematics; data structures; learning (artificial intelligence); Bayesian model selection approach; agnostic learning; classification techniques; combinatorial algorithms; data grid models; machine learning; multivariate partition; prior knowledge challenge; supervised learning; Bayesian methods; Frequency; Input variables; Machine learning; Machine learning algorithms; Neural networks; Partitioning algorithms; Space exploration; Supervised learning; Unsupervised learning;
Conference_Titel :
Neural Networks, 2007. IJCNN 2007. International Joint Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-4244-1379-9
Electronic_ISBN :
1098-7576
DOI :
10.1109/IJCNN.2007.4371454