DocumentCode
2708905
Title
A Non-parametric Semi-supervised Discretization Method
Author
Bondu, A. ; Boulle, M. ; Lemaire, V. ; Loiseau, S. ; Duval, B.
Author_Institution
Orange Labs., Lannion
fYear
2008
fDate
15-19 Dec. 2008
Firstpage
53
Lastpage
62
Abstract
Semi-supervised classification methods aim to exploit labelled and unlabelled examples to train a predictive model. Most of these approaches make assumptions on the distribution of classes. This article first proposes a new semi-supervised discretization method which adopts very low informative prior on data. This method discretizes the numerical domain of a continuous input variable, while keeping the information relative to the prediction of classes. Then, an in-depth comparison of this semi-supervised method with the original supervised MODL approach is presented. We demonstrate that the semi-supervised approach is asymptotically equivalent to the supervised approach, improved with a post-optimization of the intervals bounds location.
Keywords
data mining; learning (artificial intelligence); pattern classification; data mining; minimal optimized description length; nonparametric semisupervised discretization; predictive model; semisupervised classification; semisupervised learning; supervised MODL; Bayesian methods; Bonding; Classification algorithms; Data mining; Input variables; Iterative algorithms; Maximum likelihood estimation; Optimization methods; Predictive models; Semisupervised learning; Discretization; Non-parametric; Semi-supervised;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location
Pisa
ISSN
1550-4786
Print_ISBN
978-0-7695-3502-9
Type
conf
DOI
10.1109/ICDM.2008.35
Filename
4781100
Link To Document