DocumentCode :
2535882
Title :
Optimized Representation for Classifying Qualitative Data
Author :
Cadot, Martine ; Lelu, Alain
Author_Institution :
Dept. Inf., Univ. de Nancy, Nancy, France
fYear :
2010
fDate :
11-16 April 2010
Firstpage :
241
Lastpage :
246
Abstract :
Extracting knowledge out of qualitative data is an ever-growing issue in our networking world. Opposite to the widespread trend consisting of extending general classification methods to zero/one-valued qualitative variables, we explore here another path: we first build a specific representation for these data, respectful of the non-occurrence as well as presence of an item, and making the interactions between variables explicit. Combinatorics considerations in our Midova expansion method limit the proliferation of itemsets when building level k+1 on level k, and limit the maximal level K. We validate our approach on three of the public access datasets of University of California, Irvine, repository: our generalization accuracy is equal or better than the best reported one, to our knowledge, on Breast Cancer and TicTacToe datasets, honorable on Monks-2 near-parity problem.
Keywords :
data structures; knowledge acquisition; pattern classification; Midova expansion method; Monks-2 near-parity problem; University of California; knowledge extraction; optimized representation; public access datasets; qualitative data classification; Combinatorial mathematics; Data mining; Databases; Displays; Itemsets; Kernel; Machine learning; Military computing; Support vector machine classification; Support vector machines; classification; feature construction; feature selection; machine learning; non-linear discrimination.;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Databases Knowledge and Data Applications (DBKDA), 2010 Second International Conference on
Conference_Location :
Menuires
Print_ISBN :
978-1-4244-6081-6
Type :
conf
DOI :
10.1109/DBKDA.2010.26
Filename :
5477120
Link To Document :
بازگشت