DocumentCode :
390906
Title :
On a capacity control using Boolean kernels for the learning of Boolean functions
Author :
Sadohara, Ken
Author_Institution :
Nat. Inst. of Adv. Ind. Sci. & Technol., Ibaraki, Japan
fYear :
2002
fDate :
2002
Firstpage :
410
Lastpage :
417
Abstract :
This paper concerns the classification task in discrete attribute spaces, but considers the task in a more fundamental framework: the learning of Boolean functions. The purpose of this paper is to present a new learning algorithm for Boolean functions called Boolean kernel classifier (BKC) employing capacity control using Boolean kernels. BKC uses support vector machines (SVMs) as learning engines and Boolean kernels are primarily used for running SVMs in feature spaces spanned by conjunctions of Boolean literals. However, another important role of Boolean kernels is to appropriately control the size of its hypothesis space, to avoid overfitting. After applying a SVM to learn a classifier f in a feature space H induced by a Boolean kernel, BKC uses another Boolean kernel to compute the projections fk of f onto a subspace Hk of H spanned by conjunctions with length at most k. By evaluating the accuracy of fk on training data for any k, BKC can determine the smallest k such that fk is as accurate as f and learn another f´ in Hk expected to have lower error for unseen data. By an empirical study on learning of randomly generated Boolean functions, it is shown that the capacity control is effective, and BKC outperforms C4.5 and naive Bayes classifiers.
Keywords :
Boolean functions; data mining; learning (artificial intelligence); learning automata; pattern classification; Boolean function learning algorithm; Boolean kernel classifier; Boolean kernels; Boolean literals; capacity control; classification task; discrete attribute spaces; feature spaces; hypothesis space; projections; subspace; support vector machines; training data; Aerospace industry; Boolean functions; Centralized control; Engines; Industrial control; Kernel; Machine learning; Space technology; Support vector machine classification; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7695-1754-4
Type :
conf
DOI :
10.1109/ICDM.2002.1183934
Filename :
1183934
Link To Document :
بازگشت