Title :
Employing discrete Bayes error rate for discretization and feature selection tasks
Author :
Mittal, Ankush ; Cheong, Loong-Fah
Author_Institution :
Dept. of Comput. Sci., Nat. Univ. of Singapore, Singapore
Abstract :
The tasks of discretization and feature selection are frequently used to improve classification accuracy. We use discrete approximation of Bayes error rate to perform discretization on the features. The discretization procedure targets minimization of Bayes error rate within each partition. A class-pair discriminatory measure can be defined on discretized partitions which forms the basis of the feature selection algorithm. A small value of this measure for a class-pair indicates that the class-pair in consideration is confusing and the features which distinguish them well should be chosen first. A video classification problem on a large database is considered for showing the comparison of a classifier using our discretization and feature selection tasks with SVM, neural network classifier, decision trees and K-nearest neighbor classifier.
Keywords :
Bayes methods; data mining; decision trees; image classification; learning automata; neural nets; very large databases; video databases; K-nearest neighbor classifier; SVM; class-pair discriminatory measure; classification; data mining; decision trees; discrete Bayes error rate; discrete approximation; discretization; feature selection tasks; large database; minimization; neural network classifier; video classification problem; Computer errors; Computer science; Error analysis; Inference algorithms; Minimization methods; Partitioning algorithms; Probability distribution; Spatial databases; Support vector machine classification; Support vector machines;
Conference_Titel :
Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7695-1754-4
DOI :
10.1109/ICDM.2002.1183916