Title :
Discriminant Subspaces of Some High Dimensional Pattern Classification Problems
Author :
Li, Hongyu ; Niranjan, Mahesan
Author_Institution :
Univ. of Sheffield, Sheffield
Abstract :
In this paper, we report on an empirical study of several high dimensional classification problems and show that much of the discriminant information may lie in low dimensional subspaces. Feature subset selection is achieved either by forward selection or backward elimination from the full feature space with support vector machines (SVMs) as base classifiers. These "wrapper" methods are compared with a "filter" method of feature selection using information gain as discriminant criterion. Publicly available data sets in areas of text categorization, chemoinformatics, and gene expression analysis are used to illustrate the idea. We found that forward selection systematically outperforms backward elimination at low dimensions when applied to these problems. These observations are known anecdotally in the machine learning community, but here we provide empirical support on a wide range of problems in different domains.
Keywords :
learning (artificial intelligence); pattern classification; support vector machines; base classifiers; chemoinformatics; discriminant criterion; discriminant subspaces; feature subset selection; gene expression analysis; high dimensional pattern classification problems; machine learning; support vector machines; text categorization; Bioinformatics; Filtering; Filters; Genomics; Machine learning; Machine learning algorithms; Pattern classification; Support vector machine classification; Support vector machines; Text categorization;
Conference_Titel :
Machine Learning for Signal Processing, 2007 IEEE Workshop on
Conference_Location :
Thessaloniki
Print_ISBN :
978-1-4244-1566-3
Electronic_ISBN :
1551-2541
DOI :
10.1109/MLSP.2007.4414277