DocumentCode
2551634
Title
Discriminant Subspaces of Some High Dimensional Pattern Classification Problems
Author
Li, Hongyu ; Niranjan, Mahesan
Author_Institution
Univ. of Sheffield, Sheffield
fYear
2007
fDate
27-29 Aug. 2007
Firstpage
27
Lastpage
32
Abstract
In this paper, we report on an empirical study of several high dimensional classification problems and show that much of the discriminant information may lie in low dimensional subspaces. Feature subset selection is achieved either by forward selection or backward elimination from the full feature space with support vector machines (SVMs) as base classifiers. These "wrapper" methods are compared with a "filter" method of feature selection using information gain as discriminant criterion. Publicly available data sets in areas of text categorization, chemoinformatics, and gene expression analysis are used to illustrate the idea. We found that forward selection systematically outperforms backward elimination at low dimensions when applied to these problems. These observations are known anecdotally in the machine learning community, but here we provide empirical support on a wide range of problems in different domains.
Keywords
learning (artificial intelligence); pattern classification; support vector machines; base classifiers; chemoinformatics; discriminant criterion; discriminant subspaces; feature subset selection; gene expression analysis; high dimensional pattern classification problems; machine learning; support vector machines; text categorization; Bioinformatics; Filtering; Filters; Genomics; Machine learning; Machine learning algorithms; Pattern classification; Support vector machine classification; Support vector machines; Text categorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning for Signal Processing, 2007 IEEE Workshop on
Conference_Location
Thessaloniki
ISSN
1551-2541
Print_ISBN
978-1-4244-1566-3
Electronic_ISBN
1551-2541
Type
conf
DOI
10.1109/MLSP.2007.4414277
Filename
4414277
Link To Document