Title :
Semi-Supervised Fisher Linear Discriminant (SFLD)
Author :
Remus, Seda ; Tomasi, Carlo
Author_Institution :
Dept. of Comput. Sci., Clarkson Univ., Potsdam, NY, USA
Abstract :
Supervised learning uses a training set of labeled examples to compute a classifier which is a mapping from feature vectors to class labels. The success of a learning algorithm is evaluated by its ability to generalize, i.e., to extend this mapping accurately to new data that is commonly referred to as the test data. Good generalization depends crucially on the quality of the training set. Because collecting labeled data is laborious, training sets are typically small. Furthermore, it is often difficult to represent all possible observation scenarios during training, so that the statistics of the training set end up differing from those of the test data, a problem known as the sample selection bias. To address sample selection bias, we introduce a Semi-Supervised Fisher Linear Discriminant (SFLD) that utilizes additional, unlabeled data to improve generalization for both small and biased training sets. We characterize the conditions under which SFLD helps, and illustrate its benefits through experiments on digit and car recognition applications.
Keywords :
generalisation (artificial intelligence); learning (artificial intelligence); pattern classification; statistical analysis; car recognition; class labels; classifier; digit recognition; feature vectors; generalization; mapping; semisupervised Fisher linear discriminant; supervised learning; training set; Character recognition; Classification algorithms; Computer science; Machine learning; Sampling methods; Statistical analysis; Supervised learning; Testing; Training data; Vectors; Classification; Fisher Linear Discriminant; Generalization; Sample Selection Bias;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495365