Title :
Robust linear dimensionality reduction
Author :
Koren, Yehuda ; Carmel, Liran
Author_Institution :
AT&T Labs, Florham Park, NJ, USA
Abstract :
We present a novel family of data-driven linear transformations, aimed at finding low-dimensional embeddings of multivariate data, in a way that optimally preserves the structure of the data. The well-studied PCA and Fisher´s LDA are shown to be special members in this family of transformations, and we demonstrate how to generalize these two methods such as to enhance their performance. Furthermore, our technique is the only one, to the best of our knowledge, that reflects in the resulting embedding both the data coordinates and pairwise relationships between the data elements. Even more so, when information on the clustering (labeling) decomposition of the data is known, this information can also be integrated in the linear transformation, resulting in embeddings that clearly show the separation between the clusters, as well as their internal structure. All of this makes our technique very flexible and powerful, and lets us cope with kinds of data that other techniques fail to describe properly.
Keywords :
data reduction; data visualisation; feature extraction; image classification; principal component analysis; Fisher LDA; PCA; data visualization; data-driven linear transformations; feature extraction; linear discriminant analysis; multivariate data; principal component analysis; robust linear dimensionality reduction; Assembly; Data visualization; Feature extraction; Humans; Labeling; Linear discriminant analysis; Multidimensional systems; Principal component analysis; Robustness; Vectors; Dimensionality reduction; Fisher´s linear discriminant analysis.; classification; feature extraction; linear transformation; principal component analysis; projection; visualization; Algorithms; Computer Simulation; Discriminant Analysis; Image Enhancement; Image Interpretation, Computer-Assisted; Imaging, Three-Dimensional; Linear Models; Pattern Recognition, Automated; Principal Component Analysis; Reproducibility of Results; Sensitivity and Specificity;
Journal_Title :
Visualization and Computer Graphics, IEEE Transactions on
DOI :
10.1109/TVCG.2004.17