• DocumentCode
    939233
  • Title

    On FastMap and the convex hull of multivariate data: toward fast and robust dimension reduction

  • Author

    Ostrouchov, George ; Samatova, Nagiza F.

  • Author_Institution
    Div. of Comput. Sci. & Math., Oak Ridge Nat. Lab., TN, USA
  • Volume
    27
  • Issue
    8
  • fYear
    2005
  • Firstpage
    1340
  • Lastpage
    1343
  • Abstract
    FastMap is a dimension reduction technique that operates on distances between objects. Although only distances are used, implicitly the technique assumes that the objects are points in a p-dimensional Euclidean space. It selects a sequence of k ≤ p orthogonal axes defined by distant pairs of points (called pivots) and computes the projection of the points onto the orthogonal axes. We show that FastMap uses only the outer envelope of a data set. Pivots are taken from the faces, usually vertices, of the convex hull of the data points in the original implicit Euclidean space. This provides a bridge to results in robust statistics, where the convex hull is used as a tool in multivariate outlier detection and in robust estimation methods. The connection sheds new light on the properties of FastMap, particularly its sensitivity to outliers, and provides an opportunity for a new class of dimension reduction algorithms, RobustMaps, that retain the speed of FastMap and exploit ideas in robust statistics.
  • Keywords
    computational geometry; principal component analysis; Euclidean space; FastMap; convex hull; dimension reduction algorithms; dimension reduction technique; multivariate data; multivariate outlier detection; principal component analysis; Bridges; Data visualization; Euclidean distance; Extraterrestrial measurements; Face detection; Multidimensional systems; Principal component analysis; Robustness; Statistical distributions; Statistics; Euclidean distance.; FastMap; Index Terms- Dimension reduction; RobustMap; convex hull; multidimensional scaling; principal components; robust statistics; Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Models, Statistical; Multivariate Analysis; Pattern Recognition, Automated; Signal Processing, Computer-Assisted; Subtraction Technique;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2005.164
  • Filename
    1453521