• DocumentCode
    3217746
  • Title

    Dimension reduction in the ℓ1 norm

  • Author

    Charikar, Moses ; Sahai, Amit

  • Author_Institution
    Princeton Univ., NJ, USA
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    551
  • Lastpage
    560
  • Abstract
    The Johnson-Lindenstrauss lemma shows that any set of n points in Euclidean space can be mapped linearly down to O((log n)/ε2) dimensions such that all pairwise distances are distorted by at most 1+ε. We study the basic question of whether there exists an analogue of the Johnson-Lindenstrauss lemma for the ℓ1 norm? Note that Johnson-Lindenstrauss lemma gives a linear embedding which is independent of the point set. For the ℓ1 norm, we show that one cannot hope to use linear embeddings as a dimensionality reduction tool for general point sets, even if the linear embedding is chosen as a function of the given point set. In particular, we construct a set of O(n) points in ℓ1n such that any linear embedding into ℓ1d must incur a distortion of Ω√(n/d). This bound is tight up to a log n factor. We then initiate a systematic study of general classes of ℓ1 embeddable metrics that admit low dimensional, small distortion embeddings. In particular, we show dimensionality reduction theorems for tree metrics, circular-decomposable metrics, and metrics supported on K2,3-free graphs, giving embeddings into ℓ1O(log(2) n) with constant distortion. Finally, we also present lower bounds on dimension reduction techniques for other ℓp norms. Our work suggests that the notion of a stretch-limited embedding, where no distance is stretched by more than a factor d in any dimension, is important to the study of dimension reduction for ℓ1. We use such stretch limited embeddings as a tool for proving lower bounds for dimension reduction and also as an algorithmic tool for proving positive results.
  • Keywords
    computational complexity; trees (mathematics); ℓ 1 embeddable metrics; ℓ 1 norm; Euclidean space; Johnson-Lindenstrauss lemma; K2,3 free graphs; algorithmic tool; circular-decomposable metrics; dimension reduction; linear embedding; low dimensional small distortion embeddings; lower bounds; pairwise distances; point sets; stretch-limited embedding; tree metrics; Books; Embedded computing; Extraterrestrial measurements; Upper bound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Foundations of Computer Science, 2002. Proceedings. The 43rd Annual IEEE Symposium on
  • ISSN
    0272-5428
  • Print_ISBN
    0-7695-1822-2
  • Type

    conf

  • DOI
    10.1109/SFCS.2002.1181979
  • Filename
    1181979