• DocumentCode
    1386914
  • Title

    Information distance

  • Author

    Bennett, Charles H. ; Gács, Péter ; Li, Ming ; Vitanyi, P.M.B. ; Zurek, Wojciech H.

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    44
  • Issue
    4
  • fYear
    1998
  • fDate
    7/1/1998 12:00:00 AM
  • Firstpage
    1407
  • Lastpage
    1423
  • Abstract
    While Kolmogorov (1965) complexity is the accepted absolute measure of information content in an individual finite object, a similarly absolute notion is needed for the information distance between two individual objects, for example, two pictures. We give several natural definitions of a universal information metric, based on length of shortest programs for either ordinary computations or reversible (dissipationless) computations. It turns out that these definitions are equivalent up to an additive logarithmic term. We show that the information distance is a universal cognitive similarity distance. We investigate the maximal correlation of the shortest programs involved, the maximal uncorrelation of programs (a generalization of the Slepian-Wolf theorem of classical information theory), and the density properties of the discrete metric spaces induced by the information distances. A related distance measures the amount of nonreversibility of a computation. Using the physical theory of reversible computation, we give an appropriate (universal, antisymmetric, and transitive) measure of the thermodynamic work required to transform one object in another object by the most efficient process. Information distance between individual objects is needed in pattern recognition where one wants to express effective notions of "pattern similarity" or "cognitive similarity" between individual objects and in thermodynamics of computation where one wants to analyze the energy dissipation of a computation from a particular input to a particular output.
  • Keywords
    information theory; pattern recognition; thermodynamics; Kolmogorov complexity; Slepian-Wolf theorem; antisymmetric measure; cognitive similarity; density properties; discrete metric spaces; dissipationless computations; energy dissipation; finite object; information content; information distance; information theory; maximal correlation; nonreversibility; pattern recognition; pattern similarity; reversible computations; shortest program length; thermodynamic work; transitive measure; universal cognitive similarity distance; universal information metric; Additives; Computer science; Discrete transforms; Entropy; Information theory; Noise measurement; Pattern recognition; Physics computing; Thermodynamics; Turing machines;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/18.681318
  • Filename
    681318