• DocumentCode
    2966833
  • Title

    Data Compression on Multisets. Submultiset-Free Codes

  • Author

    Bonchi, Cosmin ; Izba, Cornel ; Ghergu, Gratiela ; Ciobanu, Gabriel

  • Author_Institution
    Res. Inst. e-Austria, Timisoara, Romania
  • fYear
    2008
  • fDate
    26-29 Sept. 2008
  • Firstpage
    152
  • Lastpage
    157
  • Abstract
    Regular data compression converts a sequence of symbols into a sequence of strings (descriptions), assigning the shortest descriptions to the most frequent symbols produced by the data source. By contrast, here we consider the compression of a sequence of symbols to a sequence of multisets. This is relevant if the communication channel does not preserve order (positioning) of the transmitted letters, as is the case in many biological contexts; for example, if you regard the passage of a multiset of molecules through a blood vessel as an information transfer. We introduce the concept of submultiset-free codes, a counterpart to the existing concept of prefix-free codes. We derive a theorem about the structure of optimal submultiset codes for uniform random variables and we describe an algorithm for constructing them.
  • Keywords
    codes; data compression; telecommunication channels; biological context; communication channel; data compression; multiset channel; prefix-free code; submultiset-free code; uniform random variable; Biological information theory; Blood vessels; Cells (biology); Communication channels; Computer science; Context; Data compression; Decoding; Random variables; Scientific computing; multiset; prefix-free codes; submultiset-free codes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Symbolic and Numeric Algorithms for Scientific Computing, 2008. SYNASC '08. 10th International Symposium on
  • Conference_Location
    Timisoara
  • Print_ISBN
    978-0-7695-3523-4
  • Type

    conf

  • DOI
    10.1109/SYNASC.2008.89
  • Filename
    5204803