DocumentCode :
2966833
Title :
Data Compression on Multisets. Submultiset-Free Codes
Author :
Bonchi, Cosmin ; Izba, Cornel ; Ghergu, Gratiela ; Ciobanu, Gabriel
Author_Institution :
Res. Inst. e-Austria, Timisoara, Romania
fYear :
2008
fDate :
26-29 Sept. 2008
Firstpage :
152
Lastpage :
157
Abstract :
Regular data compression converts a sequence of symbols into a sequence of strings (descriptions), assigning the shortest descriptions to the most frequent symbols produced by the data source. By contrast, here we consider the compression of a sequence of symbols to a sequence of multisets. This is relevant if the communication channel does not preserve order (positioning) of the transmitted letters, as is the case in many biological contexts; for example, if you regard the passage of a multiset of molecules through a blood vessel as an information transfer. We introduce the concept of submultiset-free codes, a counterpart to the existing concept of prefix-free codes. We derive a theorem about the structure of optimal submultiset codes for uniform random variables and we describe an algorithm for constructing them.
Keywords :
codes; data compression; telecommunication channels; biological context; communication channel; data compression; multiset channel; prefix-free code; submultiset-free code; uniform random variable; Biological information theory; Blood vessels; Cells (biology); Communication channels; Computer science; Context; Data compression; Decoding; Random variables; Scientific computing; multiset; prefix-free codes; submultiset-free codes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Symbolic and Numeric Algorithms for Scientific Computing, 2008. SYNASC '08. 10th International Symposium on
Conference_Location :
Timisoara
Print_ISBN :
978-0-7695-3523-4
Type :
conf
DOI :
10.1109/SYNASC.2008.89
Filename :
5204803
Link To Document :
بازگشت