DocumentCode :
3027355
Title :
Some theory and practice of greedy off-line textual substitution
Author :
Apostolico, Alberto ; Lonardi, Stefano
Author_Institution :
Purdue Univ., West Lafayette, IN, USA
fYear :
1998
fDate :
30 Mar-1 Apr 1998
Firstpage :
119
Lastpage :
128
Abstract :
Greedy off-line textual substitution refers to the following steepest descent approach to compression or structural inference. Given a long text string x, a substring w is identified such that replacing all instances of w in x except one by a suitable pair of pointers yields the highest possible contraction of x; the process is then repeated on the contracted text string, until substrings capable of producing contractions can no longer be found. This paper examines the computational issues and performance resulting from implementations of this paradigm in preliminary applications and experiments. Apart from intrinsic interest, these methods may find use in the compression of massively disseminated data, and lend themselves to efficient parallel implementation, perhaps on dedicated architectures
Keywords :
data compression; encoding; parallel architectures; tree data structures; word processing; compression; computational issues; contracted text string; data structures; dedicated architectures; encoding; experiments; greedy off-line textual substitution; massively disseminated data compression; parallel implementation; performance; pointers; steepest descent approach; structural inference; substring; suffix tree; text string; Costs; Labeling; Physics computing; Statistics; USA Councils;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Compression Conference, 1998. DCC '98. Proceedings
Conference_Location :
Snowbird, UT
ISSN :
1068-0314
Print_ISBN :
0-8186-8406-2
Type :
conf
DOI :
10.1109/DCC.1998.672138
Filename :
672138
Link To Document :
بازگشت