DocumentCode
3269192
Title
A note on the estimation of string complexity for short strings
Author
Speidel, Ulrich
Author_Institution
Dept. of Comput. Sci., Univ. of Auckland, Auckland, New Zealand
fYear
2009
fDate
8-10 Dec. 2009
Firstpage
1
Lastpage
5
Abstract
While Kolmogorov´s well-known results show that absolute string complexity is not computable, its estimation is nevertheless of importance in fields such as randomness testing, data compression evaluation, similarity measurement and event detection. Several complexity estimators have been developed over the years, with the Lempel-Ziv parsers being the most prominent. The estimators´ asymptotic behaviour for long strings is generally compatible, but they differ considerably in the domain of short strings. Short strings are however exactly the kind of data encountered in many of the practical application areas. This paper proposes a method for evaluating the comparative performance of such estimators and presents experimental results for Lempel-Ziv parsers and a more recent estimator, the T-complexity.
Keywords
computational complexity; grammars; Lempel Ziv parsers; T complexity; data compression evaluation; event detection; randomness testing; short strings; similarity measurement; string complexity estimation; Compression algorithms; Computer science; Data compression; Entropy; Event detection; Production; Proposals; Testing; Upper bound; Vocabulary; LZ77; LZ78; Lempel-Ziv production complexity; T-complexity;
fLanguage
English
Publisher
ieee
Conference_Titel
Information, Communications and Signal Processing, 2009. ICICS 2009. 7th International Conference on
Conference_Location
Macau
Print_ISBN
978-1-4244-4656-8
Electronic_ISBN
978-1-4244-4657-5
Type
conf
DOI
10.1109/ICICS.2009.5397536
Filename
5397536
Link To Document