DocumentCode
2026479
Title
Common words between two random strings
Author
Jacquet, P.
Author_Institution
INRIA, Rocquencourt
fYear
2007
fDate
24-29 June 2007
Firstpage
1481
Lastpage
1485
Abstract
We investigate the problem of enumerating the words that are common to two random strings. We show that when the source models are memoryless that the number of common words is sublinear in the length of the sequence, and linear when the source models are exactly the same. We draw the same conclusions for the number of common nodes in the associated respective suffix trees.
Keywords
memoryless systems; random codes; source coding; string matching; trees (mathematics); common words; memoryless source model; random strings; suffix trees; Autocorrelation; Cats; Convergence; Dogs; Genetics; Information analysis; Information theory; Random number generation; Random sequences; Statistical analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Theory, 2007. ISIT 2007. IEEE International Symposium on
Conference_Location
Nice
Print_ISBN
978-1-4244-1397-3
Type
conf
DOI
10.1109/ISIT.2007.4557431
Filename
4557431
Link To Document