Title :
Automata and graph compression
Author :
Mehryar Mohri;Michael Riley;Ananda Theertha Suresh
Author_Institution :
Courant Institute and Google Research, USA
fDate :
6/1/2015 12:00:00 AM
Abstract :
We present a theoretical framework for the compression of automata, which are widely used representations in speech processing, natural language processing and many other tasks. As a corollary, our framework further covers graph compression. We introduce a probabilistic process of graph and automata generation that is similar to stationary ergodic processes and that covers real-world phenomena. We also introduce a universal compression scheme LZA for this probabilistic model and show that LZA significantly outperforms other compression techniques such as gzip and the UNIX compress command for several synthetic and real data sets.
Keywords :
"Mobile communication","Algorithm design and analysis","Encoding"
Conference_Titel :
Information Theory (ISIT), 2015 IEEE International Symposium on
Electronic_ISBN :
2157-8117
DOI :
10.1109/ISIT.2015.7283005