• DocumentCode
    3663535
  • Title

    Automata and graph compression

  • Author

    Mehryar Mohri;Michael Riley;Ananda Theertha Suresh

  • Author_Institution
    Courant Institute and Google Research, USA
  • fYear
    2015
  • fDate
    6/1/2015 12:00:00 AM
  • Firstpage
    2989
  • Lastpage
    2993
  • Abstract
    We present a theoretical framework for the compression of automata, which are widely used representations in speech processing, natural language processing and many other tasks. As a corollary, our framework further covers graph compression. We introduce a probabilistic process of graph and automata generation that is similar to stationary ergodic processes and that covers real-world phenomena. We also introduce a universal compression scheme LZA for this probabilistic model and show that LZA significantly outperforms other compression techniques such as gzip and the UNIX compress command for several synthetic and real data sets.
  • Keywords
    "Mobile communication","Algorithm design and analysis","Encoding"
  • Publisher
    ieee
  • Conference_Titel
    Information Theory (ISIT), 2015 IEEE International Symposium on
  • Electronic_ISBN
    2157-8117
  • Type

    conf

  • DOI
    10.1109/ISIT.2015.7283005
  • Filename
    7283005