• DocumentCode
    39608
  • Title

    Discriminative Fine-Grained Mixing for Adaptive Compression of Data Streams

  • Author

    Gedik, Bugra

  • Author_Institution
    Comput. Eng. Dept., Bilkent Univ., Ankara, Turkey
  • Volume
    63
  • Issue
    9
  • fYear
    2014
  • fDate
    Sept. 2014
  • Firstpage
    2228
  • Lastpage
    2244
  • Abstract
    This paper introduces an adaptive compression algorithm for transfer of data streams across operators in stream processing systems. The algorithm is adaptive in the sense that it can adjust the amount of compression applied based on the bandwidth, CPU, and workload availability. It is discriminative in the sense that it can judiciously apply partial compression by selecting a subset of attributes that can provide good reduction in the used bandwidth at a low cost. The algorithm relies on the significant differences that exist among stream attributes with respect to their relative sizes, compression ratios, compression costs, and their amenability to application of custom compressors. As part of this study, we present a modeling of uniform and discriminative mixing, and provide various greedy algorithms and associated metrics to locate an effective setting when model parameters are available at run-time. Furthermore, we provide online and adaptive algorithms for real-world systems in which system parameters that can be measured at run-time are limited. We present a detailed experimental study that illustrates the superiority of discriminative mixing over uniform mixing.
  • Keywords
    data compression; greedy algorithms; CPU; bandwidth; compression costs; compression ratios; data stream adaptive compression algorithm; data stream processing systems; discriminative fine-grained mixing; greedy algorithms; partial compression; relative sizes; stream attributes; uniform mixing; workload availability; Adaptive compression; stream compression;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2013.103
  • Filename
    6509891