• DocumentCode
    1689582
  • Title
    A quad-tree based multiresolution approach for two-dimensional summary data
  • Author
    Buccafurri, Francesco ; Furfaro, F. ; Sacca, D. ; Sirangelo, C.
  • Author_Institution
    Dept. of DIMET, Reggio Calabria Univ., Italy
  • fYear
    2003
  • Firstpage
    127
  • Lastpage
    137
  • Abstract
    In many application contexts, such as statistical databases, scientific databases, query optimizers, and OLAP, data are often summarized into synopses of aggregate values. Summarization has the great advantage of saving space, but querying aggregate data rather than the original data introduces estimation errors that cannot in general be avoided, since summarization is a lossy compression. A central problem in designing summarization techniques is to retain a certain degree of accuracy in reconstructing query answers. In this paper we restrict our attention to two-dimensional data, which are relevant for a number of applications, and propose a hierarchical summarization technique, which is combined with the use of indices, i.e., compact structures providing an approximate description of portions of the original data. Experimental results show that the technique gives approximation errors much smaller than other "general purpose" techniques, such as wavelets and various types of multi-dimensional histograms.
  • Keywords
    approximation theory; data compression; data mining; quadtrees; compact structure; data description; data index; data summarization; estimation error; greedy algorithm; hierarchical summarization; lossy compression; quad-tree; querying aggregate data; summary data; tree based multiresolution; two-dimensional data; Approximation error; Data mining; Estimation error; Frequency estimation; Internet; Intrusion detection; Transaction databases
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Scientific and Statistical Database Management, 2003. 15th International Conference on
  • Conference_Location
    Cambridge, MA
  • ISSN
    1099-3371
  • Print_ISBN
    0-7695-1964-4
  • Type
    conf
  • DOI
    10.1109/SSDM.2003.1214974
  • Filename
    1214974