• DocumentCode
    338374
  • Title

    Summary grids: building accurate multidimensional histograms

  • Author

    Furtado, Pedro ; Madeira, H.

  • Author_Institution
    Dept. of Eng. Inf., Coimbra Univ., Portugal
  • fYear
    1999
  • fDate
    1999
  • Firstpage
    187
  • Lastpage
    194
  • Abstract
    Data summarization is very important for many data analysis tasks. In this paper we propose a simple but efficient data summarization algorithm, which outputs a histogram for multidimensional data, and make a comparative study of its usage with different distributions and with existing algorithms. The idea is to iteratively grow and modify regions of homogeneous data. This is a different strategy from the commonly used strategy of iteratively fracturing subspaces using straight lines. This work compares both strategies and concludes that the new technique is better and helds good results. We also concluded that discriminate handling of outliers is important to provide good approximates
  • Keywords
    data analysis; data reduction; accurate multidimensional histogram building; approximates; data analysis; data summarization algorithm; discriminate outlier handling; homogeneous data; iterative subspace fracturing; straight lines; summary grids; Clustering algorithms; Data analysis; Data mining; Histograms; Iterative algorithms; Multidimensional systems; Partitioning algorithms; Runtime; Sampling methods; US Department of Transportation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database Systems for Advanced Applications, 1999. Proceedings., 6th International Conference on
  • Conference_Location
    Hsinchu
  • Print_ISBN
    0-7695-0084-6
  • Type

    conf

  • DOI
    10.1109/DASFAA.1999.765751
  • Filename
    765751