• DocumentCode
    186017
  • Title

    Complexity aspects of multi-machine aggregations in a rough-granular computation framework

  • Author

    Synak, Piotr ; Slezak, Dominik

  • Author_Institution
    Infobright Inc., Toronto, ON, Canada
  • fYear
    2014
  • fDate
    22-24 Oct. 2014
  • Firstpage
    275
  • Lastpage
    280
  • Abstract
    We investigate computational complexity of a task of optimizing multi-machine execution of analytical data processing operations. We concentrate on an example of aggregation queries in Infobright´s database engine, which is designed using the paradigms of rough sets and granular computing. The task is to optimize decomposition of data blocks and aggregation groups among machines having access to a shared data storage. The paper includes some examples of optimization functions and constraints, as well as the proof of NP-hardness of the considered task for some of them. It can be treated as a guideline for developing scalable data processing and mining methods based on massive parallelism and approximate computing.
  • Keywords
    computational complexity; database management systems; granular computing; query processing; rough set theory; Infobright database engine; NP-hardness; analytical data processing operations; approximate computing; computational complexity aspects; data blocks; granular computing; mining methods; multimachine aggregations; multimachine execution; queries; rough sets; rough-granular computation framework; scalable data processing; shared data storage; Complexity theory; Databases; Engines; Merging; Minimization; Optimization; Schedules;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Granular Computing (GrC), 2014 IEEE International Conference on
  • Conference_Location
    Noboribetsu
  • Type

    conf

  • DOI
    10.1109/GRC.2014.6982849
  • Filename
    6982849