Title :
Complexity aspects of multi-machine aggregations in a rough-granular computation framework
Author :
Synak, Piotr ; Slezak, Dominik
Author_Institution :
Infobright Inc., Toronto, ON, Canada
Abstract :
We investigate computational complexity of a task of optimizing multi-machine execution of analytical data processing operations. We concentrate on an example of aggregation queries in Infobright´s database engine, which is designed using the paradigms of rough sets and granular computing. The task is to optimize decomposition of data blocks and aggregation groups among machines having access to a shared data storage. The paper includes some examples of optimization functions and constraints, as well as the proof of NP-hardness of the considered task for some of them. It can be treated as a guideline for developing scalable data processing and mining methods based on massive parallelism and approximate computing.
Keywords :
computational complexity; database management systems; granular computing; query processing; rough set theory; Infobright database engine; NP-hardness; analytical data processing operations; approximate computing; computational complexity aspects; data blocks; granular computing; mining methods; multimachine aggregations; multimachine execution; queries; rough sets; rough-granular computation framework; scalable data processing; shared data storage; Complexity theory; Databases; Engines; Merging; Minimization; Optimization; Schedules;
Conference_Titel :
Granular Computing (GrC), 2014 IEEE International Conference on
Conference_Location :
Noboribetsu
DOI :
10.1109/GRC.2014.6982849