Title :
Federated Computing for the Masses--Aggregating Resources to Tackle Large-Scale Engineering Problems
Author :
Diaz-Montes, Javier ; Yu Xie ; Rodero, Ivan ; Zola, Jaroslaw ; Ganapathysubramanian, Baskar ; Parashar, Manish
Abstract :
The complexity of many problems in science and engineering requires computational capacity exceeding what the average user can expect from a single computational center. While many of these problems can be viewed as a set of independent tasks, their collective complexity easily requires millions of core-hours on any high-power computing (HPC) resource, and throughput that can´t be sustained by a single, multiuser queuing system. An exploration of the use of aggregated HPC resources to solve large-scale engineering problems shows that it´s possible to build a computational federation that´s easy for end users to implement, and is elastic, resilient, and scalable. Here, the authors argue that the fusion of federated computing and real-life engineering problems can be brought to the average user if relevant middleware is provided. They report on the use of federation of 10 distributed heterogeneous HPC resources to perform a large-scale interrogation of the parameter space in the microscale fluid flow problem.
Keywords :
computational fluid dynamics; middleware; parallel processing; aggregated HPC resources; computational capacity; computational center; computational federation; distributed heterogeneous HPC resources; federated computing; high-power computing resource; large-scale engineering problems; large-scale parameter space interrogation; microscale fluid flow problem; middleware; throughput; Big data; Complexity theory; Computational modeling; Metasearch; Programming; Scientific computing; Throughput; cloud computing; federated computing; fluid flow; large-scale engineering problems; scientific computing; software-defined infrastructure;
Journal_Title :
Computing in Science & Engineering
DOI :
10.1109/MCSE.2013.134