Title :
A Fault-Tolerant Distributed Solution for the Parallel Gaussian Reduction of Huge Arrays
Author :
Mocanu, Adrian ; Tapus, Nicolae
Abstract :
Solving a linear system of equations is a common task performed for most of the scientific experiments. Everyday, these applications increase their demands, pushing the computational capacity to the limits. At a certain point, our systems, running common algorithms, become unable to process such massive amounts of information. The paper analyses and modifies the standard Gaussian reduction so that a huge array may be handled by almost any distributed system. Moreover, the paper focuses on providing a decent level of fault tolerance in order to comply with a real, heterogeneous distributed system and on becoming a solution for sensitive applications.
Keywords :
Gaussian processes; computational complexity; fault tolerant computing; parallel processing; computational capacity; fault-tolerant distributed solution; heterogeneous distributed system; linear equation system; parallel Gaussian reduction; standard Gaussian reduction; Computer architecture; Equations; Fault tolerance; Fault tolerant systems; Graphics processing units; Monitoring; Peer-to-peer computing; Gaussian reduction; distributed; fault-tolerant; huge arrays;
Conference_Titel :
Symbolic and Numeric Algorithms for Scientific Computing (SYNASC), 2012 14th International Symposium on
Conference_Location :
Timisoara
Print_ISBN :
978-1-4673-5026-6
DOI :
10.1109/SYNASC.2012.52