DocumentCode :
1720813
Title :
Detecting matrix multiplication faults in many-core systems
Author :
Sibai, Fadi N.
Author_Institution :
Fac. of Inf. Technol., UAE Univ., Al Ain, United Arab Emirates
fYear :
2011
Firstpage :
330
Lastpage :
335
Abstract :
Many-core systems are characterized by a large number of components based on ever-shrinking circuit geometries. System reliability becomes an issue because of the system complexity, the large number of components and nanoscale issues due to soft errors. While information redundancy techniques can be used for fault tolerance, they occupy too much memory space and increase the memory and network bandwidth. Moreover, in many-cores, resources are plentiful encouraging the design of simple cores without hardware fault tolerance. Thus in the absence of information redundancy, software fault detection techniques become necessary to detect errors. Herein, we present fault detection techniques for 2×2 matrix multiplication which we extend to nxn matrix multiplication. These tests can detect transient and some intermittent and permanent hardware faults. These tests are also suitable to computing grids and distributed heterogeneous systems where the result-forming node may run tests in software to validate the sub-results submitted by the grid nodes.
Keywords :
computational complexity; geometry; mathematics computing; matrix multiplication; multiprocessing systems; software fault tolerance; software reliability; circuit geometries; fault tolerance; information redundancy techniques; many core systems; matrix multiplication faults; software fault detection; system complexity; system reliability; Circuit faults; Fault tolerant systems; Hardware; Program processors; Redundancy; fault detection; many-core systems; parallel or distributed matrix multiplication;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Innovations in Information Technology (IIT), 2011 International Conference on
Conference_Location :
Abu Dhabi
Print_ISBN :
978-1-4577-0311-9
Type :
conf
DOI :
10.1109/INNOVATIONS.2011.5893843
Filename :
5893843
Link To Document :
بازگشت