DocumentCode
1237083
Title
Bisectional fault-tolerant communication architecture for supercomputer systems
Author
Ghafoor, Arif ; Bashkow, Theodore R. ; Ghafoor, Imran
Author_Institution
Dept. of Electr. & Comput. Eng., Syracuse Univ., NY, USA
Volume
38
Issue
10
fYear
1989
fDate
10/1/1989 12:00:00 AM
Firstpage
1425
Lastpage
1446
Abstract
A highly versatile communication architecture, the bisectional interconnection network, is proposed. These networks possess many attractive features such as small internode distances, ability to do self-routing which is easily extendible to failure conditions, and the capability of maximal fault tolerance. The proposed architecture allows optimal implementation of various logical configurations. Furthermore, the authors propose the use of a combinatorial structure, called the symmetric balanced incomplete block design (SBIBD), to partition these networks. This important property of partitioning allows the system´s expansion with fault tolerance and is utilized to describe two semidistributed fault-diagnostic strategies which require remarkably low overhead and at the same time identify a large number of faulty nodes. Furthermore, based on SBIBDs, a unique approach for making the diagnostic scheme itself fault tolerant is proposed
Keywords
fault tolerant computing; multiprocessor interconnection networks; SBIBD; bisectional interconnection network; combinatorial structure; communication architecture; fault tolerance; partitioning; self-routing; supercomputer systems; symmetric balanced incomplete block design; Algorithm design and analysis; Computer networks; Distributed computing; Fault tolerance; Fault tolerant systems; Multiprocessing systems; Multiprocessor interconnection networks; Partitioning algorithms; Supercomputers; Very large scale integration;
fLanguage
English
Journal_Title
Computers, IEEE Transactions on
Publisher
ieee
ISSN
0018-9340
Type
jour
DOI
10.1109/12.35837
Filename
35837
Link To Document