Title :
Design of multi-invariant data structures for robust shared accesses in multiprocessor systems
Author :
Yen, I-Ling ; Bastani, Farokh B. ; Taylor, David J.
Author_Institution :
Dept. of Comput. Sci., Texas Univ., Dallas, TX, USA
fDate :
3/1/2001 12:00:00 AM
Abstract :
Multiprocessor systems are widely used in many application programs to enhance system reliability and performance. However, reliability does not come naturally with multiple processors. We develop a multi-invariant data structure approach to ensure efficient and robust access to shared data structures in multiprocessor systems. Essentially, the data structure is designed to satisfy two invariants, a strong invariant, and a weak invariant. The system operates at its peak performance when the strong invariant is true. The system will operate correctly even when only the weak invariant is true, though perhaps at a lower performance level. The design ensures that the weak invariant will always be true in spite of fail-stop processor failures during the execution. By allowing the system to converge to a state satisfying only the weak invariant, the overhead for incorporating fault tolerance can be reduced. We present the basic idea of multi-invariant data structures. We also develop design rules that systematically convert fault-intolerant data abstractions into corresponding fault-tolerant versions. In this transformation, we augment the data structure and access algorithms to ensure that the system always converges to the weak invariant, even in the presence of fail-stop processor failures. We also design methods for the detection of integrity violations and for restoring the strong invariant. Two data structures, namely binary search tree and double-linked list, are used to illustrate the concept of multi-invariant data structures
Keywords :
data integrity; data structures; fault tolerant computing; shared memory systems; transaction processing; tree searching; access algorithms; application programs; binary search tree; data structure; design rules; double-linked list; fail-stop processor failures; fault tolerance; fault-intolerant data abstractions; fault-tolerant versions; integrity violations; multi-invariant data structure approach; multi-invariant data structure design; multi-invariant data structures; multiple processors; multiprocessor systems; peak performance; performance level; robust access; robust shared accesses; shared data structures; strong invariant; system reliability; weak invariant; Concurrent computing; Data structures; Fault tolerant systems; Intelligent robots; Multiprocessing systems; Protocols; Reliability; Robustness; System performance; Tree data structures;
Journal_Title :
Software Engineering, IEEE Transactions on