Title :
Reliability evaluation of dependable distributed computing systems based on recursive merge and BDD
Author :
Chang, Yung-Ruei ; Lin, Hung-Yau ; Kuo, Sy-Yen
Author_Institution :
Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Abstract :
System reliability evaluation, sensitivity analysis, importance measures, failure frequency analysis and optimal design have become important issues for distributed dependable computing. Finding all the minimal file spanning trees (MFST) and avoiding repeatedly computing the redundant MFSTs is the key technique for evaluating the reliability of a distributed computing system (DCS) in previous works. However, identifying all the disjoint MFSTs is difficult and very time consuming for large-scale networks. Although existing algorithms have been demonstrated that they work fine on medium-scale networks, they have two inherent drawbacks. First, they do not support efficient manipulation of Boolean algebra. The sum-of-disjoint-products method used by them is inefficient in dealing with large Boolean functions. Second, the tree-based partitioning algorithm does not merge isomorphic subproblems and therefore, redundant computations cannot be avoided. We propose a new efficient algorithm for the reliability evaluation of a DCS based on recursive merge and binary decision diagram (BDD). Using the BDD substitution technique, we can easily apply our algorithm to a network with imperfect nodes. The experimental results show a significant improvement on the execution time compared to previous works.
Keywords :
Boolean functions; binary decision diagrams; distributed processing; fault tolerant computing; sensitivity analysis; system recovery; BDD technique; Boolean algebra; Boolean functions; binary decision diagram; distributed dependable computing system; failure frequency analysis; minimal file spanning trees; sensitivity analysis; system reliability evaluation; tree-based partitioning algorithm; Binary decision diagrams; Boolean functions; Distributed computing; Distributed control; Failure analysis; Frequency measurement; Large-scale systems; Partitioning algorithms; Reliability; Sensitivity analysis;
Conference_Titel :
Dependable Computing, 2004. Proceedings. 10th IEEE Pacific Rim International Symposium on
Print_ISBN :
0-7695-2076-6
DOI :
10.1109/PRDC.2004.1276570