Title :
Prototyping a fault-tolerant multiprocessor SoC with run-time fault recovery
Author :
Zhu, Xinping ; Qin, Wei
Author_Institution :
Northeastern Univ., Boston, MA
Abstract :
Modern integrated circuits (ICs) are becoming increasingly complex. The complexity makes it difficult to design, manufacture and integrate these high-performance ICs. The advent of multiprocessor systems-on-chips (SoCs) makes it even more challenging for programmers to utilize the full potential of the computation resources on the chips. In the mean time, the complexity of the chip design creates new reliability challenges. As a result, chip designers and users cannot fully exploit the tremendous silicon resources on the chip. This research proposes a prototype which is composed of a fault-tolerant multiprocessor SoC and a coupled single program, multiple data (SPMD) programming framework. We use a SystemC based modeling and simulation environment to design and analyze this prototype. Our analysis shows that this prototype as a reliable computing platform constructed from the potentially unreliable chip resources, thus protecting the previous investment of hardware and software designs. Moreover, the promising application-driven simulation results shed light on the potential of a scalable and reliable multiprocessing computing platform for a wide range of mission-critical applications
Keywords :
integrated circuit design; integrated circuit modelling; integrated circuit testing; logic design; multiprocessing systems; system-on-chip; SystemC; fault-tolerant multiprocessor; multiprocessing computing platform; multiprocessor system-on-chips; network-on-chip; programming framework; retargetable simulation; run-time fault recovery; run-time verification; Chip scale packaging; Circuit faults; Computational modeling; Fault tolerance; Manufacturing; Multiprocessing systems; Programming profession; Prototypes; Runtime; Software prototyping; Design; Experimentation; Performance; Verification; fault-tolerance; multiprocessor system; network-on-chip; retargetable simulation; run-time verification; system-on-chip;
Conference_Titel :
Design Automation Conference, 2006 43rd ACM/IEEE
Conference_Location :
San Francisco, CA
Print_ISBN :
1-59593-381-6
DOI :
10.1109/DAC.2006.229177