• DocumentCode
    1832580
  • Title

    A Gate Level Analysis of Transient Faults Effects on Dual-Core Chip-Multi Processors

  • Author

    Didehban, Moslem ; Sadafi, Ario ; Salehi, Sajjad ; Chami, Mohammad Bagher

  • Author_Institution
    Dept. of Comput. Eng. & Inf. Technol., Amirkabir Univ. of Technol., Tehran, Iran
  • fYear
    2011
  • fDate
    22-26 Aug. 2011
  • Firstpage
    365
  • Lastpage
    370
  • Abstract
    With continuous scaling in CMOS technology the number of transistors grows more and more in a single chip and it makes modern processors prone to the risk of transient fault. In this work the effects of transient faults in MIPS-based Chip-Multi Processors (CMPs) are investigated in two phases. In the first phase a low level fault injection is performed and sensitive components is determined. In the next phase, in order to improve the reliability term in CMPs, two simple low overhead fault tolerant techniques are employed on the most vulnerable components in the MIPS-based dual-core processor. Hsiao code was used which is an optimal minimum odd-weight-column single error correction and double error detection SEC-DED code to protect MPI and program counters. TMR (Triple Modular Redundancy) technique is used to improve reliability of the Arbiter. Using fault injection improves 12.8% in error recovery and 16.6% reduction of failure rate with negligible performance overhead.
  • Keywords
    CMOS integrated circuits; fault tolerance; integrated circuit reliability; microprocessor chips; multiprocessing systems; CMOS technology; CMP; Hsiao code; MIPS-based chip-multiprocessor; MIPS-based dual-core processor; MPI; SEC-DED code; double error detection; dual-core chip-multiprocessor; error correction; fault tolerant technique; gate level analysis; low level fault injection; program counter; reliability; transient fault effect; transistor; triple modular redundancy; Circuit faults; Fault tolerance; Fault tolerant systems; Logic gates; Program processors; Tiles; Transient analysis; Chip multiprocessor; Fault injection; MIPS Architecture; Transient fault;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Availability, Reliability and Security (ARES), 2011 Sixth International Conference on
  • Conference_Location
    Vienna
  • Print_ISBN
    978-1-4577-0979-1
  • Electronic_ISBN
    978-0-7695-4485-4
  • Type

    conf

  • DOI
    10.1109/ARES.2011.61
  • Filename
    6045986