• DocumentCode
    86304
  • Title

    A Flexible, Self-Tuning, Fault-Tolerant Functional Unit Array Processor

  • Author

    Jun Yao ; Nakashima, Y. ; Saito, M. ; Hazama, Y. ; Yamanaka, R.

  • Author_Institution
    Nara Inst. of Sci. & Technol., Nara, Japan
  • Volume
    34
  • Issue
    6
  • fYear
    2014
  • fDate
    Nov.-Dec. 2014
  • Firstpage
    54
  • Lastpage
    63
  • Abstract
    Today, small feature-sized transistors and wires led by process miniaturization have shown increasing vulnerability to both transient and permanent faults. Modular redundancy in circuits and failure-unit isolation are thereby employed to guarantee the correct execution and also to lengthen the lifespan of electronic devices. This article proposes the Explicit Redundancy Linear Array (EReLA) architecture to provide highly flexible fault tolerance. EReLA follows the baseline reconfigurable architecture, which works best with failure-unit isolation and hot-swap techniques. In addition, specifically for the preparation of hot swaps, the authors propose a low-cost self-tuning scheme to quickly locate the precise position of the defective processing element or network connection. Powered by these schemes, EReLA can function the same as a traditional TMR processor in terms of fault tolerance with a third less power consumption, as indicated by the power data of a 0.18 μm prototype EReLA chip. The simulation data indicates that EReLA achieves around 4× the lifespan of the traditional TMR processor.
  • Keywords
    microprocessor chips; power consumption; TMR processor; baseline reconfigurable architecture; electronic devices; explicit redundancy linear array architecture; failure-unit isolation techniques; fault tolerance; fault-tolerant functional unit array processor; hot-swap techniques; modular redundancy; permanent faults; power consumption; size 0.18 mum; small feature-sized transistors; transient faults; Fault tolerance; Integrated circuits; Radiation detectors; Redundancy; Transistors; Tunneling magnetoresistance; VLIW; diagnostics; fault tolerance; redundant design;
  • fLanguage
    English
  • Journal_Title
    Micro, IEEE
  • Publisher
    ieee
  • ISSN
    0272-1732
  • Type

    jour

  • DOI
    10.1109/MM.2014.92
  • Filename
    6981851