• DocumentCode
    625614
  • Title

    CASTED: Core-Adaptive Software Transient Error Detection for Tightly Coupled Cores

  • Author

    Mitropoulou, Konstantina ; Porpodas, Vasileios ; Cintra, M.

  • Author_Institution
    Sch. of Inf., Univ. of Edinburgh, Edinburgh, UK
  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    513
  • Lastpage
    524
  • Abstract
    Aggressive silicon process scaling over the last years has made transistors faster and less power consuming. Meanwhile, transistors have become more susceptible to errors. The need to maintain high reliability has led to the development of various software-based error detection methodologies which target either single-core or multi-core processors. In this work, we present CASTED, a Core-Adaptive Soft- ware Transient Error Detection methodology that focuses on improving the impact of error detection overhead on single- chip scalable architectures that are composed of tightly coupled cores. The proposed compiler methodology adaptively distributes the error detection overhead to the available resources across multiple cores, fully exploiting the abundant ILP of these architectures. CASTED adapts to a wide range of architecture configurations (issue-width, inter-core delay). We evaluate our technique on a range of architecture configurations using the MediabenchII video and SPEC CINT2000 benchmark suites. Our approach successfully adapts to (and regularly outperforms by up to 21.2%) the best fixed state-of- the-art approach while maintaining the same fault coverage.
  • Keywords
    error detection; multiprocessing systems; program compilers; software architecture; software reliability; CASTED; ILP; MediabenchII video; SPEC CINT2000 benchmark suites; aggressive silicon process scaling; architecture configurations; compiler methodology; core-adaptive software transient error detection; error detection overhead; multicore processors; power consuming; reliability; single-chip scalable architectures; single-core processor; software-based error detection methodology; tightly coupled cores; Clustering algorithms; Computer architecture; Delays; Hardware; Program processors; Registers; Transient analysis; adaptivity; software error detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing (IPDPS), 2013 IEEE 27th International Symposium on
  • Conference_Location
    Boston, MA
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4673-6066-1
  • Type

    conf

  • DOI
    10.1109/IPDPS.2013.107
  • Filename
    6569838