DocumentCode
625614
Title
CASTED: Core-Adaptive Software Transient Error Detection for Tightly Coupled Cores
Author
Mitropoulou, Konstantina ; Porpodas, Vasileios ; Cintra, M.
Author_Institution
Sch. of Inf., Univ. of Edinburgh, Edinburgh, UK
fYear
2013
fDate
20-24 May 2013
Firstpage
513
Lastpage
524
Abstract
Aggressive silicon process scaling over the last years has made transistors faster and less power consuming. Meanwhile, transistors have become more susceptible to errors. The need to maintain high reliability has led to the development of various software-based error detection methodologies which target either single-core or multi-core processors. In this work, we present CASTED, a Core-Adaptive Soft- ware Transient Error Detection methodology that focuses on improving the impact of error detection overhead on single- chip scalable architectures that are composed of tightly coupled cores. The proposed compiler methodology adaptively distributes the error detection overhead to the available resources across multiple cores, fully exploiting the abundant ILP of these architectures. CASTED adapts to a wide range of architecture configurations (issue-width, inter-core delay). We evaluate our technique on a range of architecture configurations using the MediabenchII video and SPEC CINT2000 benchmark suites. Our approach successfully adapts to (and regularly outperforms by up to 21.2%) the best fixed state-of- the-art approach while maintaining the same fault coverage.
Keywords
error detection; multiprocessing systems; program compilers; software architecture; software reliability; CASTED; ILP; MediabenchII video; SPEC CINT2000 benchmark suites; aggressive silicon process scaling; architecture configurations; compiler methodology; core-adaptive software transient error detection; error detection overhead; multicore processors; power consuming; reliability; single-chip scalable architectures; single-core processor; software-based error detection methodology; tightly coupled cores; Clustering algorithms; Computer architecture; Delays; Hardware; Program processors; Registers; Transient analysis; adaptivity; software error detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel & Distributed Processing (IPDPS), 2013 IEEE 27th International Symposium on
Conference_Location
Boston, MA
ISSN
1530-2075
Print_ISBN
978-1-4673-6066-1
Type
conf
DOI
10.1109/IPDPS.2013.107
Filename
6569838
Link To Document