DocumentCode
48035
Title
Performance evaluation of many-core systems: case study with TILEPro64
Author
Han-Yee Kim ; Young-hwan Kim ; HeonChang Yu ; Taeweon Suh
Author_Institution
Dept. of Comput. Sci. Educ., Korea Univ., Seoul, South Korea
Volume
7
Issue
4
fYear
2013
fDate
Jul-13
Firstpage
143
Lastpage
154
Abstract
This study evaluates the performance of the 64-core-based TILEPro64, and compares it with Core i7 and Atom by executing three benchmark programs: a synthetic bench, SPEC CINT2006 and SPLASH-2. TILEPro64 is not advertised for regular applications such as SPLASH-2. However, its internal many-core structure makes it worth investigating the performance characteristic with conventional benchmarks. The synthetic benchmark shows that the stall time because of on-chip network takes up to 85% of total execution time in TILEPro64. The single-core performance with CINT2006 reports that Core i7 and Atom deliver 15.4 × and 3.8 × superior performance to TILEPro64, respectively. The parallel performance with SPLASH-2 reports a similar trend. Comparing the fastest execution times, Core i7 boasts of a 19.2 × faster performance than TILEPro64 and even Atom outperforms TILEPro64 by 2.6 × on average. It came as a surprise that even Atom outperforms TILEPro64 in most of the benchmark programs. The highest number of last-level cache misses is a major culprit for low performance. The forerunner many-core products such as TILEPro64 offer excellent test-beds for polishing, adjusting and reshaping many-core architecture in the right direction.
Keywords
multiprocessing systems; performance evaluation; 64-core-based TILEPro64; Atom; Core i7; SPEC CINT2006; SPLASH-2; internal many-core structure; last-level cache misses; on-chip network; parallel performance; performance evaluation; single-core performance; stall time; synthetic benchmark;
fLanguage
English
Journal_Title
Computers & Digital Techniques, IET
Publisher
iet
ISSN
1751-8601
Type
jour
DOI
10.1049/iet-cdt.2012.0101
Filename
6562921
Link To Document