DocumentCode
580128
Title
Early performance evaluation of a "Nehalem" cluster using scientific and engineering applications
Author
Saini, Shrikant ; Naraikin, A. ; Biswas, Rubel ; Barkai, D. ; Sandstrom, T.
Author_Institution
NASA Ames Res., Moffett Field, CA, USA
fYear
2009
fDate
14-20 Nov. 2009
Firstpage
1
Lastpage
12
Abstract
In this paper, we present an early performance evaluation of a 624-core cluster based on the Intel® Xeon® Processor 5560 (code named "Nehalem-EP", and referred to as Xeon 5560 in this paper)---the third-generation quad-core architecture from Intel. This is the first processor from Intel with a non-uniform memory access (NUMA) architecture managed by on-chip integrated memory controller. It employs a point-to-point interconnect called the Intel® QuickPath Interconnect (QPI) between processors and to the input/output (I/O) hub. It also introduces to a quad-core architecture both Intel\´s hyper-threading technology (or simultaneous multi-threading, "SMT") and Intel® Turbo Boost Technology ("Turbo mode") that automatically allow processor cores to run faster than the base operating frequency if the processor is operating below rated power, temperature, and current specification limits. It can be engaged with any number of cores or logical processors enabled and active. We critically evaluate these features using the High Performance Computing Challenge (HPCC) benchmarks, NAS Parallel Benchmarks (NPB), and four full-scale scientific applications. We compare and contrast the results of a cluster based on the Xeon 5560 with an SGI® Altix® ICE 8200EX cluster of quad-core Intel® Xeon® 5472 Processor ("Xeon 5472" from here on) and another cluster of Intel® Xeon® 5462 Processor ("Xeon 5462"; the Xeon 5400 Series Processors are previous generation quad-core Intel processors and were code named Harpertown).
Keywords
benchmark testing; input-output programs; multi-threading; multiprocessing systems; multiprocessor interconnection networks; parallel processing; performance evaluation; HPCC benchmarks; I/O hub; Intel QuickPath Interconnect; Intel Turbo Boost Technology; Intel Xeon Processor 5560; NAS parallel benchmarks; NPB; NUMA architecture; Nehalem cluster; Nehalem-EP; QPI; SGI Altix ICE 8200EX cluster; SMT; Xeon 5560; base operating frequency; current specification limits; engineering application; four full-scale scientific applications; high performance computing challenge benchmarks; hyper-threading technology; input/output hub; logical processors; nonuniform memory access architecture; on-chip integrated memory controller; performance evaluation; point-to-point interconnect; processor cores; quad-core Intel Xeon 5472 processor; simultaneous multithreading; third-generation quad-core architecture; turbo mode;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing Networking, Storage and Analysis, Proceedings of the Conference on
Conference_Location
Portland, OR
Type
conf
DOI
10.1145/1654059.1654084
Filename
6375577
Link To Document