DocumentCode
2787212
Title
Multicore Surprises: Lessons Learned from Optimizing Sweep3D on the Cell Broadband Engine
Author
Petrini, Fabrizio ; Fossum, Gordon ; Fernandez, Juan ; Varbanescu, Ana Lucia ; Kistler, Mike ; Perrone, Michael
Author_Institution
Pacific Northwest Nat. Lab., Richland, WA
fYear
2007
fDate
26-30 March 2007
Firstpage
1
Lastpage
10
Abstract
The Cell Broadband Engine (BE) processor has the potential to deliver an impressive level of performance for scientific applications. This level of performance can be reached by exploiting several dimensions of parallelism: thread-level parallelism across several synergistic processing elements, data streaming parallelism, vector parallelism in the form of 128-bit SIMD operations, and pipeline parallelism by issuing multiple instructions in the same clock cycle. In our effort to achieve the optimum level of performance for Sweep3D, we have enjoyed many pleasant surprises, such as very high floating-point performance, reaching 64% of the theoretical peak in double precision, and an overall speedup ranging from 4.5 times when compared with "heavy iron" processors to over 20 times when compared with conventional processors.
Keywords
computer architecture; natural sciences computing; parallel processing; pipeline processing; 128-bit SIMD operations; Cell Broadband Engine processor; Sweep3D optimization; multicore surprises; pipeline parallelism; scientific applications; Buildings; Computer architecture; Concurrent computing; Energy consumption; Engines; High performance computing; Laboratories; Multicore processing; Parallel processing; Yarn;
fLanguage
English
Publisher
IEEE
Conference_Titel
Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International
Conference_Location
Long Beach, CA
Print_ISBN
1-4244-0910-1
Electronic_ISBN
1-4244-0910-1
Type
conf
DOI
10.1109/IPDPS.2007.370252
Filename
4227980