DocumentCode
2011346
Title
Atomic Vector Operations on Chip Multiprocessors
Author
Kumar, Sanjeev ; Kim, Daehyun ; Smelyanskiy, Mikhail ; Chen, Yen-Kuang ; Chhugani, Jatin ; Hughes, Christopher J. ; Kim, Changkyu ; Lee, Victor W. ; Nguyen, Anthony D.
Author_Institution
Microprocessor Technol. Labs., Intel Corp., Santa Clara, CA
fYear
2008
fDate
21-25 June 2008
Firstpage
441
Lastpage
452
Abstract
The current trend is for processors to deliver dramatic improvements in parallel performance while only modestly improving serial performance. Parallel performance is harvested through vector/SIMD instructions as well as multithreading (through both multithreaded cores and chip multiprocessors). Vector parallelism can be more efficiently supported than multithreading, but is often harder for software to exploit. In particular, code with sparse data access patterns cannot easily utilize the vector/SIMD instructions of mainstream processors. Hardware to scatter and gather sparse data has previously been proposed to enable vector execution for these codes. However, on multithreaded architectures, a number of applications spend significant time on atomic operations (e.g., parallel reductions), which cannot be vectorized using previously proposed schemes. This paper proposes architectural support for atomic vector operations (referred to as GLSC) that addresses this limitation. GLSC extends scatter-gather hardware to support atomic memory operations. Our experiments show that the GLSC provides an average performance improvement on a set of important RMS kernels of 54% for 4-wide SIMD.
Keywords
multi-threading; vector processor systems; GLSC; SIMD instructions; atomic vector operations; chip multiprocessors; multithreaded architectures; parallel performance; vector parallelism; Application software; Computer architecture; Hardware; Microprocessors; Multithreading; Parallel processing; Read-write memory; Scattering; Software performance; Yarn; SIMD; locks; multiprocessors; reductions; vector;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Architecture, 2008. ISCA '08. 35th International Symposium on
Conference_Location
Beijing
ISSN
1063-6897
Print_ISBN
978-0-7695-3174-8
Type
conf
DOI
10.1109/ISCA.2008.38
Filename
4556746
Link To Document