DocumentCode
1876934
Title
Addressing Cache/Memory Overheads in Enterprise Java CMP Servers
Author
Shiv, Kumar ; Iyer, Ravi ; Bhat, Mahesh ; Illikkal, Ramesh ; Jones, Michael ; Makineni, Srihari ; Domer, Jason ; Newell, Don
Author_Institution
Intel Corp., Santa Clara
fYear
2007
fDate
27-29 Sept. 2007
Firstpage
66
Lastpage
75
Abstract
As we enter the era of chip multiprocessor (CMP) architectures, it is important that we explore the scaling characteristics of mainstream server workloads on these platforms. In this paper, we analyze the performance of two significant enterprise Java workloads (SPECjAppServer2004 and SPECjbb2005) on CMP platforms -present and future. We start by characterizing the core, cache and memory behavior of these workloads on the newly released Intel core 2 Duo Xeon platform (dual-core, dual-socket). Our findings from these measurements indicate that these workloads have a significant performance dependence on cache and memory subsystems. In order to guide the evolution of future CMP platforms, we perform a detailed investigation of potential cache and memory architecture choices. This includes analyzing the effects of thread sharing and migration, object allocation and garbage collection. Based on the observed behavior, we propose architectural optimizations along three dimensions: (a) data-less cache line initialization (DCLI), (b) hardware-guided thread collocation (HGTC) and (c) on-socket DRAM caches (OSDC). In this paper, we will describe these optimizations in detail and validate their performance potential based on trace-driven simulations and execution-driven emulation. Overall, we expect that the findings in this paper will guide future CMP architectures for enterprise Java servers.
Keywords
Java; cache storage; computer architecture; electronic engineering computing; microprocessor chips; Intel core 2 Duo Xeon platform; architectural optimization; cache/memory overhead; chip multiprocessor architecture; data-less cache line initialization; enterprise Java CMP server; execution-driven emulation; hardware-guided thread collocation; on-socket DRAM cache; trace-driven simulation; Current measurement; Emulation; Frequency; Hardware; Java; Memory architecture; Microprocessors; Performance analysis; Random access memory; Yarn;
fLanguage
English
Publisher
ieee
Conference_Titel
Workload Characterization, 2007. IISWC 2007. IEEE 10th International Symposium on
Conference_Location
Boston, MA
Print_ISBN
978-1-4244-1561-8
Electronic_ISBN
978-1-4244-1562-5
Type
conf
DOI
10.1109/IISWC.2007.4362182
Filename
4362182
Link To Document