• DocumentCode
    42472
  • Title

    A Performance Analysis Methodology for Multicore, Multithreaded Processors

  • Author

    Miao Ju ; Hun Jung ; Hao Che

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of Texas, Arlington, TX, USA
  • Volume
    63
  • Issue
    2
  • fYear
    2014
  • fDate
    Feb. 2014
  • Firstpage
    276
  • Lastpage
    289
  • Abstract
    A key challenge to program a chip multiprocessor (CMP) is how to evaluate the performance of various possible program-task-to-core mapping choices during the initial programming phase, when the executable program is yet to be developed. In this paper, we put forward a thread-level modeling methodology to meet this challenge. The idea is to model thread-level activities only and overlook the instruction-level and microarchitectural details, except those having significant impact on the thread-level performance. Moreover, since the thread-level modeling is much coarser than the instruction-level modeling, the analysis at this level turns out to be significantly faster than that at the instruction level. These features make the methodology particularly amenable for fast performance evaluation of a large number of program-task-to-core mapping choices during the initial programming phase. Based on this methodology, an analytic modeling technique based on queuing theory and a fast simulation tool are developed, both allowing for fast performance prediction of CMPs. Case studies based on a large number of code samples available in IXP1200/2400 workbenches demonstrate that the maximal sustainable line rates estimated using our simulation tool and queuing network models are consistently within 6 and 8 percent of cycle-accurate simulation results, respectively.
  • Keywords
    multi-threading; multiprocessing systems; performance evaluation; queueing theory; CMP; IXP1200 workbench; IXP2400 workbench; analytic modeling technique; chip multiprocessor; executable program; fast simulation tool; instruction-level modeling; multicore processors; multithreaded processors; performance analysis methodology; program-task-to-core mapping choices; programming phase; queuing theory; thread-level modeling methodology; thread-level performance; Analytical models; Instruction sets; Mathematical model; Organizations; Programming; Servers; Analytical models; Design space; Instruction sets; Mathematical model; Organizations; Programming; Servers; performance evaluation of chip multiprocessor; queuing theory; thread-level modeling;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2012.223
  • Filename
    6302123