• DocumentCode
    146455
  • Title

    Analysis and realization of Relaxed Consistency Memory model for multi-core CPU or GPU

  • Author

    Mohanty, Ramanarayan ; Behera, Dipti Prakash ; Routray, A.

  • Author_Institution
    Adv. Technol. Dev. Centre, Indian Inst. of Technol., Kharagpur, Kharagpur, India
  • fYear
    2014
  • fDate
    25-26 Sept. 2014
  • Firstpage
    866
  • Lastpage
    870
  • Abstract
    Parallel and distributed systems that support the shared memory paradigm are becoming widely accepted in many areas of computing. The memory consistency model of a shared-memory multiprocessor system influences both the performance and the programmability of the system. Under optimal condition it is found that multithreading contributes to more than 50 percent of performance improvement, while the improvement from Relaxed Consistency Memory (RCM) models varies between 30-40 percent of total performance gain. The relaxed consistency memory model has been realized on a Graphics Processing Unit (GPU) using Open Computing Language (OpenCL) as the programming language. This memory model has been applied on a case study of high order matrix multiplication and their performance has been analysed in terms of two metrics: GPU Computation Percentage (GCP) and GPU Load Balance (GLB). With sufficient parallelism and high PCIe data transfer bandwidth, the RCM model on GPU gives the better performance in comparison to a sequential model on CPU.
  • Keywords
    graphics processing units; mathematics computing; matrix multiplication; multi-threading; performance evaluation; resource allocation; shared memory systems; storage management; GCP; GLB; GPU computation percentage; GPU load balance; Open Computing Language; PCIe data transfer bandwidth; RCM model; distributed systems; graphic processing unit; high order matrix multiplication; memory consistency model; multicore CPU; multithreading; performance analysis; performance gain; performance improvement; programmability; programming language; relaxed consistency memory model; shared memory paradigm; shared-memory multiprocessor system; Computational modeling; Graphics processing units; Kernel; Load modeling; Performance evaluation; GPU Computation Percentage; GPU Load Balance; Graphics Processing Unit; OpenCL; Relaxed Consistency Memory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Confluence The Next Generation Information Technology Summit (Confluence), 2014 5th International Conference -
  • Conference_Location
    Noida
  • Print_ISBN
    978-1-4799-4237-4
  • Type

    conf

  • DOI
    10.1109/CONFLUENCE.2014.6949247
  • Filename
    6949247