• DocumentCode
    775035
  • Title

    The potential of compile-time analysis to adapt the cache coherence enforcement strategy to the data sharing characteristics

  • Author

    Mounes-Toussi, Farnaz ; Lilja, David J.

  • Author_Institution
    Dept. of Electr. Eng., Minnesota Univ., Minneapolis, MN, USA
  • Volume
    6
  • Issue
    5
  • fYear
    1995
  • fDate
    5/1/1995 12:00:00 AM
  • Firstpage
    470
  • Lastpage
    481
  • Abstract
    Cache coherence schemes that dynamically adapt to memory referencing patterns have been proposed to improve coherence enforcement in shared-memory multiprocessors. By using only run-time information, however, these existing schemes are incapable of looking ahead in the memory referencing stream. We present a combined hardware-software strategy that uses the predictive capability of the compiler to select updating or invalidating for each write reference. To determine the potential performance improvement that can be achieved with this optimization, three different levels of compiler capabilities are examined. Simulations using memory traces show that with an ideal compiler, this optimization can potentially reduce the miss ratio by 0.4% to 15% compared to an invalidating-only scheme, while reducing the generated network traffic by 13% to 94% compared to an updating-only scheme. In addition, this optimization can potentially reduce the miss ratio by up to 13%, while reducing the generated network traffic by up to 92%, compared to a dynamic adaptive scheme. Furthermore, performance can be potentially improved even with a compiler capable of performing only imprecise array subscript analysis and no interprocedural analysis
  • Keywords
    cache storage; performance evaluation; program compilers; shared memory systems; cache coherence; cache coherence enforcement; compile-time analysis; compiler capabilities; data sharing characteristics; memory referencing; performance improvement; shared-memory multiprocessors; Broadcasting; Delay effects; Large-scale systems; Optimizing compilers; Performance analysis; Runtime; Senior members; Switches; Telecommunication traffic; Traffic control;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/71.382316
  • Filename
    382316