• DocumentCode
    2981234
  • Title

    A Memory Access Model for Highly-threaded Many-core Architectures

  • Author

    Lin Ma ; Agrawal, Kunal ; Chamberlain, Roger D.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Washington Univ. in St. Louis, St. Louis, WA, USA
  • fYear
    2012
  • fDate
    17-19 Dec. 2012
  • Firstpage
    339
  • Lastpage
    347
  • Abstract
    Many-core architectures are excellent in hiding memory-access latency by low-overhead context switching among a large number of threads. The speedup of algorithms carried out on these machines depends on how well the latency is hidden. If the number of threads were infinite, then theoretically these machines should provide the performance predicted by the PRAM analysis of the programs. However, the number of allowable threads per processor is not infinite. In this paper, we introduce the Threaded Many-core Memory (TMM) model which is meant to capture the important characteristics of these highly-threaded, many-core machines. Since we model some important machine parameters of these machines, we expect analysis under this model to give more fine-grained performance prediction than the PRAM analysis. We analyze 4 algorithms for the classic all pairs shortest paths problem under this model. We find that even when two algorithms have the same PRAM performance, our model predicts different performance for some settings of machine parameters. For example, for dense graphs, the Floyd-Warshall algorithm and Johnson´s algorithms have the same performance in the PRAM model. However, our model predicts different performance for large enough memory-access latency and validates the intuition that the Floyd-Warshall algorithm performs better on these machines.
  • Keywords
    computer architecture; multiprocessing systems; random-access storage; PRAM analysis; TMM; highly threaded manycore architectures; machine parameters; memory access latency; memory access model; Algorithm design and analysis; Analytical models; Computational modeling; Hidden Markov models; Instruction sets; Phase change random access memory; Prediction algorithms; All Pairs Shortest Paths (APSP); PRAM; TMM;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Systems (ICPADS), 2012 IEEE 18th International Conference on
  • Conference_Location
    Singapore
  • ISSN
    1521-9097
  • Print_ISBN
    978-1-4673-4565-1
  • Electronic_ISBN
    1521-9097
  • Type

    conf

  • DOI
    10.1109/ICPADS.2012.54
  • Filename
    6413678