• DocumentCode
    2447714
  • Title

    A reconfigurable architecture for multicore systems

  • Author

    Avakian, Annie ; Nafziger, Jon ; Panda, Amayika ; Vemuri, Ranga

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Cincinnati, Cincinnati, OH, USA
  • fYear
    2010
  • fDate
    19-23 April 2010
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Various studies concluded that bus-based multiprocessor architectures outperform Network-on-Chip (NoC) architectures when the number of processors is relatively small. On the other hand, NoC architectures offer distinct performance advantages when the number of processors is large. This led to recent proposals for hybrid architectures where each node in a mesh-style packet-switched NoC architecture contains a bus-based subsystem with a small number of processors. Experimental results using select benchmarks demonstrated that these hybrid architectures offer superior performance when compared with purely bus based or purely NoC style architectures. Our studies indicate that while a hybrid architecture is preferable, the optimal number of processors on each bus subsystem varies based on the application. This number appears to vary between 1 and 8 depending on the communication requirements of the application. Further, various applications simultaneously executing on the same system require differing numbers of processors on each bus-based subsystem to minimize the overall throughput time. In this paper, we present a new reconfigurable NoC architecture which allows scalable bus-based multiprocessor subsystems on each node in the NoC. Following configuration, the system provides a multi-bus execution environment where each processor is connected to a bus and the bus-based subsystems communicate via routers connected in a mesh-style configuration. The system can be reconfigured to vary the number of bus subsystems and the number of processors on each subsystem. Each processor contains a Level 1 (L1) cache and each bus, connected to a router, has access to a Level 2 (L2) cache. The L2 caches distributed across the network together form a large virtual L2 that can shared by all the processors in the system via the router network. We present the architecture in detail, discuss a configuration algorithm, and discuss experimental results (using the NS2 and SIMICS simulators) - - on standard and synthetic benchmarks indicating the performance advantages of the proposed architecture.
  • Keywords
    multiprocessing systems; network-on-chip; reconfigurable architectures; system buses; NoC; bus based multiprocessor architectures; bus based subsystem; mesh style configuration; multicore systems; network-on-chip; reconfigurable architecture; Computer architecture; Computer networks; Delay; Hardware; Multicore processing; Network-on-a-chip; Proposals; Reconfigurable architectures; Routing; Throughput; architecture; multicore; network on chip; reconfigurable;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), 2010 IEEE International Symposium on
  • Conference_Location
    Atlanta, GA
  • Print_ISBN
    978-1-4244-6533-0
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2010.5470753
  • Filename
    5470753