Instruction set extensions for photonic synchronous coalesced accesses

Author

Keltcher, Paul ; Whelihan, David ; Hughes, John

Author_Institution

Lincoln Lab., Massachusetts Inst. of Technol., Lexington, MA, USA

fYear

2013

fDate

10-12 Sept. 2013

Firstpage

1

Lastpage

4

Abstract

Microprocessors have evolved over the last forty-plus years from purely sequential single operation machines, to pipelined super-scalar, to threaded and SIMD, and finally to multi-core and massive multi-core/thread machines. Despite these advances, the conceptual model programmers use to program them is still that of a single threaded register file bound math unit that can only be loosely synchronized with other such processors. This lack of explicit synchrony, caused by limitations of metal interconnect, limits parallel efficiency. Recent advances in silicon photonic-enabled architectures [1, 5, 7] promise to greatly enable high synchrony over long distances (centimeters or more). In this paper, it is shown that global synchrony changes the way computers can be programmed by introducing a new class of ISA level instruction: the globally-synchronous load-store. In the context of multiple load-store machines, the globally synchronous load-store architecture allows the programmer to think about a collection of independent load-store machines as a single load-store machine. This operation is described, and its ISA implications explored in the context of the distributed matrix transpose, which exhibits a high degree of data non-locality, and is difficult to efficiently parallelize on modern architectures.

Keywords

computer architecture; instruction sets; shared memory systems; ISA level instruction; distributed matrix transpose; global synchrony; globally synchronous load-store architecture; independent load-store machines; instruction set extensions; multiple load-store machines; photonic synchronous coalesced accesses; single load-store machine; Graphics processing units; Instruction sets; Memory management; Photonics; Transmission line matrix methods; coalesced memory; computer architectures; instruction set; photonics; shared memory;

fLanguage

English

Publisher

ieee

Conference_Titel

High Performance Extreme Computing Conference (HPEC), 2013 IEEE

Conference_Location

Waltham, MA

Print_ISBN

978-1-4799-1364-0

Type

conf

DOI

10.1109/HPEC.2013.6670326

Filename

6670326