DocumentCode
646619
Title
Instruction set extensions for photonic synchronous coalesced accesses
Author
Keltcher, Paul ; Whelihan, David ; Hughes, John
Author_Institution
Lincoln Lab., Massachusetts Inst. of Technol., Lexington, MA, USA
fYear
2013
fDate
10-12 Sept. 2013
Firstpage
1
Lastpage
4
Abstract
Microprocessors have evolved over the last forty-plus years from purely sequential single operation machines, to pipelined super-scalar, to threaded and SIMD, and finally to multi-core and massive multi-core/thread machines. Despite these advances, the conceptual model programmers use to program them is still that of a single threaded register file bound math unit that can only be loosely synchronized with other such processors. This lack of explicit synchrony, caused by limitations of metal interconnect, limits parallel efficiency. Recent advances in silicon photonic-enabled architectures [1, 5, 7] promise to greatly enable high synchrony over long distances (centimeters or more). In this paper, it is shown that global synchrony changes the way computers can be programmed by introducing a new class of ISA level instruction: the globally-synchronous load-store. In the context of multiple load-store machines, the globally synchronous load-store architecture allows the programmer to think about a collection of independent load-store machines as a single load-store machine. This operation is described, and its ISA implications explored in the context of the distributed matrix transpose, which exhibits a high degree of data non-locality, and is difficult to efficiently parallelize on modern architectures.
Keywords
computer architecture; instruction sets; shared memory systems; ISA level instruction; distributed matrix transpose; global synchrony; globally synchronous load-store architecture; independent load-store machines; instruction set extensions; multiple load-store machines; photonic synchronous coalesced accesses; single load-store machine; Graphics processing units; Instruction sets; Memory management; Photonics; Transmission line matrix methods; coalesced memory; computer architectures; instruction set; photonics; shared memory;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Extreme Computing Conference (HPEC), 2013 IEEE
Conference_Location
Waltham, MA
Print_ISBN
978-1-4799-1364-0
Type
conf
DOI
10.1109/HPEC.2013.6670326
Filename
6670326
Link To Document