DocumentCode :
2726653
Title :
Low latency high throughput memory-processor interface
Author :
Wang, Qidong ; Guidotti, Daniel ; Lin, Fujiang ; Zhu, Guang ; Cui, Jie ; Wang, Qian ; Cao, Liqiang ; Ye, Tianchun ; Wan, Lixi
Author_Institution :
Inst. of Microelectron., Beijing, China
fYear :
2012
fDate :
May 29 2012-June 1 2012
Firstpage :
2098
Lastpage :
2105
Abstract :
Scaling to ExaFLOPS computing, or 100 times faster than the present version of the Fujitsu K-supercomputer, presents well known challenges, among which are power dissipation, memory capacity and access bandwidth, data locality and fault tolerance. The optimum Amdahl´s speed-up strategy is multi faceted, with greater memory bandwidth and lower access latency being generally recognized as areas to improve. To this end, evolutionary compute node architecture is considered based on a multichip interposer platform and a millimeter wave memory interface. The interposer serves as the compute node physical platform and wiring distribution layer connecting the chip multiprocessor (CMP) with on-interposer memory to an organic board. For example, the interposer may be composed of glass to reduce through-via parasitic and support one multi-GFLOPS CMP with sufficient on-interposer DRAM for balanced operation. The memory interface consists of dense arrays of millimeter waveguide with integrated mm wave transceivers and should support 40 Gb/s per channel for an aggregate throughput of 1 TB/s with estimated latency of 10-15 clock cycles. This paper examines channel impediments, design and construction. Data transmission on a 72 GHz carrier frequency and 12 Gb/s OOK modulation will be presented at the conference if available.
Keywords :
DRAM chips; fault tolerance; integrated circuit design; mainframes; microprocessor chips; millimetre wave integrated circuits; parallel machines; wiring; ExaFLOPS computing; Fujitsu K-supercomputer; OOK modulation; access bandwidth latency; bit rate 12 Gbit/s; bit rate 40 Gbit/s; byte rate 1 TByte/s; channel impediment; chip multiprocessor; compute node physical platform; data locality; data transmission; evolutionary compute node architecture; fault tolerance; frequency 72 GHz; integrated mmwave transceiver; interposer DRAM; low latency high throughput memory-processor interface; memory bandwidth; memory capacity; millimeter wave memory interface; millimeter waveguide; multi-GFLOPS CMP; multichip interposer platform; on-interposer memory; optimum Amdahl speed-up strategy; organic board; power dissipation; through-via parasitic reduction; wiring distribution layer; Bandwidth; Cutoff frequency; Millimeter wave propagation; Optical fiber dispersion; Optical waveguides; Propagation constant;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electronic Components and Technology Conference (ECTC), 2012 IEEE 62nd
Conference_Location :
San Diego, CA
ISSN :
0569-5503
Print_ISBN :
978-1-4673-1966-9
Electronic_ISBN :
0569-5503
Type :
conf
DOI :
10.1109/ECTC.2012.6249131
Filename :
6249131
Link To Document :
بازگشت