DocumentCode :
1414882
Title :
One billion transistors, one uniprocessor, one chip
Author :
Patt, Yale N. ; Patel, Sanjay J. ; Evers, Marius ; Friendly, Daniel H. ; Stark, Jared
Author_Institution :
Michigan Univ., MI, USA
Volume :
30
Issue :
9
fYear :
1997
fDate :
9/1/1997 12:00:00 AM
Firstpage :
51
Lastpage :
57
Abstract :
Billion-transistor processors will be much as they are today, just bigger, faster and wider (issuing more instructions at once). The authors describe the key problems (instruction supply, data memory supply and an implementable execution core) that prevent current superscalar computers from scaling up to 16- or 32-instructions per issue. They propose using out-of-order fetching, multi-hybrid branch predictors and trace caches to improve the instruction supply. They predict that replicated first-level caches, huge on-chip caches and data value speculation will enhance the data supply. To provide a high-speed, implementable execution core that is capable of sustaining the necessary instruction throughput, they advocate a large, out-of-order-issue instruction window (2,000 instructions), clustered (separated) banks of functional units and hierarchical scheduling of ready instructions. They contend that the current uniprocessor model can provide sufficient performance and use a billion transistors effectively without changing the programming model or discarding software compatibility
Keywords :
cache storage; microprocessor chips; multiprocessor interconnection networks; performance evaluation; technological forecasting; billion-transistor processors; clustered functional unit banks; data memory supply; data value speculation; hierarchical instruction scheduling; high-speed implementable execution core; implementable execution core; instruction supply; instruction throughput; instruction width; multi-hybrid branch predictors; multiprocessor system; on-chip caches; out-of-order fetching; out-of-order-issue instruction window; performance; programming model; replicated first-level caches; scalability; software compatibility; superscalar computers; trace caches; uniprocessor chip; Bandwidth; Delay; Design optimization; Dynamic scheduling; Engines; High performance computing; History; Logic; Multithreading; Optimizing compilers;
fLanguage :
English
Journal_Title :
Computer
Publisher :
ieee
ISSN :
0018-9162
Type :
jour
DOI :
10.1109/2.612249
Filename :
612249
Link To Document :
بازگشت