DocumentCode :
3345060
Title :
LcVc: Low-complexity vector-core for executing scalar/vector instructions
Author :
Soliman, Mostafa I.
Author_Institution :
Electr. Eng. Dept., South Valley Univ., Aswan, Egypt
fYear :
2011
fDate :
27-28 Dec. 2011
Firstpage :
19
Lastpage :
24
Abstract :
This paper proposes a low-complexity vector-core called LcVc for executing both scalar and vector instructions on the same execution datapath. A unified register file in the decode stage is used for storing both scalar operands and vector elements. The execution stage accepts a new set of operands each cycle and produces a new result. Rather than issuing vector instruction (1-D operations) as a whole, each vector operation is issued sequentially with the existing scalar issue hardware. All loads and stores of registers take place from the data cache in the memory access stage in a rate of one element per clock cycle. The hardware required to support the enhanced vector capability is insignificant (few incrementers and multiplexers), which results in reducing the area per core and increasing the number of cores available in a given chip area. Three key features distinguish LcVc architecture: a unified ISA to scalar and vector processing, low cost, and simplicity of organization. The use of LcVc approximately doubles the speedup of executing vector/matrix kernels such as vector addition, vector scaling, SAXPY, dot-product, matrix-vector multiplication, and matrix-matrix multiplication.
Keywords :
cache storage; computational complexity; computer architecture; matrix multiplication; multiprocessing systems; 1D operations; LcVc architecture; SAXPY; data cache; data-level parallelism; dot-product; low-complexity vector-core; matrix kernels; matrix-matrix multiplication; matrix-vector multiplication; memory access stage; multicore processor; scalar instruction execution; scalar operands; unified register file; vector addition; vector capability; vector elements; vector instruction execution; vector kernels; vector scaling; data-level parallelism; multi-core processor; pipelining; unified ISA; unified register fil; vector processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Engineering Conference (ICENCO), 2011 Seventh International
Conference_Location :
Giza
Print_ISBN :
978-1-4673-0730-7
Type :
conf
DOI :
10.1109/ICENCO.2011.6153927
Filename :
6153927
Link To Document :
بازگشت