مرکز منطقه ای اطلاع رساني علوم و فناوري - LcVc: Low-complexity vector-core for executing scalar/vector instructions

DocumentCode :

3345060

Title :

LcVc: Low-complexity vector-core for executing scalar/vector instructions

Author :

Soliman, Mostafa I.

Author_Institution :

Electr. Eng. Dept., South Valley Univ., Aswan, Egypt

fYear :

2011

fDate :

27-28 Dec. 2011

Firstpage :

Lastpage :

Abstract :

This paper proposes a low-complexity vector-core called LcVc for executing both scalar and vector instructions on the same execution datapath. A unified register file in the decode stage is used for storing both scalar operands and vector elements. The execution stage accepts a new set of operands each cycle and produces a new result. Rather than issuing vector instruction (1-D operations) as a whole, each vector operation is issued sequentially with the existing scalar issue hardware. All loads and stores of registers take place from the data cache in the memory access stage in a rate of one element per clock cycle. The hardware required to support the enhanced vector capability is insignificant (few incrementers and multiplexers), which results in reducing the area per core and increasing the number of cores available in a given chip area. Three key features distinguish LcVc architecture: a unified ISA to scalar and vector processing, low cost, and simplicity of organization. The use of LcVc approximately doubles the speedup of executing vector/matrix kernels such as vector addition, vector scaling, SAXPY, dot-product, matrix-vector multiplication, and matrix-matrix multiplication.

Keywords :

cache storage; computational complexity; computer architecture; matrix multiplication; multiprocessing systems; 1D operations; LcVc architecture; SAXPY; data cache; data-level parallelism; dot-product; low-complexity vector-core; matrix kernels; matrix-matrix multiplication; matrix-vector multiplication; memory access stage; multicore processor; scalar instruction execution; scalar operands; unified register file; vector addition; vector capability; vector elements; vector instruction execution; vector kernels; vector scaling; data-level parallelism; multi-core processor; pipelining; unified ISA; unified register fil; vector processing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Engineering Conference (ICENCO), 2011 Seventh International

Conference_Location :

Giza

Print_ISBN :

978-1-4673-0730-7

Type :

conf

DOI :

10.1109/ICENCO.2011.6153927

Filename :

6153927

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3345060