Title :
An Efficient Parallel Architecture for Matrix Computations
Author :
Pedram, Ardavan ; Daneshtalab, Masoud ; Fakhraie, Sied Mehdi
Author_Institution :
Sch. of Electr. & Comput. Eng., Tehran Univ.
Abstract :
This paper introduces a new versatile and high-performance parallel hardware engine for matrix computations based on distributed memory. The proposed architecture reduces memory bandwidth by taking advantage of data redundancies. The core computes matrix power, multiplication, and inversion. The matrix power presented in this paper is mathematically proven to be two times faster than normal computations, scalable and just needs two input elements of the input matrix per each clock cycle. As well, the design is optimized to suitably perform least square computations in signal processing applications. The synthesis results on FPGA platforms indicate that the proposed architecture can operate in 75 MHz for 16 bit word length and the peak attained performance is about 2400 MMAC operations with 32 concurrent MAC modules
Keywords :
access protocols; field programmable gate arrays; least squares approximations; matrix algebra; parallel architectures; signal processing; 16 bit; 75 MHz; FPGA platforms; MAC modules; MMAC operations; data redundancies; distributed memory; high-performance parallel hardware design; least square computations; matrix computations; matrix power; parallel architecture; signal processing; Bandwidth; Clocks; Computer architecture; Concurrent computing; Distributed computing; Engines; Hardware; Memory architecture; Parallel architectures; Signal design;
Conference_Titel :
Norchip Conference, 2006. 24th
Conference_Location :
Linkoping
Print_ISBN :
1-4244-0772-9
DOI :
10.1109/NORCHP.2006.329271