DocumentCode :
124004
Title :
Scalable parallel architecture for singular value decomposition of large matrices
Author :
Martinez-Corral, Unai ; Basterretxea, Koldo ; Finker, Raul
Author_Institution :
Grupo de Diseno en Electron. Digital (GDED), Univ. of the Basque Country (UPV/EHU), Bilbao, Spain
fYear :
2014
fDate :
2-4 Sept. 2014
Firstpage :
1
Lastpage :
4
Abstract :
Singular Value Decomposition (SVD) is a key linear algebraic operation in many scientific and engineering applications, many of them involving high dimensionality datasets and real-time response. In this paper we describe a scalable parallel processing architecture for accelerating the SVD of large m × n matrices. Based on a linear array of simple processing-units (PUs), the proposed architecture follows a double data-flow paradigm (FIFO memories and a shared-bus) for optimizing the time spent in data transferences. The PUs, which perform elemental column-pair evaluations and rotations, have been designed for an efficient utilization of available FPGA resources and to achieve maximum algorithm speed-ups. The architecture is fully scalable from a two-PU scheme to an arrangement with as many as n/2 PUs. This allows for a trade-off between occupied area and processing acceleration in the final implementation, and permits the SVD processor to be implemented both on low-cost and high-end FPGAs. The system has been prototyped on Spartan-6 and Kintex-7 devices for performance comparison.
Keywords :
field programmable gate arrays; logic design; matrix algebra; parallel architectures; singular value decomposition; FIFO memories; FPGA resources; Kintex-7 devices; SVD; Spartan-6 devices; double data-flow paradigm; elemental column-pair evaluations; large matrices; linear algebraic operation; linear array; scalable parallel processing architecture; shared-bus; simple processing-units; singular value decomposition; two-PU scheme; Algorithm design and analysis; Arrays; Field programmable gate arrays; Jacobian matrices; Matrix decomposition; Singular value decomposition; CORDIC; FPGA; Singular Value Decomposition; adaptive threshold; co-processor; scalable architecture;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Field Programmable Logic and Applications (FPL), 2014 24th International Conference on
Conference_Location :
Munich
Type :
conf
DOI :
10.1109/FPL.2014.6927393
Filename :
6927393
Link To Document :
بازگشت