DocumentCode :
1786275
Title :
A VLIW-Vector co-processor design for accelerating Basic Linear Algebraic Operations in OpenCV
Author :
Puppala, Venkata Ganapathi
Author_Institution :
QLogic India Pvt Ltd., India
fYear :
2014
fDate :
16-18 July 2014
Firstpage :
1
Lastpage :
6
Abstract :
OpenCV is a widely used computer vision library written in C++. Basic Linear Algebraic Operations (BLAOP) involving matrices are at the heart of OpenCV. Though OpenCV provides ubiquity in the computer vision field, it runs slow when ported on embedded processors. Accelerating the LAOPs using a co-processor certainly helps improving the throughput. In this paper we present a floating point VLIW-Vector Co-processor Architecture with Vector Floating Point Datapath (VFPDP) and a 4-slot VLIW processor core to accelerate BLAOps achieving performance of two GFLOPS when run at 500MHz clock frequency. We also demonstrate a detailed mapping strategy of One sided Jacobi Singular Value Decomposition (OJSVD) algorithm onto the proposed architecture. The proposed architecture is designed using Verilog HDL and it is synthesized using Synopsis Design Compiler with 28nm TSMC target libraries. The clock period is set to 2ns and the timing constraints are met. Using the Altera´s SOPC builder, an experimental system is created with the co-processor interfaced to the NIOS II soft processor and implemented in Cyclone IV FPGA. The OJSVD algorithm is ported onto both the standalone NIOS II processor based system and the system with the proposed co-processor. The results show that 15X performance improvement achieved with this co-processor.
Keywords :
C++ language; computer vision; coprocessors; embedded systems; field programmable gate arrays; hardware description languages; linear algebra; singular value decomposition; BLAOP; C++; Cyclone IV FPGA; NIOS II soft processor; OJSVD algorithm; OpenCV; SOPC builder; TSMC target library; VFPDP; VLIW processor core; VLIW-vector coprocessor design; Verilog HDL; basic linear algebraic operation; clock period; computer vision field; computer vision library; embedded processor; floating point VLIW-vector coprocessor architecture; mapping strategy; one sided jacobi singular value decomposition algorithm; standalone NIOS II processor based system; synopsis design compiler; timing constraint; vector floating point datapath; Computer architecture; Jacobian matrices; Matrix decomposition; Program processors; Registers; VLIW; Vectors; Computer Vision; Fused MAC; OpenCV; VLIW; Vector Floating Point Datapath;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
VLSI Design and Test, 18th International Symposium on
Conference_Location :
Coimbatore
Print_ISBN :
978-1-4799-5088-1
Type :
conf
DOI :
10.1109/ISVDAT.2014.6881085
Filename :
6881085
Link To Document :
بازگشت