Title :
An FPGA Implementation of Information Theoretic Visual-Saliency System and Its Optimization
Author :
Bae, Sungmin ; Cho, Yong Cheol Peter ; Park, Sungho ; Irick, Kevin M. ; Jin, Yongseok ; Narayanan, Vijaykrishnan
Author_Institution :
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
Abstract :
Biological vision systems use saliency-based visual attention mechanisms to limit higher-level vision processing on the most visually-salient subsets of an input image. Among several computational models that capture the visual-saliency in biological system, an information theoretic AIM(Attention based on Information Maximization) algorithm has been demonstrated to predict human gaze patterns better than other existing models. We present an FPGA based implementation of this computationally intensive AIM algorithm to support embedded vision applications. Our implementation provides performance of processing about 4M pixels/sec for 25 basis functions with a convolution kernel size of 21 by 21 for each of the R, G, and B color-channels, when implemented on a Virtex-6 LX240T. We also provide an optimization aimed at controlling the trade-off between power consumption and latency, and performance comparisons with a GPU implementation.
Keywords :
computer vision; field programmable gate arrays; information theory; FPGA; Virtex-6 LX240T; attention based on information maximization algorithm; biological vision system; embedded vision; human gaze pattern prediction; information theoretic visual-saliency system; power consumption; saliency-based visual attention mechanism; Computational modeling; Computer architecture; Histograms; Kernel; Niobium; Pixel; Power demand; AIM algorithm; Biological vision systems; FPGA; GPU; Visual saliency system;
Conference_Titel :
Field-Programmable Custom Computing Machines (FCCM), 2011 IEEE 19th Annual International Symposium on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
978-1-61284-277-6
Electronic_ISBN :
978-0-7695-4301-7
DOI :
10.1109/FCCM.2011.41