DocumentCode :
1127868
Title :
Optimization Strategies for High-Performance Computing of Optical-Flow in General-Purpose Processors
Author :
Anguita, Mancia ; Díaz, Javier ; Ros, Eduardo ; Fernández-Baldomero, F. Javier
Author_Institution :
Dept. of Comput. Archit. & Technol., Univ. of Granada, Granada, Spain
Volume :
19
Issue :
10
fYear :
2009
Firstpage :
1475
Lastpage :
1488
Abstract :
In this paper, we describe the high-performance implementation of an optical-flow algorithm that takes advantage of the processor's architecture. Tuning the code, i.e., adapting it to take full advantage of the processor, is challenging, time consuming, and requires efficient programming at different levels, but it can lead to significant improvements in performance. The optimized implementation presented here is of interest for a number of applications, since it delivers real-time motion estimation at high image resolution on a PC or in an embedded system based on a general-purpose processor. On a 2.83 GHz Core 2 Quad PC, it achieves a speedup of 14 compared to our first code version and 2052.7 f/s for the well-known 252 × 316 Yosemite sequence, and a speedup of 17.6 and 68.5 f/s for a 1016 × 1280 sequence. The description of how this high performance is achieved goes beyond a specific application, since the paper illustrates how inherently dense, low-level visual algorithms (pixel-wise computation) can be structured and improved to take full advantage of a standard processor. The implementation is compared with other hardware (based on FPGAs and GPUs) and software (based on clusters, PCs, and special-purpose processors) optical-flow implementations, showing that it outperforms them.
Keywords :
embedded systems; image resolution; image sequences; motion estimation; optimisation; parallel architectures; Core 2 Quad PC; embedded system; general-purpose processor architecture; high-image resolution; high-performance computing; low-level visual algorithm; optical-flow algorithm; optimization strategy; real-time motion estimation; Code optimization; image-motion analysis; motion estimation; parallel architectures; shared-memory systems;
fLanguage :
English
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
Publisher :
IEEE
ISSN :
1051-8215
Type :
jour
DOI :
10.1109/TCSVT.2009.2026821
Filename :
5159425