Title :
An 86 mW 98GOPS ANN-Searching Processor for Full-HD 30 fps Video Object Recognition With Zeroless Locality-Sensitive Hashing
Author :
Gyeonghoon Kim ; Jinwook Oh ; Seungjin Lee ; Hoi-Jun Yoo
Author_Institution :
Dept. of Electr. Eng., KAIST, Dae-jeon, South Korea
Abstract :
Approximate nearest neighbor (ANN) searching is an essential task in object recognition. The ANN-searching stage, however, is the main bottleneck in the object recognition process due to increasing database size and massive dimensions of keypoint descriptors. In this paper, a high throughput ANN-searching processor is proposed for high-resolution (full-HD) and real-time (30 fps) video object recognition. The proposed ANN-searching processor adopts an interframe cache architecture as a hardware-oriented approach and a zeroless locality-sensitive-hashing (zeroless-LSH) algorithm as a software-oriented approach to reduce the external memory bandwidth required in nearest neighbor searching. A four-way set associative on-chip cache has a dedicated architecture to exploit data correlation at the frame-level. Zeroless-LSH minimizes data transactions from external memory at the vector-level. The proposed ANN-searching processor is fabricated as part of an object recognition SoC using a 0.13 μm 6 metal CMOS technology. It achieves 62 720 vectors/s throughput and 1140 GOPS/W power efficiency, which are 1.45 and 1.37 times higher than the state-of-the-art, respectively, enabling real-time object recognition for full-HD 30 fps video streams.
Keywords :
CMOS memory circuits; approximation theory; cache storage; cryptography; image resolution; object recognition; object-oriented programming; search problems; system-on-chip; video coding; video streaming; visual databases; ANN searching processor; approximate nearest neighbor; data transactions; database size; external memory; external memory bandwidth; full HD; hardware oriented approach; interframe cache architecture; keypoint descriptors; massive dimensions; metal CMOS technology; nearest neighbor searching; object recognition process; power 86 mW; size 0.13 mum; video object recognition; video streams; zeroless locality sensitive hashing; zeroless-LSH; zeroless-LSH algorithm; Indexes; Object recognition; Search problems; System-on-chip; Throughput; Vectors; ANN searching; Approximate nearest neighbor (ANN) searching processor; interframe cache architecture; locality-sensitive-hashing (LSH); zeroless-LSH;
Journal_Title :
Solid-State Circuits, IEEE Journal of
DOI :
10.1109/JSSC.2013.2253220