DocumentCode :
638898
Title :
Real-time scene recognition on embedded system with SIFT keypoints and a new descriptor
Author :
Han Xiao ; Wenhao He ; Kui Yuan ; Feng Wen
Author_Institution :
Inst. of Autom., Beijing, China
fYear :
2013
fDate :
4-7 Aug. 2013
Firstpage :
1317
Lastpage :
1324
Abstract :
The vision system of a mobile robot has to interpret the environment in real time at low power. As a good algorithm for extracting information from images, SIFT (Scale Invariant Feature Transform) is widely used in computer vision. However, the high computational complexity makes it hard to achieve real-time performance of SIFT with pure software. This paper presents a machine vision system implementing the SIFT algorithm on an embedded image processing card, where real-time scene recognition is accomplished with low power consumption through the cooperation between an FPGA (Field Programmable Gate Array) and a DSP (Digital Signal Processor) chip. The original SIFT keypoint detection algorithm is adapted for parallel computation and implemented with a hardware pipeline in the FPGA. Although our current system is designed for 360×288 video frames, this pipelined architecture can be applied to images with arbitrary resolution. Meanwhile, the original 128-dimensional SIFT descriptor is replaced by an 18-dimensional new descriptor which can be generated more efficiently and can be matched according to an absolute distance threshold with the distance defined by infinity-norm. On this basis, a five-branch-tree data structure is designed for fast searching and matching of descriptors, and robust scene recognition is realized through the combination of keypoints. Since our new descriptor allows one keypoint to be matched to several keypoints, which is a distinct property from the original SIFT algorithm, our system can recognize multiple images with overlapping contents simultaneously. In addition, compared with traditional work that needs off-line training, our system can perform fast on-line learning, which is a desirable property for mobile robots.
Keywords :
computational complexity; digital signal processing chips; embedded systems; field programmable gate arrays; image recognition; low-power electronics; mobile robots; natural scenes; parallel architectures; pipeline processing; robot vision; transforms; tree data structures; video signal processing; 18-dimensional descriptor; DSP chip; FPGA; SIFT algorithm; SIFT keypoint detection algorithm; absolute distance threshold; computational complexity; computer vision; digital signal processor; embedded image processing card; embedded system; fast descriptor matching; fast descriptor searching; fast online learning; field programmable gate array; five-branch-tree data structure; hardware pipeline; information extraction; machine vision system; mobile robot vision system; parallel computation; pipelined architecture; real-time performance; real-time scene recognition; robust scene recognition; scale invariant feature transform; video frames; Computer architecture; Feature extraction; Field programmable gate arrays; Filtering; Hardware; Real-time systems; Standards; Descriptor; FPGA; Image Recognition; Real-time; SIFT;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Mechatronics and Automation (ICMA), 2013 IEEE International Conference on
Conference_Location :
Takamatsu
Print_ISBN :
978-1-4673-5557-5
Type :
conf
DOI :
10.1109/ICMA.2013.6618104
Filename :
6618104
Link To Document :
بازگشت