Abstract :
In order to recognize multi-class vehicles, traditional methods are typically based on license plates and frontal images of vehicles. These methods rely heavily on specific datasets and thus are not applicable in real-world tasks. In this paper, we propose a novel method based on a hierarchical model, HMAX, which simulates visual architecture of primates for object recognition. It can extract features of shift-invariance and scale-invariance by Gabor filtering, template matching, and max pooling. In particular, we adopt a model of saliency-based visual attention to detect salient patches for template matching, also we drop inefficient features via an all-pairs linear SVM. During experiments, high accuracy and great efficiency are achieved on a dataset which has 31 types and over 1400 vehicle images with varying scales, orientations, and colors. With comparisons with Original-HMAX, Salient-HMAX, and Sifted-HMAX model, our method achieves classifying accuracy at 92% and time for each image at around 1.5s, while reduces 73% of the time consumed by original HMAX model.
Keywords :
Gabor filters; feature extraction; image classification; image colour analysis; object recognition; pattern matching; support vector machines; Gabor filtering; HMAX; all-pairs linear SVM; classifying accuracy; feature extraction; hierarchical model; max pooling; object recognition; original-HMAX model; primate visual architecture simulation; saliency-based visual attention; salient patch detection; salient-HMAX model; scale-invariance features; shift-invariance features; sifted-HMAX model; template matching; vehicle image color; vehicle image orientation; vehicle image scale; vehicle recognition; Accuracy; Computational modeling; Computer architecture; Feature extraction; Support vector machines; Vehicles; Visualization; Feature Selection; Hierarchical Architecture; Multi-class Vehicle Classification; Visual Saliency;