DocumentCode :
3420107
Title :
Regionlets for Generic Object Detection
Author :
Xiaoyu Wang ; Ming Yang ; Shenghuo Zhu ; Yuanqing Lin
Author_Institution :
NEC Labs. America, Princeton, NJ, USA
fYear :
2013
fDate :
1-8 Dec. 2013
Firstpage :
17
Lastpage :
24
Abstract :
Generic object detection is confronted by dealing with different degrees of variations in distinct object classes with tractable computations, which demands for descriptive and flexible object representations that are also efficient to evaluate for many locations. In view of this, we propose to model an object class by a cascaded boosting classifier which integrates various types of features from competing local regions, named as region lets. A region let is a base feature extraction region defined proportionally to a detection window at an arbitrary resolution (i.e. size and aspect ratio). These region lets are organized in small groups with stable relative positions to delineate fine grained spatial layouts inside objects. Their features are aggregated to a one-dimensional feature within one group so as to tolerate deformations. Then we evaluate the object bounding box proposal in selective search from segmentation cues, limiting the evaluation locations to thousands. Our approach significantly outperforms the state-of-the-art on popular multi-class detection benchmark datasets with a single method, without any contexts. It achieves the detection mean average precision of 41.7% on the PASCAL VOC 2007 dataset and 39.7% on the VOC 2010 for 20 object categories. It achieves 14.7% mean average precision on the Image Net dataset for 200 object categories, outperforming the latest deformable part-based model (DPM) by 4.7%.
Keywords :
feature extraction; image classification; image representation; object detection; visual databases; DPM; ImageNet dataset; PASCAL VOC 2007 dataset; VOC 2010; arbitrary resolution; cascaded boosting classifier; deformable part-based model; descriptive object representations; detection mean average precision; detection window; feature extraction region; fine-grained spatial layouts; flexible object representations; generic object detection; local regions; multiclass detection benchmark datasets; object bounding box proposal; object categories; object classes; one-dimensional feature; regionlets; segmentation cues; tractable computations; Boosting; Deformable models; Feature extraction; Layout; Object detection; Prototypes; Search problems; DPM; Deformation; Detection; ImageNet; Object Detection; PASCAL; Regionlet; Subcategory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision (ICCV), 2013 IEEE International Conference on
Conference_Location :
Sydney, NSW
ISSN :
1550-5499
Type :
conf
DOI :
10.1109/ICCV.2013.10
Filename :
6751111
Link To Document :
بازگشت