Title :
Learning object classes from image thumbnails through deep neural networks
Author :
Chen, Erkang ; Yang, Xiaokang ; Zha, Hongyuan ; Zhang, Rui ; Zhang, Wenjun
Author_Institution :
Inst. of Image Commun. & Inf. Process., Shanghai Jiao Tong Univ., Shanghai
fDate :
March 31 2008-April 4 2008
Abstract :
We propose a new approach for recognizing object classes which is based on the intuitive idea that human beings are able to perform the task well given only thumbnails (coarse scale version) of images. Unlike previous work which uses local image features at fine scales, our approach uses thumbnails directly, and captures their high-order correlations at coarse scales through deep multi-layer neural networks based on restricted Boltzmann machines. Specifically, the pretraining stage of such networks takes on the role of feature extraction. Experimental results show that the proposed approach is comparable to other state-of-the-art recognition methods in terms of accuracy. The merits of the proposed approach come from the simplicity of the workflow and the parallelizability of the implementation structure.
Keywords :
neural nets; object recognition; deep multi-layer neural networks; feature extraction; high-order correlations; image thumbnails; restricted Boltzmann machines; Computer vision; Data mining; Feature extraction; Humans; Image communication; Image recognition; Information processing; Multi-layer neural network; Neural networks; Robustness; Deep Neural Networks; High-order Correlations; Object Class Recognition; Thumbnail;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4517738