DocumentCode :
2721082
Title :
Fast, recurrent, attentional modulation improves saliency representation and scene recognition
Author :
Shi, Xun ; Bruce, Neil D B ; Tsotsos, John K.
Author_Institution :
Dept. of Comput. Sci. & Eng., York Univ., Toronto, ON, Canada
fYear :
2011
fDate :
20-25 June 2011
Firstpage :
1
Lastpage :
8
Abstract :
The human brain uses visual attention to facilitate object recognition. Traditional theories and models envision this attentional mechanism either in a pure feedforward fashion for selection of regions of interest or in a top-down task-priming fashion. To these well-known attentional mechanisms, we add here an additional novel one. The approach is inspired by studies of biological vision pertaining to the asynchronous timing of feedforward signals among different early visual areas and the role of recurrent connections from short latency areas to facilitate object recognition. It is suggested that recurrence elicited from these short latency dorsal areas improves the slower feedforward processing in the early ventral areas. We therefore propose a computational model that simulates this process. To test this model, we add such fast recurrent processes to a well-known model of feedforward saliency, AIM and show that those recurrent signals can modulate the output of AIM to improve its utility in recognition by later stages. We further add the proposed model to a back-propagation neural network for the task of scene recognition. Experimental results on standard video sequences show that the discriminating power of the modulated representation is significantly improved, and the implementation consistently outperforms existing work including a benchmark system that does not include recurrent refinement.
Keywords :
backpropagation; image sequences; neural nets; object recognition; video signal processing; AIM; attentional mechanism; back-propagation neural network; biological vision; feedforward processing; object recognition; saliency representation; scene recognition; video sequences; Computational modeling; Feedforward neural networks; Modulation; Neurons; Object recognition; Spatiotemporal phenomena; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition Workshops (CVPRW), 2011 IEEE Computer Society Conference on
Conference_Location :
Colorado Springs, CO
ISSN :
2160-7508
Print_ISBN :
978-1-4577-0529-8
Type :
conf
DOI :
10.1109/CVPRW.2011.5981745
Filename :
5981745
Link To Document :
بازگشت