• DocumentCode
    2721082
  • Title

    Fast, recurrent, attentional modulation improves saliency representation and scene recognition

  • Author

    Shi, Xun ; Bruce, Neil D B ; Tsotsos, John K.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., York Univ., Toronto, ON, Canada
  • fYear
    2011
  • fDate
    20-25 June 2011
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    The human brain uses visual attention to facilitate object recognition. Traditional theories and models envision this attentional mechanism either in a pure feedforward fashion for selection of regions of interest or in a top-down task-priming fashion. To these well-known attentional mechanisms, we add here an additional novel one. The approach is inspired by studies of biological vision pertaining to the asynchronous timing of feedforward signals among different early visual areas and the role of recurrent connections from short latency areas to facilitate object recognition. It is suggested that recurrence elicited from these short latency dorsal areas improves the slower feedforward processing in the early ventral areas. We therefore propose a computational model that simulates this process. To test this model, we add such fast recurrent processes to a well-known model of feedforward saliency, AIM and show that those recurrent signals can modulate the output of AIM to improve its utility in recognition by later stages. We further add the proposed model to a back-propagation neural network for the task of scene recognition. Experimental results on standard video sequences show that the discriminating power of the modulated representation is significantly improved, and the implementation consistently outperforms existing work including a benchmark system that does not include recurrent refinement.
  • Keywords
    backpropagation; image sequences; neural nets; object recognition; video signal processing; AIM; attentional mechanism; back-propagation neural network; biological vision; feedforward processing; object recognition; saliency representation; scene recognition; video sequences; Computational modeling; Feedforward neural networks; Modulation; Neurons; Object recognition; Spatiotemporal phenomena; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition Workshops (CVPRW), 2011 IEEE Computer Society Conference on
  • Conference_Location
    Colorado Springs, CO
  • ISSN
    2160-7508
  • Print_ISBN
    978-1-4577-0529-8
  • Type

    conf

  • DOI
    10.1109/CVPRW.2011.5981745
  • Filename
    5981745