Title :
Sequential knowledge-driven scene recognition model
Author :
Chernyak, Dimitri A. ; Stark, Lawrence W.
Author_Institution :
VISX Inc, Santa Clara, CA, USA
Abstract :
Eye movements are an important aspect of human visual behavior. The temporal and space-variant nature of sampling a visual scene requires frequent attentional gaze shifts, saccades, to fixate onto different parts of an image. Experimental evidence suggests that fixations are often directed towards the most informative regions in the visual scene. We develop a model and its simulation that can select such regions based on prior knowledge of similar scenes. Having representations of scene categories as a probabilistic combination of hypothetical objects, i.e., prototypical regions with certain properties, it is possible to assess the likely contribution of each image region to the successive recognition process. Using conditional probabilities for each region given the scene category, the model can then predict its informative value and initiate a sequential spatial information-gathering algorithm, analogous to an eye movement saccade to a new fixation. This algorithm establishes the most likely scene category for a given image.
Keywords :
digital simulation; image recognition; image sampling; image segmentation; probability; conditional probabilities; eye movement saccade; eye movements; fixations; frequent attentional gaze shifts; human visual behavior; hypothetical objects; image region; informative regions; informative value; prior knowledge; probabilistic combination; prototypical regions; saccades; scene categories; scene category; segmentation techniques; sequential knowledge-driven scene recognition model; sequential spatial information-gathering algorithm; similar scenes; successive recognition process; temporal nature; visual scene; visual scene sampling; Humans; Image motion analysis; Image recognition; Image sampling; Image segmentation; Layout; Partitioning algorithms; Predictive models; Robustness; Shape;
Conference_Titel :
Computer Vision and Pattern Recognition, 2001. CVPR 2001. Proceedings of the 2001 IEEE Computer Society Conference on
Print_ISBN :
0-7695-1272-0
DOI :
10.1109/CVPR.2001.990986