Title :
Hierarchical and modular attention
Author_Institution :
Dept. of Comput. Sci., George Mason Univ., Fairfax, VA, USA
Abstract :
The flow of visual input reaching the eye consists of huge amounts of time-varying information. It is crucial for both biological vision and automated systems to perceive and comprehend such a constantly changing environment within a relatively short processing time. To cope with such a computational challenge, one should locate and analyze only the information relevant to the current task by quickly focusing on selected areas of the scene as needed. Attention makes perception computationally tractable and helps with tasks such as object recognition. Attention permeates the whole stream of visual computation, it is both hierarchical and modular, and it involves representations, processing and strategies. Attentional mechanisms are intimately related to adaptation processes, and high-level attention corresponds to competitive, functional and learned behavioral programs. Attention consists of both data- and model-driven processes and their relationships, and it covers several levels such as sensory, reactive and behavioral processes. An example of how attention can be implemented considers time-varying imagery and it shows how functional linked pyramids and zoom lens operations lead to the generation of visual saccades. Both the time-varying imagery and the corresponding recognition memory are organized as pyramids and uniform indexing and classification interfaces using an attention pyramid are established. This paper concludes with a discussion on promising venues for future research that are most likely to enhance our understanding of attentional mechanisms
Keywords :
active vision; lenses; object recognition; visual perception; adaptation processes; attention focusing; automated vision systems; behavioral programs; biological vision; classification interfaces; computational tractability; data-driven processes; functional linked pyramids; hierarchical attention; indexing interfaces; model-driven processes; modular attention; object recognition; perception; relevant information; time-varying information; visual computation; visual input flow; visual representations; visual saccades; zoom lens operations; Application specific processors; Biology computing; Concurrent computing; Data acquisition; Eyes; Image recognition; Information analysis; Layout; Machine vision; Object recognition;
Conference_Titel :
Computer Architectures for Machine Perception, 1995. Proceedings. CAMP '95
Conference_Location :
Como
Print_ISBN :
0-8186-7134-3
DOI :
10.1109/CAMP.1995.521044