DocumentCode
425399
Title
Interpretation of Complex Scenes Using Generative Dynamic-Structure Models
Author
Todorovic, Sinisa ; Nechyba, Michael C.
Author_Institution
University of Florida, Gainesville, FL
fYear
2004
fDate
27-02 June 2004
Firstpage
184
Lastpage
184
Abstract
We propose a generative modeling framework - namely, Dynamic Tree Structured Belief Networks (DTSBNs) and a novel Structured Variational Approximation (SVA) inference algorithm for DTSBNs - as a viable solution to object recognition in images with partially occluded object appearances. We show that it is possible to assign physical meaning to DTSBN structures, such that root nodes model whole objects, while parent-child connections encode component-subcomponent relationships. Therefore, within the DTSBN framework, the treatment and recognition of object parts requires no additional training, but merely a particular interpretation of the tree/subtree structure. As such, DTSBNs naturally allow for multi-stage object recognition, in which initial recognition of object parts induces recognition of objects as a whole. As our reported experiments show, this explicit, multi-stage treatment of occlusion outperforms more traditional object-recognition approaches, which typically fail to account for occlusion in any principled or unified manner.
Keywords
Approximation algorithms; Cameras; Image sequences; Inference algorithms; Layout; Lighting; Object detection; Object recognition; Uncertainty; Voting;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition Workshop, 2004. CVPRW '04. Conference on
Type
conf
DOI
10.1109/CVPR.2004.98
Filename
1384984
Link To Document