DocumentCode :
2697044
Title :
A visual language model for estimating object pose and structure in a generative visual domain
Author :
Narayanaswamy, Siddharth ; Barbu, Andrei ; Siskind, Jeffrey Mark
Author_Institution :
Sch. of Electr. & Comput. Eng., Purdue Univ., West Lafayette, IN, USA
fYear :
2011
fDate :
9-13 May 2011
Firstpage :
4854
Lastpage :
4860
Abstract :
We present a generative domain of visual objects by analogy to the generative nature of human language. Just as small inventories of phonemes and words combine in a grammatical fashion to yield myriad valid words and utterances, a small inventory of physical parts combine in a grammatical fashion to yield myriad valid assemblies. We apply the notion of a language model from speech recognition to this visual domain to similarly improve the performance of the recognition process over what would be possible by only applying recognizers to the components. Unlike the context-free models for human language, our visual language models are context sensitive and formulated as stochastic constraint-satisfaction problems. And unlike the situation for human language where all components are observable, our methods deal with occlusion, successfully recovering object structure despite unobservable components. We demonstrate our system with an integrated robotic system for disassembling structures that performs whole-scene reconstruction consistent with a language model in the presence of noisy feature detectors.
Keywords :
pose estimation; robot vision; stochastic processes; generative visual domain; integrated robotic system; noisy feature detector; object pose estimation; speech recognition; stochastic constraint-satisfaction problem; visual language model; Assembly; Estimation; Grammar; Image edge detection; Image segmentation; Random variables; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Robotics and Automation (ICRA), 2011 IEEE International Conference on
Conference_Location :
Shanghai
ISSN :
1050-4729
Print_ISBN :
978-1-61284-386-5
Type :
conf
DOI :
10.1109/ICRA.2011.5980161
Filename :
5980161
Link To Document :
بازگشت