Abstract :
We propose a new method of retrieving multi-part, articulate objects from images and video. The scheme is particularly well suited for analyzing images and video for objects that can pose differently with possible shape deformation and articulated motion. The scheme involves computing an invariant signature for each segmented region in the image, in a manner that is insensitive to translation, rotation, scale, and shear. Using circular cross-correlation, these signatures can then be efficiently compared with that of user-defined regions of interest. Ambiguities between individual region matches are then resolved through relaxation labeling techniques. A final match is established when a collection of segmented regions conform to the query object, both in terms of local shape description and global structural relation. The scheme thus allows for articulated movement of object parts within the scene. The procedure is easy to implement, yet shows promising results in its ability to isolate interesting regions in images and video, to account for structural and relational constraints among regions, and to integrate both local shape and global structural information for a detailed examination of the scene in a way that is invariant to many visual variations.
Keywords :
image retrieval; image segmentation; multimedia databases; video databases; visual databases; articulate objects retrieval; articulated motion; articulated movement; circular cross-correlation; global structural information; global structural relation; image analysis; image segmentation; invariant signatures; local shape description; local shape information; multimedia archives; query object; relational constraints; shape deformation; structural constraints; video analysis; Computer science; Employment; Image analysis; Image motion analysis; Image retrieval; Image segmentation; Labeling; Layout; Motion analysis; Shape;