Title :
Image Annotation Within the Context of Personal Photo Collections Using Hierarchical Event and Scene Models
Author :
Cao, Liangliang ; Luo, Jiebo ; Kautz, Henry ; Huang, Thomas S.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Illinois at Urbana-Champaign, Urbana, IL
Abstract :
Most image annotation systems consider a single photo at a time and label photos individually. In this work, we focus on collections of personal photos and exploit the contextual information naturally implied by the associated GPS and time metadata. First, we employ a constrained clustering method to partition a photo collection into event-based subcollections, considering that the GPS records may be partly missing (a practical issue). We then use conditional random field (CRF) models to exploit the correlation between photos based on 1) time-location constraints and 2) the relationship between collection-level annotation (i.e., events) and image-level annotation (i.e., scenes). With the introduction of such a multilevel annotation hierarchy, our system addresses the problem of annotating consumer photo collections that requires a more hierarchical description of the customers´ activities than do the simpler image annotation tasks. The efficacy of the proposed system is validated by extensive evaluation using a sizable geotagged personal photo collection database, which consists of over 100 photo collections and is manually labeled for 12 events and 12 scenes to create ground truth.
Keywords :
Global Positioning System; image retrieval; image segmentation; information retrieval systems; ontologies (artificial intelligence); pattern clustering; random processes; GPS; clustering method; conditional random field model; event-based subcollection; geotagged personal photo collection database; hierarchical event model; image retrieval system; image-level annotation system; ontology method; scene model; time-location constraint; Clustering methods; Computer vision; Context modeling; Global Positioning System; Image databases; Image retrieval; Information retrieval; Lakes; Layout; Multimedia systems; CRF; Consumer photo collections; GPS; scene and event annotation;
Journal_Title :
Multimedia, IEEE Transactions on
DOI :
10.1109/TMM.2008.2009693