مرکز منطقه ای اطلاع رساني علوم و فناوري - Multimodal pLSA on visual features and tags

DocumentCode :

2930402

Title :

Multimodal pLSA on visual features and tags

Author :

Romberg, Stefan ; Hörster, Eva ; Lienhart, Rainer

Author_Institution :

Multimedia Comput. Lab., Univ. of Augsburg, Augsburg, Germany

fYear :

2009

fDate :

June 28 2009-July 3 2009

Firstpage :

414

Lastpage :

417

Abstract :

This work studies a new approach for image retrieval on largescale community databases. Our proposed system explores two different modalities: visual features and community-generated metadata, such as tags. We use topic models to derive a high-level representation appropriate for retrieval for each of our images in the database. We evaluate the proposed approach experimentally in a query-by-example retrieval task and compare our results to systems relying solely on visual features or tag features. It is shown that the proposed multimodal system outperforms the unimodal systems by approximately 36%.

Keywords :

image representation; image retrieval; meta data; probability; visual databases; community-generated metadata; high-level representation; image retrieval; image tagging; largescale community database; multimodal pLSA; multimodal system; probabilistic latent semantic analysis; query-by-example retrieval task; topic model; visual feature; Feature extraction; Image databases; Image retrieval; Information retrieval; Large-scale systems; Multimedia computing; Spatial databases; Technical Activities Guide -TAG; Visual databases; Vocabulary; SIFT; image retrieval; multimodal pLSA; tags;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on

Conference_Location :

New York, NY

ISSN :

1945-7871

Print_ISBN :

978-1-4244-4290-4

Electronic_ISBN :

1945-7871

Type :

conf

DOI :

10.1109/ICME.2009.5202522

Filename :

5202522

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2930402