DocumentCode
2932448
Title
Multimodal image retrieval via bayesian information fusion
Author
Zhang, Rui ; Guan, Ling
Author_Institution
Ryerson Multimedia Res. Lab., Ryerson Univ., Toronto, ON, Canada
fYear
2009
fDate
June 28 2009-July 3 2009
Firstpage
830
Lastpage
833
Abstract
In this paper, a multimodal image retrieval framework integrating the information in both audio and visual domain via Bayesian decision level fusion is proposed. In both domains, a statistical model for each semantic class is learned. Based on the Bayes´ theorem, the a posteriori probability of each class given a query is calculated in the audio domain, which is propagated to the images classified into the corresponding semantic class in the visual domain. These probabilistic measures are utilized as the a priori probability in the overall framework, which is combined with the likelihood evaluated based on nearest neighbor content-based image retrieval. Through the Bayes´ theorem again, the images are ranked based on their a posteriori probabilities given the audio and visual feature of a query. To further improve the system, we also propose a relevance feedback scheme in the audio domain. Experimental results demonstrate the advantage of the proposed method over the retrieval simply based on visual features.
Keywords
Bayes methods; content-based retrieval; image classification; image retrieval; maximum likelihood estimation; Bayes theorem; Bayesian decision level fusion; Bayesian information fusion; a posteriori probability; audio domain; image classification; multimodal image retrieval; nearest neighbor content-based image retrieval; semantic class; statistical model; visual domain; Bayesian methods; Content based retrieval; Feedback; Humans; Image retrieval; Information retrieval; Multimedia databases; Natural languages; Nearest neighbor searches; Probability; Multimodal Bayesian image retrieval;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
Conference_Location
New York, NY
ISSN
1945-7871
Print_ISBN
978-1-4244-4290-4
Electronic_ISBN
1945-7871
Type
conf
DOI
10.1109/ICME.2009.5202623
Filename
5202623
Link To Document