Author_Institution :
Key Lab. of Intell. Inf. Process. of Chinese Acad. of Sci. (CAS), Inst. of Comput. Technol., Beijing, China
Abstract :
Our goal is to boost text-based image search results via image reranking. There are diverse modalities (features) of images that we can leverage for reranking, however, the effects of different modalities are query-dependent. The primary challenge we face is how to fuse multiple modalities adaptively for different queries, which has often been overlooked in previous reranking research. Moreover, multimodality fusion without an understanding of the query is risky, and may lead to incorrect judgment in reranking. Therefore, to obtain the best fusion weights for the query, in this paper, we leverage click-through data, which can be viewed as an “implicit” user feedback and an effective means of understanding the query. A novel reranking algorithm, called click-based relevance feedback, is proposed. This algorithm emphasizes the successful use of click-through data for identifying user search intention, while leveraging multiple kernel learning algorithm to adaptively learn the query-dependent fusion weights for multiple modalities. We conduct experiments on a real-world data set collected from a commercial search engine with click-through data. Encouraging experimental results demonstrate that our proposed reranking approach can significantly improve the NDCG@10 of the initial search results by 11.62%, and can outperform several existing approaches for most kinds of queries, such as tail, middle, and top queries.
Keywords :
image fusion; image retrieval; learning (artificial intelligence); search engines; click-through data; commercial search engine; implicit user feedback; multimodality fusion; multiple kernel learning algorithm; novel image search reranking algorithm; query-dependent click-based relevance feedback; query-dependent fusion weights; text-based image search results; user search intention; Data mining; Equations; Fuses; Kernel; Search engines; Support vector machines; Visualization; Image search; click-based relevance feedback; multiple kernel learning; search re-ranking;