• DocumentCode
    721058
  • Title

    Multi2Rank: Multimedia Multiview Ranking

  • Author

    Etter, David ; Domeniconi, Carlotta

  • fYear
    2015
  • fDate
    20-22 April 2015
  • Firstpage
    80
  • Lastpage
    87
  • Abstract
    Multimedia retrieval is a search and ranking task defined over multiple modalities. These modalities include speech, image, and text, which provide different views of the multimedia object. Queries to a multimedia retrieval system often take the form of a text only query and return a ranked result set which combines these multiple views. The text only query includes multiple phrases which identify features of a specific view. This multiview problem presents a challenge in mapping these phrases into the correct view feature space. A second challenge for the multimedia retrieval system is in building a ranking model which considers the unique feature space of each view. In this paper, we propose a hierarchical multimedia multiview rank learning model, called Multi2Rank, to overcome the challenges of this unique ranking problem. The first layer of our model uses natural language processing techniques to identify view specific phrases and output a ranked mapping of the phrases into their respective views. Next, we model the individual feature space for each multimedia view and create a view specific model using gradient boosted regression trees. The ranked set from each unique view is then passed to the final layer of the hierarchy, where the model generates a final ranked result set. We show that our multiview rank learning approach is effective by evaluating the methods using a large Internet video repository, queries, and ground truth, from the TRECVid evaluations.
  • Keywords
    learning (artificial intelligence); multimedia computing; natural language processing; query processing; Multi2Rank; TRECVid evaluations; feature space; hierarchical multimedia multiview rank learning model; large Internet video repository; multimedia multiview ranking; multimedia object; multimedia retrieval; natural language processing techniques; text only query; Electronic mail; Information retrieval; Multimedia communication; Regression tree analysis; Speech; Streaming media; Visualization; Information Search and Retrieval; Retrieval models; Search process;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Big Data (BigMM), 2015 IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4799-8687-3
  • Type

    conf

  • DOI
    10.1109/BigMM.2015.47
  • Filename
    7153859